PREFACE TO THE SECOND EDITION
L. D. Landau
NOTATION


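Electric field $\mathbf{E}$
Electric induction $\mathbf{D}$
Magnetic field $\mathbf{H}$
Magnetic induction $\mathbf{B}$
External electric field $\boldsymbol{\mathfrak{E}}$, magnitude $\mathfrak{E}$
External magnetic field $\boldsymbol{\mathfrak{H}}$, magnitude $\mathfrak{H}$
Dielectric polarization $\mathbf{P}$
Magnetization $\mathbf{M}$
Total electric moment of a body $\boldsymbol{\mathscr{P}}$
Total magnetic moment of a body $\boldsymbol{\mathscr{M}}$
Permittivity $\varepsilon$
Dielectric susceptibility $\kappa$
Magnetic permeability $\mu$
Magnetic susceptibility $\chi$
Current density $\mathbf{j}$
Conductivity $\sigma$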

Absolute temperature (in energy units) T 
Pressure P 
Volume V 


Thermodynamic quantities:

                                              per unit volume    for a body
entropy                                       $S$                $\mathscr{S}$
internal energy                               $U$                $\mathscr{U}$
free energy                                   $F$                $\mathscr{F}$
thermodynamic potential (Gibbs free energy)   $\Phi$

Chemical potential $\zeta$

A complex periodic time factor is always taken as $e^{-i\omega t}$.
Volume element $dV$ or $d^3x$; surface element $d\mathbf{f}$.

The summation convention always applies to three-dimensional (Latin) and two-dimensional
(Greek) suffixes occurring twice in vector and tensor expressions.
CHAPTER I


ELECTROSTATICS OF CONDUCTORS 


§1. The electrostatic field of conductors 

Macroscopic electrodynamics is concerned with the study of electromagnetic fields in 
space that is occupied by matter. Like all macroscopic theories, electrodynamics deals with 
physical quantities averaged over elements of volume which are “physically infinitesimal”, 
ignoring the microscopic variations of the quantities which result from the molecular 
structure of matter. For example, instead of the actual “microscopic” value of the electric 
field e, we discuss its averaged value, denoted by E: 

$$\bar{\mathbf{e}} = \mathbf{E}. \tag{1.1}$$

The fundamental equations of the electrodynamics of continuous media are obtained by 
averaging the equations for the electromagnetic field in a vacuum. This method of 
obtaining the macroscopic equations from the microscopic was first used by H. A. Lorentz 
(1902). 

The form of the equations of macroscopic electrodynamics and the significance of the 
quantities appearing in them depend essentially on the physical nature of the medium, and 
on the way in which the field varies with time. It is therefore reasonable to derive and 
investigate these equations separately for each type of physical object. 

It is well known that all bodies can be divided, as regards their electric properties, into 
two classes, conductors and dielectrics, differing in that any electric field causes in a 
conductor, but not in a dielectric, the motion of charges, i.e. an electric current.†

Let us begin by studying the static electric fields produced by charged conductors, that is, 
the electrostatics of conductors. First of all, it follows from the fundamental property of 
conductors that, in the electrostatic case, the electric field inside a conductor must be zero. 
For a field E which was not zero would cause a current; the propagation of a current in a 
conductor involves a dissipation of energy, and hence cannot occur in a stationary state 
(with no external sources of energy). 

Hence it follows, in turn, that any charges in a conductor must be located on its surface. 
The presence of charges inside a conductor would necessarily cause an electric field in it;‡
they can be distributed on its surface, however, in such a way that the fields which they 
produce in its interior are mutually balanced. 

Thus the problem of the electrostatics of conductors amounts to determining the electric 
field in the vacuum outside the conductors and the distribution of charges on their surfaces. 

† The conductor is here assumed to be homogeneous (in composition, temperature, etc.). In an
inhomogeneous conductor, as we shall see later, there may be fields which cause no motion of charges.
‡ This is clearly seen from equation (1.8) below.

At any point far from the surface of the body, the mean field $\mathbf{E}$ in the vacuum is almost
the same as the actual field $\mathbf{e}$. The two fields differ only in the immediate neighbourhood of
the body, where the effect of the irregular molecular fields is noticeable, and this difference
does not affect the averaged field equations. The exact microscopic Maxwell's equations in
the vacuum are

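$$\operatorname{div}\mathbf{e} = 0, \tag{1.2}$$

$$\operatorname{curl}\mathbf{e} = -\frac{1}{c}\frac{\partial\mathbf{h}}{\partial t}, \tag{1.3}$$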

where h is the microscopic magnetic field. Since the mean magnetic field is assumed to be 
zero, the derivative $\partial\mathbf{h}/\partial t$ also vanishes on averaging, and we find that the static electric field
in the vacuum satisfies the usual equations 

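$$\operatorname{div}\mathbf{E} = 0, \qquad \operatorname{curl}\mathbf{E} = 0, \tag{1.4}$$

i.e. it is a potential field with a potential $\phi$ such that

$$\mathbf{E} = -\operatorname{grad}\phi, \tag{1.5}$$

and $\phi$ satisfies Laplace's equation

$$\Delta\phi = 0. \tag{1.6}$$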

The boundary conditions on the field E at the surface of a conductor follow from the 
equation curl E = 0, which, like the original equation (1.3), is valid both outside and inside 
the body. Let us take the z-axis in the direction of the normal n to the surface at some point 
on the conductor. The component $E_z$ of the field takes very large values in the immediate
neighbourhood of the surface (because there is a finite potential difference over a very small
distance). This large field pertains to the surface itself and depends on the physical
properties of the surface, but is not involved in our electrostatic problem, because it falls off
over distances comparable with the distances between atoms. It is important to note,
however, that, if the surface is homogeneous, the derivatives $\partial E_z/\partial x$, $\partial E_z/\partial y$ along the
surface remain finite, even though $E_z$ itself becomes very large. Hence, since
$(\operatorname{curl}\mathbf{E})_x = \partial E_z/\partial y - \partial E_y/\partial z = 0$, we find that $\partial E_y/\partial z$ is finite. This means that $E_y$ is
continuous at the surface, since a discontinuity in $E_y$ would mean an infinity of the
derivative $\partial E_y/\partial z$. The same applies to $E_x$, and since $\mathbf{E} = 0$ inside the conductor, we reach
the conclusion that the tangential components of the external field at the surface must be
zero:

$$\mathbf{E}_t = 0. \tag{1.7}$$

Thus the electrostatic field must be normal to the surface of the conductor at every point. 
Since $\mathbf{E} = -\operatorname{grad}\phi$, this means that the field potential must be constant on the surface of
any particular conductor. In other words, the surface of a homogeneous conductor is an 
equipotential surface of the electrostatic field. 

The component of the field normal to the surface is very simply related to the charge
density on the surface. The relation is obtained from the general electrostatic equation
$\operatorname{div}\mathbf{e} = 4\pi\rho$, which on averaging gives

$$\operatorname{div}\mathbf{E} = 4\pi\bar{\rho}, \tag{1.8}$$

$\bar{\rho}$ being the mean charge density. The meaning of the integrated form of this equation is
well known: the flux of the electric field through a closed surface is equal to the total charge
inside that surface, multiplied by $4\pi$. Applying this theorem to a volume element lying
between two infinitesimally close unit areas, one on each side of the surface of the
conductor, and using the fact that $\mathbf{E} = 0$ on the inner area, we find that $E_n = 4\pi\sigma$, where $\sigma$ is
the surface charge density, i.e. the charge per unit area of the surface of the conductor. Thus
the distribution of charges over the surface of the conductor is given by the formula

$$4\pi\sigma = E_n = -\partial\phi/\partial n, \tag{1.9}$$

the derivative of the potential being taken along the outward normal to the surface. The
total charge on the conductor is

$$e = -\frac{1}{4\pi}\oint\frac{\partial\phi}{\partial n}\,df, \tag{1.10}$$

the integral being taken over the whole surface.

The potential distribution in the electrostatic field has the following remarkable
property: the function $\phi(x, y, z)$ can take maximum and minimum values only at
boundaries of regions where there is a field. This theorem can also be formulated thus: a
test charge $e$ introduced into the field cannot be in stable equilibrium, since there is no point
at which its potential energy $e\phi$ would have a minimum.

The proof of the theorem is very simple. Let us suppose, for example, that the potential
has a maximum at some point $A$ not on the boundary of a region where there is a field.
Then the point $A$ can be surrounded by a small closed surface on which the normal
derivative $\partial\phi/\partial n < 0$ everywhere. Consequently, the integral over this surface
$\oint(\partial\phi/\partial n)\,df < 0$. But by Laplace's equation $\oint(\partial\phi/\partial n)\,df = \int\Delta\phi\,dV = 0$, giving a
contradiction.
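A closely related fact can be checked numerically: the average of a solution of Laplace's equation over a sphere containing no charges equals its value at the centre, so the centre cannot be a maximum or minimum. The following is only a sketch; the charge positions, the sphere and the quadrature grid are hypothetical choices made for illustration.

```python
import math

def potential(point, charges):
    """Coulomb potential (Gaussian units) of point charges at `point`."""
    return sum(q / math.dist(point, pos) for q, pos in charges)

def sphere_average(center, radius, charges, n_theta=60, n_phi=120):
    """Average of the potential over a sphere (midpoint rule in theta and phi)."""
    total = 0.0
    weight = 0.0
    for i in range(n_theta):
        theta = math.pi * (i + 0.5) / n_theta
        w = math.sin(theta)  # area weight of the latitude band
        for j in range(n_phi):
            phi = 2 * math.pi * (j + 0.5) / n_phi
            p = (center[0] + radius * math.sin(theta) * math.cos(phi),
                 center[1] + radius * math.sin(theta) * math.sin(phi),
                 center[2] + radius * math.cos(theta))
            total += w * potential(p, charges)
            weight += w
    return total / weight

# Illustrative charges (value, position), all lying outside the test sphere.
charges = [(1.0, (2.0, 0.0, 0.0)), (-0.5, (0.0, -3.0, 1.0))]
center = (0.0, 0.0, 0.0)
phi_center = potential(center, charges)
phi_mean = sphere_average(center, 0.5, charges)
```

Under these assumptions `phi_mean` and `phi_center` agree to within the quadrature error, so the potential on the sphere takes values both above and below the central value.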


§2. The energy of the electrostatic field of conductors 

Let us calculate the total energy $\mathscr{U}$ of the electrostatic field of charged conductors,†

$$\mathscr{U} = \frac{1}{8\pi}\int E^2\,dV, \tag{2.1}$$


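where the integral is taken over all space outside the conductors. We transform this integral
as follows:

$$\mathscr{U} = -\frac{1}{8\pi}\int\mathbf{E}\cdot\operatorname{grad}\phi\,dV = -\frac{1}{8\pi}\int\operatorname{div}(\phi\mathbf{E})\,dV + \frac{1}{8\pi}\int\phi\operatorname{div}\mathbf{E}\,dV.$$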

The second integral vanishes by (1.4), and the first can be transformed into integrals over
the surfaces of the conductors which bound the field and an integral over an infinitely
remote surface. The latter vanishes, because the field diminishes sufficiently rapidly at
infinity (the arbitrary constant in $\phi$ is assumed to be chosen so that $\phi = 0$ at infinity).
Denoting by $\phi_a$ the constant value of the potential on the $a$th conductor, we have‡

$$\mathscr{U} = \frac{1}{8\pi}\sum_a\oint\phi E_n\,df = \frac{1}{8\pi}\sum_a\phi_a\oint E_n\,df.$$

† The square $E^2$ is not the same as the mean square $\overline{e^2}$ of the actual field near the surface of a conductor or
inside it (where $\mathbf{E} = 0$ but, of course, $\overline{e^2} \neq 0$). By calculating the integral (2.1) we ignore the internal energy of the
conductor as such, which is here of no interest, and the affinity of the charges for the surface.

‡ In transforming volume integrals into surface integrals, both here and later, it must be borne in mind that $E_n$
is the component of the field along the outward normal to the conductor. This direction is opposite to that of the
outward normal to the region of the volume integration, namely the space outside the conductors. The sign of the
integral is therefore changed in the transformation.

Finally, since the total charges $e_a$ on the conductors are given by (1.10) we obtain

$$\mathscr{U} = \tfrac{1}{2}\sum_a e_a\phi_a, \tag{2.2}$$

which is analogous to the expression for the energy of a system of point charges. 

The charges and potentials of the conductors cannot both be arbitrarily prescribed; 
there are certain relations between them. Since the field equations in a vacuum are linear 
and homogeneous, these relations must also be linear, i.e. they must be given by equations 
of the form 

$$e_a = \sum_b C_{ab}\phi_b, \tag{2.3}$$

where the quantities $C_{aa}$, $C_{ab}$ have the dimensions of length and depend on the shape and
relative position of the conductors. The quantities $C_{aa}$ are called coefficients of capacity,
and the quantities $C_{ab}$ ($a \neq b$) are called coefficients of electrostatic induction. In particular,
if there is only one conductor, we have $e = C\phi$, where $C$ is the capacitance, which in order
of magnitude is equal to the linear dimension of the body. The converse relations, giving
the potentials in terms of the charges, are

$$\phi_a = \sum_b C^{-1}_{ab}e_b, \tag{2.4}$$

where the coefficients $C^{-1}_{ab}$ form a matrix which is the inverse of the matrix $C_{ab}$.

Let us calculate the change in the energy of a system of conductors caused by an
infinitesimal change in their charges or potentials. Varying the original expression (2.1), we
have $\delta\mathscr{U} = (1/4\pi)\int\mathbf{E}\cdot\delta\mathbf{E}\,dV$. This can be further transformed by two equivalent
methods. Putting $\mathbf{E} = -\operatorname{grad}\phi$ and using the fact that the varied field, like the original
field, satisfies equations (1.4) (so that $\operatorname{div}\delta\mathbf{E} = 0$), we can write

$$\delta\mathscr{U} = -\frac{1}{4\pi}\int\operatorname{grad}\phi\cdot\delta\mathbf{E}\,dV = -\frac{1}{4\pi}\int\operatorname{div}(\phi\,\delta\mathbf{E})\,dV = \frac{1}{4\pi}\sum_a\phi_a\oint\delta E_n\,df,$$

that is

$$\delta\mathscr{U} = \sum_a\phi_a\,\delta e_a, \tag{2.5}$$

which gives the change in energy due to a change in the charges. This result is obvious: it is
the work required to bring infinitesimal charges $\delta e_a$ to the various conductors from
infinity, where the field potential is zero.

On the other hand, we can write 

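$$\delta\mathscr{U} = -\frac{1}{4\pi}\int\mathbf{E}\cdot\operatorname{grad}\delta\phi\,dV = -\frac{1}{4\pi}\int\operatorname{div}(\mathbf{E}\,\delta\phi)\,dV = \frac{1}{4\pi}\sum_a\delta\phi_a\oint E_n\,df,$$

that is

$$\delta\mathscr{U} = \sum_a e_a\,\delta\phi_a, \tag{2.6}$$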

which expresses the change in energy in terms of the change in the potentials of the 
conductors. 

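Formulae (2.5) and (2.6) show that, by differentiating the energy $\mathscr{U}$ with respect to the
charges, we obtain the potentials of the conductors, and the derivatives of $\mathscr{U}$ with respect
to the potentials are the charges:

$$\partial\mathscr{U}/\partial e_a = \phi_a, \qquad \partial\mathscr{U}/\partial\phi_a = e_a. \tag{2.7}$$

But the potentials and charges are linear functions of each other. Using (2.3) we have
$\partial^2\mathscr{U}/\partial\phi_a\partial\phi_b = \partial e_b/\partial\phi_a = C_{ba}$, and by reversing the order of differentiation we get $C_{ab}$.
Hence it follows that

$$C_{ab} = C_{ba}, \tag{2.8}$$

and similarly $C^{-1}_{ab} = C^{-1}_{ba}$. The energy $\mathscr{U}$ can be written as a quadratic form in either the
potentials or the charges:

$$\mathscr{U} = \tfrac{1}{2}\sum_{a,b}C_{ab}\phi_a\phi_b = \tfrac{1}{2}\sum_{a,b}C^{-1}_{ab}e_ae_b. \tag{2.9}$$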
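As a small consistency check, the three expressions for the energy, (2.2) and the two quadratic forms in (2.9), can be compared for a system of two conductors. The numerical coefficients below are hypothetical values chosen only to be symmetric, as (2.8) requires; they do not describe any particular geometry.

```python
def invert_2x2(m):
    """Inverse of a 2x2 matrix given as nested lists."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

# Hypothetical coefficients C_ab (dimensions of length), symmetric as in (2.8).
C = [[2.0, -0.5], [-0.5, 1.5]]
C_inv = invert_2x2(C)

phi = [3.0, -1.0]  # prescribed potentials of the two conductors
# Charges from the linear relations (2.3).
e = [sum(C[a][b] * phi[b] for b in range(2)) for a in range(2)]

# Energy from (2.2) and from the two quadratic forms in (2.9).
U_sum = 0.5 * sum(e[a] * phi[a] for a in range(2))
U_phi = 0.5 * sum(C[a][b] * phi[a] * phi[b] for a in range(2) for b in range(2))
U_e = 0.5 * sum(C_inv[a][b] * e[a] * e[b] for a in range(2) for b in range(2))
```

With these inputs all three values coincide, as they must for any symmetric matrix of coefficients.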

This quadratic form must be positive definite, like the original expression (2.1). From 
this condition we can derive various inequalities which the coefficients must satisfy. In 
particular, all the coefficients of capacity are positive: 

$$C_{aa} > 0 \tag{2.10}$$

(and also $C^{-1}_{aa} > 0$).†

All the coefficients of electrostatic induction, on the other hand, are negative:

$$C_{ab} < 0 \quad (a \neq b). \tag{2.11}$$

That this must be so is seen from the following simple arguments. Let us suppose that
every conductor except the $a$th is earthed, i.e. their potentials are zero. Then the charge
induced by the charged $a$th conductor on another (the $b$th, say) is $e_b = C_{ba}\phi_a$. It is obvious
that the sign of the induced charge must be opposite to that of the inducing potential, and
therefore $C_{ab} < 0$. This can be more rigorously shown from the fact that the potential of the
electrostatic field cannot reach a maximum or minimum outside the conductors. For
example, let the potential $\phi_a$ of the only conductor not earthed be positive. Then the
potential is positive in all space, its least value (zero) being attained only on the earthed
conductors. Hence it follows that the normal derivative $\partial\phi/\partial n$ of the potential on the
surfaces of these conductors is positive, and their charges are therefore negative, by (1.10).
Similar arguments show that $C^{-1}_{ab} > 0$.

† We may also mention that another inequality which must be satisfied if the form (2.9) is
positive is $C_{aa}C_{bb} > C_{ab}^2$.

The energy of the electrostatic field of conductors has a certain extremum property,
though this property is more formal than physical. To derive it, let us suppose that the
charge distribution on the conductors undergoes an infinitesimal change (the total charge
on each conductor remaining unaltered), in which the charges may penetrate into the
conductors; we ignore the fact that such a charge distribution cannot in reality be stationary.
We consider the change in the integral $\mathscr{U} = (1/8\pi)\int E^2\,dV$, which must now be extended
over all space, including the volumes of the conductors themselves (since after the
displacement of the charges the field $\mathbf{E}$ may not be zero inside the conductors). We write

$$\delta\mathscr{U} = -\frac{1}{4\pi}\int\operatorname{grad}\phi\cdot\delta\mathbf{E}\,dV = -\frac{1}{4\pi}\int\operatorname{div}(\phi\,\delta\mathbf{E})\,dV + \frac{1}{4\pi}\int\phi\operatorname{div}\delta\mathbf{E}\,dV.$$

The first integral vanishes, being equivalent to one over an infinitely remote surface. In the
second integral, we have by (1.8) $\operatorname{div}\delta\mathbf{E} = 4\pi\,\delta\bar{\rho}$, so that $\delta\mathscr{U} = \int\phi\,\delta\bar{\rho}\,dV$. This integral
vanishes if $\phi$ is the potential of the true electrostatic field, since then $\phi$ is constant inside
each conductor, and the integral $\int\delta\bar{\rho}\,dV$ over the volume of each conductor is zero, since
its total charge remains unaltered.

Thus the energy of the actual electrostatic field is a minimum† relative to the energies of
fields which could be produced by any other distribution of the charges on or in the
conductors (Thomson's theorem).

From this theorem it follows, in particular, that the introduction of an uncharged 
conductor into the field of given charges (charged conductors) reduces the total energy of 
the field. To prove this, it is sufficient to compare the energy of the actual field resulting 
from the introduction of the uncharged conductor with the energy of the fictitious field in 
which there are no induced charges on that conductor. The former energy, since it has the 
least possible value, is less than the latter energy, which is also the energy of the original 
field (since, in the absence of induced charges, the field would penetrate into the conductor, 
and remain unaltered). This result can also be formulated thus: an uncharged conductor 
remote from a system of given charges is attracted towards the system. 

Finally, it can be shown that a conductor (charged or not) brought into an electrostatic 
field cannot be in stable equilibrium under electric forces alone. This assertion generalizes 
the theorem for a point charge proved at the end of §1, and can be derived by combining the 
latter theorem with Thomson’s theorem. We shall not pause to give the derivation in detail. 

Formulae (2.9) are useful for calculating the energy of a system of conductors at finite
distances apart. The energy of an uncharged conductor in a uniform external field $\boldsymbol{\mathfrak{E}}$, which
may be imagined as due to charges at infinity, requires special consideration. According to
(2.2), this energy is $\mathscr{U} = \tfrac{1}{2}e\phi$, where $e$ is the remote charge which causes the field, and $\phi$ is
the potential at this charge due to the conductor. $\mathscr{U}$ does not include the energy of the
charge $e$ in its own field, since we are interested only in the energy of the conductor. The
charge on the conductor is zero, but the external field causes it to acquire an electric dipole
moment, which we denote by $\boldsymbol{\mathscr{P}}$. The potential of the electric dipole field at a large distance
$r$ from it is $\phi = \boldsymbol{\mathscr{P}}\cdot\mathbf{r}/r^3$. Hence $\mathscr{U} = e\boldsymbol{\mathscr{P}}\cdot\mathbf{r}/2r^3$. But $-e\mathbf{r}/r^3$ is just the field $\boldsymbol{\mathfrak{E}}$ due to the
charge $e$. Thus

$$\mathscr{U} = -\tfrac{1}{2}\boldsymbol{\mathscr{P}}\cdot\boldsymbol{\mathfrak{E}}. \tag{2.12}$$

† We shall not give here the simple arguments which demonstrate that the extremum is a minimum.

Since all the field equations are linear, it is evident that the components of the dipole
moment $\boldsymbol{\mathscr{P}}$ are linear functions of the components of the field $\boldsymbol{\mathfrak{E}}$. The coefficients of
proportionality between $\boldsymbol{\mathscr{P}}$ and $\boldsymbol{\mathfrak{E}}$ have the dimensions of length cubed, and are therefore
proportional to the volume of the conductor:

$$\mathscr{P}_i = V\alpha_{ik}\mathfrak{E}_k, \tag{2.13}$$

where the coefficients $\alpha_{ik}$ depend only on the shape of the body. The quantities $V\alpha_{ik}$ form
a tensor, which may be called the polarizability tensor of the body. This tensor is
symmetrical: $\alpha_{ik} = \alpha_{ki}$, a statement which will be proved in §11. Accordingly, the energy
(2.12) is

$$\mathscr{U} = -\tfrac{1}{2}V\alpha_{ik}\mathfrak{E}_i\mathfrak{E}_k. \tag{2.14}$$


PROBLEMS 


Problem 1. Express the mutual capacitance $C$ of two conductors (with charges $\pm e$) in terms of the
coefficients $C_{ab}$.

Solution. The mutual capacitance of two conductors is defined as the coefficient $C$ in the relation
$e = C(\phi_2 - \phi_1)$, and the energy of the system is given in terms of $C$ by $\mathscr{U} = \tfrac{1}{2}e^2/C$. Comparing with (2.9),
we obtain

$$1/C = C^{-1}_{11} - 2C^{-1}_{12} + C^{-1}_{22} = (C_{11} + 2C_{12} + C_{22})/(C_{11}C_{22} - C_{12}^2).$$
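The closed form for the mutual capacitance can be compared with a direct evaluation from the definition $e = C(\phi_2 - \phi_1)$. This is a sketch under the assumption of hypothetical coefficient values; the function names are illustrative and not taken from the text.

```python
def mutual_capacitance(C11, C12, C22):
    """Mutual capacitance expressed through the coefficients C_ab."""
    return (C11 * C22 - C12 ** 2) / (C11 + 2 * C12 + C22)

def mutual_capacitance_direct(C11, C12, C22, e=1.0):
    """Same quantity from e = C (phi_2 - phi_1), with charges -e and +e."""
    det = C11 * C22 - C12 ** 2
    # Elements of the inverse of the symmetric 2x2 matrix C_ab.
    inv11, inv12, inv22 = C22 / det, -C12 / det, C11 / det
    phi1 = inv11 * (-e) + inv12 * e
    phi2 = inv12 * (-e) + inv22 * e
    return e / (phi2 - phi1)

# Hypothetical coefficients: positive diagonal, negative off-diagonal.
C_formula = mutual_capacitance(2.0, -0.5, 1.5)
C_direct = mutual_capacitance_direct(2.0, -0.5, 1.5)
```

Both routes give the same value for these inputs, which is a useful guard against sign errors in the off-diagonal term.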

Problem 2. A point charge $e$ is situated at $O$, near a system of earthed conductors, and induces on them
charges $e_a$. If the charge $e$ were absent, and the $a$th conductor were at potential $\phi'_a$, the remainder being earthed,
the field potential at $O$ would be $\phi'_0$. Express the charges $e_a$ in terms of $\phi'_a$ and $\phi'_0$.

Solution. If charges $e_a$ on the conductors give them potentials $\phi_a$, and similarly for $e'_a$ and $\phi'_a$, it follows
from (2.3) that

$$\sum_a\phi_ae'_a = \sum_{a,b}\phi_aC_{ab}\phi'_b = \sum_a\phi'_ae_a.$$

We apply this relation to two states of the system formed by all the conductors and the charge $e$ (regarding the
latter as a very small conductor). In one state the charge $e$ is present, the charges on the conductors are $e_a$, and their
potentials are zero. In the other state the charge $e$ is zero, and one of the conductors has a potential $\phi'_a \neq 0$. Then
we have $e\phi'_0 + e_a\phi'_a = 0$, whence $e_a = -e\phi'_0/\phi'_a$.

For example, if a charge $e$ is at a distance $r$ from the centre of an earthed conducting sphere with radius $a$ ($< r$),
then $\phi'_0 = \phi'_aa/r$, and the charge induced on the sphere is $e_a = -ea/r$.

As a second example, let us consider a charge $e$ placed between two concentric conducting spheres with radii $a$
and $b$, at a distance $r$ from the centre such that $a < r < b$. If the outer sphere is earthed and the inner one is
charged to potential $\phi'_a$, the potential at distance $r$ is

$$\phi'_0 = \phi'_a\,\frac{1/r - 1/b}{1/a - 1/b}.$$

Hence the charge induced on the inner sphere by the charge $e$ is $e_a = -ea(b - r)/r(b - a)$. Similarly the charge
induced on the outer sphere is $e_b = -eb(r - a)/r(b - a)$.
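The two results for concentric spheres can be checked for consistency: the induced charges should add up to $-e$, and for a very large outer sphere the charge on the inner one should approach the single-sphere value $-ea/r$. The numerical values below are hypothetical.

```python
def induced_on_sphere(e, a, r):
    """Charge induced on an earthed sphere of radius a by a charge e at distance r."""
    return -e * a / r

def induced_between_spheres(e, a, b, r):
    """Charges induced on the inner (radius a) and outer (radius b) spheres, a < r < b."""
    inner = -e * a * (b - r) / (r * (b - a))
    outer = -e * b * (r - a) / (r * (b - a))
    return inner, outer

inner, outer = induced_between_spheres(e=1.0, a=1.0, b=3.0, r=2.0)
total = inner + outer

# With the outer sphere moved very far away, the inner charge tends to -e*a/r.
inner_far, _ = induced_between_spheres(e=1.0, a=1.0, b=1.0e9, r=2.0)
single = induced_on_sphere(e=1.0, a=1.0, r=2.0)
```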


Problem 3. Two conductors, with capacitances $C_1$ and $C_2$, are placed at a distance $r$ apart which is large
compared with their dimensions. Determine the coefficients $C_{ab}$.

Solution. If conductor 1 has a charge $e_1$, and conductor 2 is uncharged, then in the first approximation
$\phi_1 = e_1/C_1$, $\phi_2 = e_1/r$; here we neglect the variation of the field over conductor 2 and its polarization. Thus
$C^{-1}_{11} = 1/C_1$, $C^{-1}_{12} = 1/r$, and similarly $C^{-1}_{22} = 1/C_2$. Hence we find†

$$C_{11} = C_1\left(1 + \frac{C_1C_2}{r^2}\right), \qquad C_{12} = -\frac{C_1C_2}{r}, \qquad C_{22} = C_2\left(1 + \frac{C_1C_2}{r^2}\right).$$

† The subsequent terms in the expansion are in general of order (in $1/r$) one higher than those given. If,
however, $r$ is taken as the distance between the "centres of charge" of the two bodies (for spheres, between the
geometrical centres), then the order of the subsequent terms is two higher.
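To see how good the expansion is, one can invert the approximate matrix $C^{-1}_{ab}$ exactly and compare with the truncated expressions. This is a sketch with hypothetical values of $C_1$, $C_2$ and $r$; the "exact" inverse is exact only within the first approximation for $C^{-1}_{ab}$ used in the solution.

```python
def exact_coefficients(C1, C2, r):
    """Invert the matrix with elements 1/C1, 1/r, 1/C2 and return (C11, C12, C22)."""
    i11, i12, i22 = 1.0 / C1, 1.0 / r, 1.0 / C2
    det = i11 * i22 - i12 ** 2
    return i22 / det, -i12 / det, i11 / det

def approximate_coefficients(C1, C2, r):
    """Leading terms of the expansion in powers of 1/r."""
    x = C1 * C2 / r ** 2
    return C1 * (1 + x), -C1 * C2 / r, C2 * (1 + x)

# Hypothetical values with r large compared with C1 and C2.
exact = exact_coefficients(1.0, 2.0, 50.0)
approx = approximate_coefficients(1.0, 2.0, 50.0)
differences = [abs(x - y) for x, y in zip(exact, approx)]
```

For these inputs the differences are small compared with the retained terms, and the signs agree with (2.10) and (2.11).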
Problem 4. Determine the capacitance of a ring (radius $b$) of thin conducting wire of circular cross-section
(radius $a \ll b$).

Solution. Since the wire is thin, the field at the surface of the ring is almost the same as that of charges
distributed along the axis of the wire (for a right cylinder, it would be exactly the same). Hence the potential of the
ring is

$$\phi_a = \frac{e}{2\pi b}\oint\frac{dl}{r},$$

where $r$ is the distance from a point on the surface of the ring to an element $dl$ of the axis of the wire, the
integration being over all such elements. We divide the integral into two parts corresponding to $r < \Delta$ and $r > \Delta$,
$\Delta$ being a distance such that $a \ll \Delta \ll b$. Then for $r < \Delta$ the segment of the ring concerned may be regarded as
straight, and therefore

$$\int_{r<\Delta}\frac{dl}{r} = \int_{-\Delta}^{\Delta}\frac{dl}{\sqrt{l^2 + a^2}} \cong 2\log(2\Delta/a).$$

In the range $r > \Delta$ the thickness of the wire may be neglected, i.e. $r$ may be taken as the distance between two
points on its axis. Then

$$\int_{r>\Delta}\frac{dl}{r} = 2\int_{\phi_0}^{\pi}\frac{b\,d\phi}{2b\sin\tfrac{1}{2}\phi} = -2\log\tan\tfrac{1}{4}\phi_0,$$

where $\phi$ is the angle subtended at the centre of the ring by the chord $r$, and the lower limit of integration is such
that $2b\sin\tfrac{1}{2}\phi_0 = \Delta$, whence $\phi_0 \cong \Delta/b$. When the two parts of the integral are added, $\Delta$ cancels, and the
capacitance of the ring is

$$C = e/\phi_a = \frac{\pi b}{\log(8b/a)}.$$
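The logarithmic result can be compared with a direct numerical evaluation of the integral of $dl/r$ around the axis of the wire. The sketch below assumes, for illustration, that the surface point is displaced from the axis by $a$ perpendicular to the plane of the ring, so that its distance from an axis element at angle $\psi$ is $\sqrt{4b^2\sin^2\tfrac{1}{2}\psi + a^2}$; the values of $a$ and $b$ are hypothetical.

```python
import math

def axis_integral(a, b, n=20000):
    """Midpoint-rule value of the integral of dl/r around the ring axis."""
    h = 2 * math.pi / n
    total = 0.0
    for i in range(n):
        psi = (i + 0.5) * h
        chord = 2 * b * math.sin(psi / 2)  # distance between two points of the axis
        total += b * h / math.sqrt(chord * chord + a * a)
    return total

def ring_capacitance(a, b):
    """Capacitance of a thin ring, valid for a much smaller than b."""
    return math.pi * b / math.log(8 * b / a)

a, b = 0.01, 1.0
numeric = axis_integral(a, b)
asymptotic = 2 * math.log(8 * b / a)
# e / phi with phi = (e / 2 pi b) * integral.
C_numeric = 2 * math.pi * b / numeric
```

For $a/b = 0.01$ the numerical integral and $2\log(8b/a)$ agree closely, the remaining difference being of higher order in $a/b$.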


§3. Methods of solving problems in electrostatics 

The general methods of solving Laplace's equation for given boundary conditions on
certain surfaces are studied in mathematical physics, and we shall not give a detailed
description of them here. We shall merely mention some of the more elementary
procedures and solve various problems of intrinsic interest.†


(1) The method of images. The simplest example of the use of this method is to determine
the field due to a point charge $e$ outside a conducting medium which occupies a half-space.
The principle of the method is to find fictitious point charges which, together with the given
charge or charges, produce a field such that the surface of the conductor is an equipotential
surface. In the case just mentioned, this is achieved by placing a fictitious charge $e' = -e$ at
a point which is the image of $e$ in the plane which bounds the conducting medium. The
potential of the field due to the charge $e$ and its image $e'$ is

$$\phi = e\left(\frac{1}{r} - \frac{1}{r'}\right), \tag{3.1}$$

† The solutions of many more complex problems are given by W. R. Smythe, Static and Dynamic Electricity,
3rd ed., McGraw-Hill, New York, 1968; G. A. Grinberg, Selected Problems in the Mathematical Theory of Electric
and Magnetic Phenomena (Izbrannye voprosy matematicheskoĭ teorii elektricheskikh i magnitnykh yavleniĭ),
Moscow, 1948.
where $r$ and $r'$ are the distances of a point from the charges $e$ and $e'$. On the bounding plane,
$r = r'$ and the potential has the constant value zero, so that the necessary boundary
condition is satisfied and (3.1) gives the solution of the problem. It may be noted that the
charge $e$ is attracted to the conductor by a force $e^2/(2a)^2$ (the image force; $a$ is the distance
of the charge from the conductor), and the energy of their interaction is $-e^2/4a$.

The distribution of surface charge induced on the bounding plane by the point charge $e$
is given by

$$\sigma = -\frac{e}{2\pi}\frac{a}{r^3}, \tag{3.2}$$

$r$ being here the distance of the point on the plane from the charge $e$.

It is easy to see that the total charge on the plane is $\int\sigma\,df = -e$, as it should be.
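The statement about the total induced charge can be checked by integrating the surface density over the plane. The sketch assumes the density $\sigma = -ea/2\pi r^3$ with $r^2 = \rho^2 + a^2$, where $\rho$ (a symbol introduced here) is the distance in the plane from the foot of the perpendicular; the substitution $\rho = a\tan u$ is only a convenient way of covering the infinite range.

```python
import math

def surface_density(e, a, rho):
    """Induced surface charge density at in-plane distance rho; r^2 = rho^2 + a^2."""
    return -e * a / (2 * math.pi * (rho ** 2 + a ** 2) ** 1.5)

def total_induced_charge(e, a, n=2000):
    """Integrate sigma over the whole plane using rho = a*tan(u), 0 < u < pi/2."""
    h = (math.pi / 2) / n
    total = 0.0
    for i in range(n):
        u = (i + 0.5) * h
        rho = a * math.tan(u)
        drho_du = a / math.cos(u) ** 2
        # Ring of radius rho and width d(rho) carries sigma * 2*pi*rho * d(rho).
        total += surface_density(e, a, rho) * 2 * math.pi * rho * drho_du * h
    return total

q = total_induced_charge(e=1.0, a=0.7)
```

The result is $-e$ to within the quadrature error, independently of the distance $a$.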

The total charge induced on an originally uncharged insulated conductor by other 
charges is, of course, zero. Hence, if in the present case the conducting medium (in reality a 
large conductor) is insulated, we must suppose that, besides the charge $-e$, a charge $+e$ is
also induced, which, however, has a vanishingly small density, being distributed over the 
large surface of the conductor. 

Next, let us consider a more difficult problem, that of the field due to a point charge $e$
near a spherical conductor. To solve this problem, we use the following result, which can
easily be proved by direct calculation. The potential of the field due to two point charges $e$
and $-e'$, namely $\phi = e/r - e'/r'$, vanishes on the surface of a sphere whose centre is on the
line joining the charges (but not between them). If the radius of the sphere is $R$ and its
centre is distant $l$ and $l'$ from the two charges, then $l/l' = (e/e')^2$, $R^2 = ll'$.

Let us first suppose that the spherical conductor is maintained at a constant potential
$\phi = 0$, i.e. it is earthed. Then the field outside the sphere due to the point charge $e$ at $A$
(Fig. 1), at a distance $l$ from the centre of the sphere, is the same as the field due to two
charges, namely the given charge $e$ and a fictitious charge $-e'$ at $A'$ inside the sphere, at a
distance $l'$ from its centre, where

$$l' = R^2/l, \qquad e' = eR/l. \tag{3.3}$$

The potential of this field is

$$\phi = \frac{e}{r} - \frac{eR}{lr'}, \tag{3.4}$$

$r$ and $r'$ being as shown in Fig. 1. A non-zero total charge $-e'$ is induced on the surface of
the sphere. The energy of the interaction between the charge and the sphere is

$$\mathscr{U} = -\tfrac{1}{2}ee'/(l - l') = -\tfrac{1}{2}e^2R/(l^2 - R^2), \tag{3.5}$$

and the charge is attracted to the sphere by a force $F = -\partial\mathscr{U}/\partial l = -e^2lR/(l^2 - R^2)^2$.
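The image construction for the earthed sphere lends itself to a quick numerical check: the potential of the pair of charges should vanish at every point of the sphere, and the force should equal minus the derivative of the interaction energy. The values of $e$, $l$ and $R$ below are hypothetical, and `theta` denotes the polar angle of a surface point measured from the line joining the centre to the charge.

```python
import math

def image_in_sphere(e, l, R):
    """Magnitude e' and distance l' of the image charge, formula (3.3)."""
    return e * R / l, R * R / l

def potential_on_sphere(e, l, R, theta):
    """Potential of the charge and its image at a point of the sphere's surface."""
    e_img, l_img = image_in_sphere(e, l, R)
    r = math.sqrt(R * R + l * l - 2 * R * l * math.cos(theta))
    r_img = math.sqrt(R * R + l_img * l_img - 2 * R * l_img * math.cos(theta))
    return e / r - e_img / r_img

def interaction_energy(e, l, R):
    """Energy of interaction between the charge and the earthed sphere."""
    return -0.5 * e * e * R / (l * l - R * R)

def force(e, l, R):
    """Force on the charge; negative values mean attraction towards the sphere."""
    return -e * e * l * R / (l * l - R * R) ** 2

e, l, R = 1.0, 3.0, 1.0
surface_values = [potential_on_sphere(e, l, R, k * math.pi / 6) for k in range(7)]
h = 1e-5
force_numeric = -(interaction_energy(e, l + h, R) - interaction_energy(e, l - h, R)) / (2 * h)
force_closed = force(e, l, R)
```

The same force follows from Coulomb's law applied to $e$ and $-e'$ at separation $l - l'$, which for these inputs gives $-3/64$.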

If the total charge on the spherical conductor is kept equal to zero (an insulated
uncharged sphere), a further fictitious charge must be introduced, such that the total charge
induced on the surface of the sphere is zero, and the potential on that surface is still
constant. This is done by placing a charge $+e'$ at the centre of the sphere. The potential of
the required field is then given by the formula

$$\phi = \frac{e}{r} - \frac{e'}{r'} + \frac{e'}{r_0}, \tag{3.6}$$

where $r_0$ is the distance from the centre of the sphere. The energy of interaction in this case is

$$\mathscr{U} = \tfrac{1}{2}ee'\left(\frac{1}{l} - \frac{1}{l - l'}\right) = -\frac{e^2R^3}{2l^2(l^2 - R^2)}. \tag{3.7}$$

Finally, if the charge $e$ is at $A'$ (Fig. 1) in a spherical cavity in a conducting medium, the
field inside the cavity must be the same as the field due to the charge $e$ at $A'$ and its image at
$A$ outside the sphere, regardless of whether the conductor is earthed or insulated:

$$\phi = \frac{e}{r'} - \frac{eR}{l'r}. \tag{3.8}$$


(2) The method of inversion. There is a simple method whereby in some cases a known 
solution of one electrostatic problem gives the solution of another problem. This method is 
based on the invariance of Laplace’s equation with respect to a certain transformation of 
the variables. 

In spherical polar coordinates Laplace's equation has the form

$$\frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\frac{\partial\phi}{\partial r}\right) + \frac{1}{r^2}\Delta_\Omega\phi = 0,$$

where $\Delta_\Omega$ denotes the angular part of the Laplacian operator. It is easy to see that this
equation is unaltered in form if the variable $r$ is replaced by a new variable $r'$ such that

$$r = R^2/r' \tag{3.9}$$

(the inversion transformation) and at the same time the unknown function $\phi$ is replaced by
$\phi'$ such that

$$\phi = r'\phi'/R. \tag{3.10}$$

Here $R$ is some constant having the dimensions of length (the radius of inversion). Thus, if
the function $\phi(\mathbf{r})$ satisfies Laplace's equation, then so does the function

$$\phi'(\mathbf{r}') = R\,\phi(R^2\mathbf{r}'/r'^2)/r'. \tag{3.11}$$

Let us assume that we know the electrostatic field due to some system of conductors, all
at the same potential $\phi_0$, and point charges. The potential $\phi(\mathbf{r})$ is usually defined so as to
vanish at infinity. Here, however, we shall define $\phi(\mathbf{r})$ so that it tends to $-\phi_0$ at infinity.
Then $\phi = 0$ on the conductors.
We may now ascertain what problem of electrostatics will be solved by the transformed
function (3.11). First of all, the shapes and relative positions of all the conductors of finite
size will be changed. The boundary condition of constant potential on their surfaces will be
automatically satisfied, since $\phi' = 0$ if $\phi = 0$. Furthermore, the positions and magnitudes
of all the point charges will be changed. A charge $e$ at a point $\mathbf{r}_0$ moves to $\mathbf{r}'_0 = R^2\mathbf{r}_0/r_0^2$
and takes a value $e'$ which can be determined as follows. As $\mathbf{r} \to \mathbf{r}_0$, the potential $\phi(\mathbf{r})$ tends
to infinity as $e/|\delta\mathbf{r}|$, where $\delta\mathbf{r} = \mathbf{r} - \mathbf{r}_0$. Differentiating the relation $\mathbf{r} = R^2\mathbf{r}'/r'^2$, we
find that the magnitudes of the small differences $\delta\mathbf{r}$ and $\delta\mathbf{r}' = \mathbf{r}' - \mathbf{r}'_0$ are related by
$(\delta r)^2 = R^4(\delta r')^2/r_0'^4$. Hence, as $\mathbf{r}' \to \mathbf{r}'_0$, the function $\phi'$ tends to infinity as
$eR/r'_0|\delta\mathbf{r}| = er'_0/R|\delta\mathbf{r}'|$, corresponding to a charge

$$e' = er'_0/R = eR/r_0. \tag{3.12}$$

Finally, let us examine the behaviour of the function $\phi'(\mathbf{r}')$ near the origin. For $\mathbf{r}' = 0$ we
have $r \to \infty$ and $\phi(\mathbf{r}) \to -\phi_0$. Hence, as $\mathbf{r}' \to 0$, the function $\phi'$ tends to infinity as
$-R\phi_0/r'$. This means that there is a charge $e_0 = -R\phi_0$ at the point $\mathbf{r}' = 0$.
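The rule (3.12) can be illustrated in the simplest possible case: a single point charge with no conductors, whose potential is taken to vanish at infinity (so that no extra charge appears at the origin). This is a deliberate simplification of the situation described above; the numerical values and helper names are hypothetical.

```python
import math

def norm(p):
    """Length of a 3-vector given as a tuple."""
    return math.sqrt(sum(c * c for c in p))

def invert_point(p, R):
    """Inversion applied to a position vector: p -> R^2 p / |p|^2."""
    s = R * R / sum(c * c for c in p)
    return tuple(s * c for c in p)

def transformed_potential(phi, R):
    """Return the function phi'(r') = R * phi(R^2 r'/r'^2) / r', formula (3.11)."""
    return lambda p: R * phi(invert_point(p, R)) / norm(p)

e, R = 2.0, 1.5
r0 = (0.0, 0.0, 4.0)
phi = lambda p: e / math.dist(p, r0)   # point charge, potential zero at infinity

r0_new = invert_point(r0, R)           # new position of the charge
e_new = e * R / norm(r0)               # new magnitude, formula (3.12)
phi_new = transformed_potential(phi, R)

samples = [(1.0, 0.5, -0.3), (-2.0, 1.0, 0.7), (0.2, -0.1, 3.0)]
errors = [abs(phi_new(p) - e_new / math.dist(p, r0_new)) for p in samples]
```

At the sample points the transformed function coincides with the Coulomb potential of the charge $e'$ placed at $\mathbf{r}'_0$, to rounding accuracy.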

We shall give, for reference, the way in which certain geometrical figures are transformed
by inversion. A spherical surface with radius $a$ and centre $\mathbf{r}_0$ is given by the
equation $(\mathbf{r} - \mathbf{r}_0)^2 = a^2$. On inversion, this becomes $([R^2\mathbf{r}'/r'^2] - \mathbf{r}_0)^2 = a^2$, which, on
multiplying by $r'^2$ and rearranging, can be written $(\mathbf{r}' - \mathbf{r}'_0)^2 = a'^2$, where

$$\mathbf{r}'_0 = -R^2\mathbf{r}_0/(a^2 - r_0^2), \qquad a' = aR^2/|a^2 - r_0^2|. \tag{3.13}$$

Thus we have another sphere, with radius $a'$ and centre $\mathbf{r}'_0$. If the original sphere passes
through the origin ($a = r_0$), then $a' = \infty$. In this case the sphere is transformed into a plane
perpendicular to the vector $\mathbf{r}_0$ and distant $r'_0 - a' = R^2/(a + r_0) = R^2/2a$ from the origin.
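Formula (3.13) can be verified by inverting a number of points of a sphere and measuring their distance from the predicted new centre. The sphere and the radius of inversion used below are hypothetical.

```python
import math

def invert_point(p, R):
    """Inversion applied to a position vector: p -> R^2 p / |p|^2."""
    s = R * R / sum(c * c for c in p)
    return tuple(s * c for c in p)

def inverted_sphere(r0, a, R):
    """Centre and radius of the image of a sphere (centre r0, radius a), formula (3.13)."""
    r0_sq = sum(c * c for c in r0)
    centre = tuple(-R * R * c / (a * a - r0_sq) for c in r0)
    radius = a * R * R / abs(a * a - r0_sq)
    return centre, radius

r0, a, R = (3.0, 0.0, 0.0), 1.0, 2.0
centre, radius = inverted_sphere(r0, a, R)

deviations = []
for i in range(1, 6):
    for j in range(8):
        th, ph = math.pi * i / 6, 2 * math.pi * j / 8
        # A point of the original sphere.
        p = (r0[0] + a * math.sin(th) * math.cos(ph),
             r0[1] + a * math.sin(th) * math.sin(ph),
             r0[2] + a * math.cos(th))
        deviations.append(abs(math.dist(invert_point(p, R), centre) - radius))
```

All inverted points lie on the sphere of radius $a' = 0.5$ about the point $(1.5, 0, 0)$, in agreement with (3.13) for these inputs.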

(3) The method of conformal mapping. A field which depends on only two Cartesian
coordinates ($x$ and $y$, say) is said to be two-dimensional. The theory of functions of a complex
variable is a powerful means of solving two-dimensional problems of electrostatics. The 
theoretical basis of the method is as follows. 

An electrostatic field in a vacuum satisfies two equations: $\operatorname{curl}\mathbf{E} = 0$, $\operatorname{div}\mathbf{E} = 0$. The first
of these makes it possible to introduce the field potential, defined by $\mathbf{E} = -\operatorname{grad}\phi$. The
second equation shows that we can also define a vector potential $\mathbf{A}$ of the field, such that
$\mathbf{E} = \operatorname{curl}\mathbf{A}$. In the two-dimensional case, the vector $\mathbf{E}$ lies in the $xy$-plane, and depends only
on $x$ and $y$. Accordingly, the vector $\mathbf{A}$ can be chosen so that it is perpendicular to the
$xy$-plane. Then the field components are given in terms of the derivatives of $\phi$ and $A$ by

$$E_x = -\partial\phi/\partial x = \partial A/\partial y, \qquad E_y = -\partial\phi/\partial y = -\partial A/\partial x. \tag{3.14}$$

These relations between the derivatives of $\phi$ and $A$ are, mathematically, just the well-known
Cauchy-Riemann conditions, which express the fact that the complex quantity

$$w = \phi - iA \tag{3.15}$$

is an analytic function of the complex argument $z = x + iy$. This means that the function
$w(z)$ has a definite derivative at every point, independent of the direction in which
the derivative is taken. For example, differentiating along the $x$-axis, we find
$dw/dz = \partial\phi/\partial x - i\,\partial A/\partial x$, or

$$dw/dz = -E_x + iE_y. \tag{3.16}$$

The function $w$ is called the complex potential.

The lines of force are defined by the equation $dx/E_x = dy/E_y$. Expressing $E_x$ and $E_y$ as
derivatives of $A$, we can write this as $(\partial A/\partial x)\,dx + (\partial A/\partial y)\,dy = dA = 0$, whence $A(x, y)$
= constant. Thus the lines on which the imaginary part of the function $w(z)$ is constant are
the lines of force. The lines on which its real part is constant are the equipotential lines. The
orthogonality of these families of lines is ensured by the relations (3.14), according to which

$$\frac{\partial\phi}{\partial x}\frac{\partial A}{\partial x} + \frac{\partial\phi}{\partial y}\frac{\partial A}{\partial y} = 0.$$

Both the real and the imaginary part of an analytic function $w(z)$ satisfy Laplace's
equation. We could therefore equally well take im $w$ as the field potential. The lines of force
would then be given by re $w$ = constant. Instead of (3.15) we should have $w = A + i\phi$.

The flux of the electric field through any section of an equipotential line is given by the
integral $\int E_n\,dl = -\int(\partial\phi/\partial n)\,dl$, where $dl$ is an element of length of the equipotential line
and n the direction of the normal to it. According to (3.14) we have $\partial\phi/\partial n = -\partial A/\partial l$, the
choice of sign denoting that $l$ is measured to the left when one looks along n. Thus
$\int E_n\,dl = \int(\partial A/\partial l)\,dl = A_2 - A_1$, where $A_2$ and $A_1$ are the values of A at the ends of the section. In
particular, since the flux of the electric field through a closed contour is $4\pi e$, where e is the
total charge enclosed by the contour (per unit length of conductors perpendicular to the
plane), it follows that

$$e = (1/4\pi)\,\Delta A, \tag{3.17}$$

where $\Delta A$ is the change in A on passing counterclockwise round the closed equipotential
line.

The simplest example of the complex potential is that of the field of a charged straight
wire passing through the origin and perpendicular to the plane. The field is given by
$E_r = 2e/r$, $E_\theta = 0$, where r, θ are polar coordinates in the xy-plane, and e is the charge per
unit length of the wire. The corresponding complex potential is

$$w = -2e\log z = -2e\log r - 2ie\theta. \tag{3.18}$$

If the charged wire passes through the point $(x_0, y_0)$ instead of the origin, the complex
potential is

$$w = -2e\log(z - z_0), \tag{3.19}$$

where $z_0 = x_0 + iy_0$.
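The relations (3.16) and (3.17) can be checked numerically on the complex potential (3.19) of a charged wire. The following is only an illustrative sketch: the function names, the finite-difference step and the sample point are assumptions made here, not part of the text. It recovers the field from $dw/dz$ and the charge per unit length from the change in $A = -\operatorname{im} w$ round a closed contour.

```python
import cmath
import math


def complex_potential(z, e=1.0, z0=0j):
    """Complex potential w = -2e log(z - z0) of a charged wire, formula (3.19)."""
    return -2.0 * e * cmath.log(z - z0)


def field_from_potential(z, e=1.0, z0=0j, h=1e-6):
    """Return (E_x, E_y) from dw/dz = -E_x + i E_y, formula (3.16).

    The derivative is taken along the x-axis by a central difference.
    """
    dw = (complex_potential(z + h, e, z0) - complex_potential(z - h, e, z0)) / (2.0 * h)
    return -dw.real, dw.imag


def enclosed_charge(e=1.0, z0=0j, radius=2.0, steps=2000):
    """Charge per unit length from e = (Delta A) / 4 pi, formula (3.17).

    A = -im w is followed counterclockwise round a circle about z0; the jump
    of the principal branch of the logarithm is removed step by step.
    """
    period = 4.0 * math.pi * e  # size of the branch-cut jump in A
    previous = -complex_potential(z0 + radius, e, z0).imag
    delta_a = 0.0
    for k in range(1, steps + 1):
        z = z0 + radius * cmath.exp(2j * math.pi * k / steps)
        current = -complex_potential(z, e, z0).imag
        step = current - previous
        step -= period * round(step / period)  # undo the artificial jump
        delta_a += step
        previous = current
    return delta_a / (4.0 * math.pi)


point = 1.5 + 2.0j
e_x, e_y = field_from_potential(point, e=0.7)
charge = enclosed_charge(e=0.7)
```

For the sample point the computed field should be radial with magnitude $2e/r$, and the contour integral should return the charge that was put in.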

Mathematically, the functional relation $w = w(z)$ constitutes a conformal mapping of the
plane of the complex variable z on the plane of the complex variable w. Let C be the
cross-sectional contour of a conductor in the xy-plane, and $\phi_0$ its potential. It is clear from the
above discussion that the problem of determining the field due to this conductor amounts
to finding a function $w(z)$ which maps the contour C in the z-plane on the line re $w = \phi_0$,
parallel to the axis of ordinates, in the w-plane. Then re w gives the potential of the field. (If
the function $w(z)$ maps the contour C on a line parallel to the axis of abscissae, then the
potential is im w.)


(4) The wedge problem. We shall give here, for reference, formulae for the field due to a
point charge e placed between two intersecting conducting half-planes. Let the z-axis of a
system of cylindrical polar coordinates (r, θ, z) be along the apex of the wedge, the angle θ
being measured from one of the planes, and let the position of the charge e be (a, γ, 0)
(Fig. 2). The angle α between the planes may be either less or greater than π; in the latter
case we have a charge outside a conducting wedge.

Fig. 2

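The field potential is given by

$$\phi = \frac{e}{\alpha\sqrt{2ar}} \int_\eta^\infty \left\{ \frac{\sinh(\pi\zeta/\alpha)}{\cosh(\pi\zeta/\alpha) - \cos[\pi(\theta-\gamma)/\alpha]} - \frac{\sinh(\pi\zeta/\alpha)}{\cosh(\pi\zeta/\alpha) - \cos[\pi(\theta+\gamma)/\alpha]} \right\} \frac{d\zeta}{\sqrt{\cosh\zeta - \cosh\eta}},$$
$$\cosh\eta = (a^2 + r^2 + z^2)/2ar, \qquad \eta > 0. \tag{3.20}$$

The potential φ = 0 on the surface of the conductors, i.e. for θ = 0 or α. This formula was
first given by H. M. Macdonald (1895)†.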

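In particular, for α = 2π we have a conducting half-plane in the field of a point charge. In
this case the integral in (3.20) can be evaluated explicitly, giving

$$\phi = \frac{e}{\pi}\left\{ \frac{1}{R}\cos^{-1}\!\left(\frac{-\cos\tfrac12(\theta-\gamma)}{\cosh\tfrac12\eta}\right) - \frac{1}{R'}\cos^{-1}\!\left(\frac{-\cos\tfrac12(\theta+\gamma)}{\cosh\tfrac12\eta}\right) \right\}, \tag{3.21}$$
$$R^2 = a^2 + r^2 + z^2 - 2ar\cos(\gamma - \theta), \qquad R'^2 = a^2 + r^2 + z^2 - 2ar\cos(\gamma + \theta).$$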

In the limit as the point (r, θ, z) tends to the position of the charge e, the potential (3.21)
becomes

$$\phi = \phi' + e/R, \quad\text{where}\quad \phi' = -\frac{e}{2\pi a}\left[1 + \frac{\pi-\gamma}{\sin\gamma}\right]. \tag{3.22}$$

The second term is just the Coulomb potential, which becomes infinite as $R \to 0$, while $\phi'$ is
the change caused by the conductor in the potential at the position of the charge. The
energy of the interaction between the charge and the conducting half-plane is

$$\mathcal{U} = \tfrac12 e\phi' = -\frac{e^2}{4\pi a}\left[1 + \frac{\pi-\gamma}{\sin\gamma}\right]. \tag{3.23}$$
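The limit (3.22) can be checked against the explicit half-plane potential (3.21). The sketch below is illustrative only: the function name, the sample values of $a$ and $\gamma$ and the small displacement are assumptions made here. It evaluates (3.21) close to the charge, subtracts the Coulomb term $e/R$, and compares the remainder with $\phi'$; it also checks that the potential vanishes on both faces of the half-plane.

```python
import math


def half_plane_potential(r, theta, z, a, gamma, e=1.0):
    """Potential (3.21) of a charge e at (a, gamma, 0) near a conducting half-plane.

    Returns the pair (phi, R), where R is the distance from the charge.
    """
    cosh_eta = (a * a + r * r + z * z) / (2.0 * a * r)
    cosh_half_eta = math.sqrt(0.5 * (cosh_eta + 1.0))
    big_r = math.sqrt(a * a + r * r + z * z - 2.0 * a * r * math.cos(gamma - theta))
    big_r_prime = math.sqrt(a * a + r * r + z * z - 2.0 * a * r * math.cos(gamma + theta))
    first = math.acos(-math.cos(0.5 * (theta - gamma)) / cosh_half_eta) / big_r
    second = math.acos(-math.cos(0.5 * (theta + gamma)) / cosh_half_eta) / big_r_prime
    return e / math.pi * (first - second), big_r


a, gamma = 1.0, 1.0
# Point displaced radially from the charge by a small amount (assumed value).
phi, dist = half_plane_potential(a + 1e-5, gamma, 0.0, a, gamma)
regular_part = phi - 1.0 / dist
phi_prime = -(1.0 / (2.0 * math.pi * a)) * (1.0 + (math.pi - gamma) / math.sin(gamma))
interaction_energy = 0.5 * phi_prime  # formula (3.23) with e = 1

face_0, _ = half_plane_potential(0.5, 0.0, 0.3, a, gamma)
face_2pi, _ = half_plane_potential(0.5, 2.0 * math.pi, 0.3, a, gamma)
```

The agreement is only to first order in the displacement, so the comparison tolerance should be chosen accordingly.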


† Its derivation is given by him in Electromagnetism, Bell, London, 1934, p. 79, and by V. V. Batygin and
I. N. Toptygin, Problems in Electrodynamics, 2nd ed., Academic Press, London, 1978, p. 47.

PROBLEMS


Problem 1. Determine the field near an uncharged conducting sphere with radius R placed in a uniform
external electric field $\mathbf{G}$.

Solution. We write the potential in the form $\phi = \phi_0 + \phi_1$, where $\phi_0 = -\mathbf{G}\cdot\mathbf{r}$ is the potential of the
external field and $\phi_1$ is the required change in potential due to the sphere. By symmetry, the function $\phi_1$ can
depend only on the constant vector $\mathbf{G}$. The only such solution of Laplace's equation which vanishes at infinity is

$$\phi_1 = -\text{constant} \times \mathbf{G}\cdot\operatorname{grad}(1/r) = \text{constant} \times \mathbf{G}\cdot\mathbf{r}/r^3,$$

the origin being taken at the centre of the sphere. On the surface of the sphere φ must be constant, and so the
constant in $\phi_1$ is $R^3$, whence

$$\phi = -G r\cos\theta\,(1 - R^3/r^3),$$

where θ is the angle between $\mathbf{G}$ and $\mathbf{r}$. The distribution of charge on the surface of the sphere is given by

$$\sigma = -(1/4\pi)[\partial\phi/\partial r]_{r=R} = (3G/4\pi)\cos\theta.$$

The total charge e = 0. The dipole moment of the sphere is most easily found by comparing $\phi_1$ with the potential
$\boldsymbol{\mathcal{P}}\cdot\mathbf{r}/r^3$ of an electric dipole field, whence $\boldsymbol{\mathcal{P}} = R^3\mathbf{G}$.
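The two integral statements of this solution (zero total charge, dipole moment $R^3 G$) follow from the surface density $\sigma = (3G/4\pi)\cos\theta$ and can be confirmed by direct integration over the sphere. A minimal sketch, with function names and sample values chosen here for illustration:

```python
import math


def surface_charge(theta, field=1.0):
    """Induced surface density sigma = (3G / 4 pi) cos(theta) on the sphere."""
    return 3.0 * field * math.cos(theta) / (4.0 * math.pi)


def sphere_moments(radius=1.0, field=1.0, steps=20000):
    """Total charge and dipole moment along the field, by the midpoint rule."""
    d_theta = math.pi / steps
    charge = 0.0
    dipole = 0.0
    for k in range(steps):
        theta = (k + 0.5) * d_theta
        ring_area = 2.0 * math.pi * radius ** 2 * math.sin(theta) * d_theta
        sigma = surface_charge(theta, field)
        charge += sigma * ring_area
        dipole += sigma * radius * math.cos(theta) * ring_area
    return charge, dipole


total_charge, dipole_moment = sphere_moments(radius=2.0, field=0.5)
```

With radius 2 and field 0.5 (arbitrary units) the dipole moment should come out as $R^3 G = 4$.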

Problem 2. The same as Problem 1, but for an infinite cylinder in a uniform transverse field.

Solution. We use polar coordinates in a plane perpendicular to the axis of the cylinder. The solution of the
two-dimensional Laplace's equation which depends only on a constant vector is

$$\phi_1 = \text{constant} \times \mathbf{G}\cdot\operatorname{grad}(\log r) = \text{constant} \times \mathbf{G}\cdot\mathbf{r}/r^2.$$

Adding $\phi_0 = -\mathbf{G}\cdot\mathbf{r}$ and putting the constant equal to $R^2$, we have

$$\phi = -G r\cos\theta\,(1 - R^2/r^2).$$

The surface charge density is $\sigma = (G/2\pi)\cos\theta$. The dipole moment per unit length of the cylinder can be found by
comparing φ with the potential of a two-dimensional dipole field, namely $2\boldsymbol{\mathcal{P}}\cdot\operatorname{grad}(\log r) = 2\boldsymbol{\mathcal{P}}\cdot\mathbf{r}/r^2$, so that
$\boldsymbol{\mathcal{P}} = \tfrac12 R^2\mathbf{G}$.

Problem 3. Determine the field near a wedge-shaped projection on a conductor. 

Solution. We take polar coordinates r, θ in a plane perpendicular to the apex of the wedge, the origin being
at the vertex of the angle $\theta_0$ of the wedge (Fig. 3). The angle θ is measured from one face of the wedge, the region
outside the conductor being $0 \le \theta \le 2\pi - \theta_0$. Near the apex of the wedge the potential can be expanded in powers
of r, and we shall be interested in the first term of the expansion (after the constant term), which contains the
lowest power of r. The solutions of the two-dimensional Laplace's equation which are proportional to $r^n$ are
$r^n\cos n\theta$ and $r^n\sin n\theta$. The solution having the smallest n which satisfies the condition φ = constant for θ = 0 and
$\theta = 2\pi - \theta_0$ (i.e. on the surface of the conductor) is

$$\phi = \text{constant} \times r^n\sin n\theta, \qquad n = \pi/(2\pi - \theta_0).$$

The field varies as $r^{n-1}$. For $\theta_0 < \pi$ ($n < 1$), therefore, the field becomes infinite at the apex of the wedge. In
particular, for a very sharp wedge ($\theta_0 \ll 1$, $n \cong \tfrac12$) E increases as $r^{-1/2}$ as $r \to 0$. Near a wedge-shaped concavity in a
conductor ($\theta_0 > \pi$, $n > 1$) the field tends to zero.

Fig. 3

The value of the constant can be determined only by solving the problem for the whole field. For example, for a
very sharp wedge in the field of a point charge e, the passage to the limit of small r in (3.21) confirms that

$$\phi = \text{constant} \times \sqrt{r}\,\sin\tfrac12\theta,$$

the constant being $[4e a^{1/2}/\pi(a^2 + z^2)]\sin\tfrac12\gamma$. In this case, "near the wedge" means that $r \ll a$, under which
condition the $\partial^2\phi/\partial z^2$ term in Laplace's equation may be neglected.

Problem 4. Determine the field near the end of a sharp conical point on the surface of a conductor. 

Solution. We take spherical polar coordinates, with the origin at the vertex of the cone and the polar axis
along the axis of the cone. Let the angle of the cone be $2\theta_0 \ll 1$, so that the region outside the conductor
corresponds to polar angles in the range $\theta_0 \le \theta \le \pi$. As in Problem 3, we seek a solution for the variable part of
the potential, which is symmetrical about the axis, in the form

$$\phi = r^n f(\theta), \tag{1}$$

with the smallest possible value of n. Laplace's equation

$$\frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\frac{\partial\phi}{\partial r}\right) + \frac{1}{r^2\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\,\frac{\partial\phi}{\partial\theta}\right) = 0,$$

after substitution of (1), gives

$$\frac{1}{\sin\theta}\frac{d}{d\theta}\left(\sin\theta\,\frac{df}{d\theta}\right) + n(n+1)f = 0. \tag{2}$$

The condition of constant potential on the surface of the cone means that we must have $f(\theta_0) = 0$.
For small $\theta_0$ we seek a solution by assuming that $n \ll 1$ and $f(\theta)$ is of the form constant × $[1 + \psi(\theta)]$, where
$\psi \ll 1$. (For $\theta_0 \to 0$, i.e. an infinitely sharp point, we should expect that φ tends to a constant almost everywhere
near the cone.) The equation for ψ is

$$\frac{1}{\sin\theta}\frac{d}{d\theta}\left(\sin\theta\,\frac{d\psi}{d\theta}\right) = -n. \tag{3}$$

The solution having no singularities outside the cone (in particular, at θ = π) is $\psi(\theta) = 2n\log\sin\tfrac12\theta$.

For $\theta \sim \theta_0 \ll 1$, ψ is no longer small. Nevertheless, this expression remains valid, since the second term in
equation (2) may be neglected because θ is small. To determine the constant n in the first approximation we must
require that the function $f = 1 + \psi$ vanish for $\theta = \theta_0$. Thus† $n = -1/(2\log\theta_0)$. The field increases to infinity as
$r^{-(1-n)}$ in the neighbourhood of the vertex, i.e. essentially as 1/r.


Problem 5. The same as Problem 4, but for a sharp conical depression on the surface of a conductor. 


Solution. The region outside the conductor now corresponds to the range $0 \le \theta \le \theta_0$. As in Problem 4, we
seek φ in the form (1), but now $n \gg 1$. Since $\theta \ll 1$ for all points in the field, equation (2) becomes

$$\frac{1}{\theta}\frac{d}{d\theta}\left(\theta\,\frac{df}{d\theta}\right) + n^2 f = 0.$$

This is Bessel's equation, and the solution having no singularities in the field is $J_0(n\theta)$. The value of n is determined
as the smallest root of the equation $J_0(n\theta_0) = 0$, whence $n \approx 2.4/\theta_0$.
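The numerical factor 2.4 is the first zero of the Bessel function $J_0$. The sketch below locates it with the power series of $J_0$ and a bisection; the function names and the bracketing interval [2, 3] are choices made here for illustration, and a library routine would normally be used instead.

```python
import math


def bessel_j0(x, terms=40):
    """Power series J0(x) = sum_k (-1)^k (x/2)^(2k) / (k!)^2 (fine for modest x)."""
    total = 0.0
    term = 1.0
    for k in range(terms):
        total += term
        term *= -(x * x / 4.0) / ((k + 1) ** 2)
    return total


def first_zero_of_j0(lo=2.0, hi=3.0, tol=1e-12):
    """Bisection on [lo, hi], an interval in which J0 changes sign once."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if bessel_j0(lo) * bessel_j0(mid) <= 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


def depression_exponent(theta0):
    """Exponent n in phi ~ r**n * J0(n * theta), valid only for theta0 << 1."""
    return first_zero_of_j0() / theta0


root = first_zero_of_j0()
n = depression_exponent(0.1)
```

For a half-angle of 0.1 rad this gives $n \approx 24$, so the field, which varies as $r^{n-1}$, falls off extremely rapidly towards the bottom of the depression.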

Problem 6. Determine the energy of the attraction between an electric dipole and a plane conducting 
surface. 

Solution. We take the x-axis perpendicular to the surface of the conductor, and passing through the dipole;
let the dipole moment vector $\boldsymbol{\mathcal{P}}$ lie in the xy-plane. The image of the dipole is at the point $-x$ and has a moment
$\mathcal{P}'_x = \mathcal{P}_x$, $\mathcal{P}'_y = -\mathcal{P}_y$. The required energy of attraction is the energy of the interaction between the dipole and
its image, and is - (2 

Problem 7. Determine the mutual capacitance per unit length of two parallel infinite conducting cylinders
with radii a and b, their axes being at a distance c apart.‡

† A more exact formula $n = 1/[2\log(2/\theta_0)]$, containing a coefficient in the (large) logarithm, cannot really be
obtained by the simple method given here. A more rigorous calculation, however, leads, as it happens, to this same
formula.

‡ The corresponding problem for two spheres cannot be solved in closed form. The difference arises because,
in the field of two parallel wires bearing equal and opposite charges, all the equipotential surfaces are circular
cylinders, whereas in the field of two equal and opposite point charges the equipotential surfaces are not spheres.

Solution. The field due to the two cylinders is the same as that which would be produced (in the region
outside the cylinders) by two charged wires passing through certain points A and A' (Fig. 4). The wires have
charges ±e per unit length, equal to the charges on the cylinders, and the points A and A' lie on OO' in such a way
that the surfaces of the cylinders are equipotential surfaces. For this to be so, the distances $OA = d_1$ and $O'A' = d_2$ must be
such that $OA\cdot OA' = a^2$, $O'A'\cdot O'A = b^2$, i.e. $d_1(c - d_2) = a^2$, $d_2(c - d_1) = b^2$. Then, for each cylinder, the ratio
$r/r'$ of the distances from A and A' is constant. On cylinder 1, $r/r' = a/OA' = a/(c - d_2) = d_1/a$, and on cylinder 2,
$r'/r = d_2/b$. Accordingly, the potentials of the cylinders are $\phi_1 = -2e\log(r/r') = -2e\log(d_1/a)$,
$\phi_2 = 2e\log(d_2/b)$, so that $\phi_1 - \phi_2 = 2e\log(ab/d_1d_2)$, which is positive because $d_1 < a$ and $d_2 < b$. Hence we find the required mutual capacitance $C = e/(\phi_1 - \phi_2)$:

$$1/C = 2\log(ab/d_1d_2) = 2\cosh^{-1}[(c^2 - a^2 - b^2)/2ab].$$

In particular, for a cylinder with radius a at a distance $h$ ($> a$) from a conducting plane, we put $c = b + h$ and
take the limit as $b \to \infty$, obtaining $1/C = 2\cosh^{-1}(h/a)$.

If two hollow cylinders are placed one inside the other ($c < b - a$), there is no field outside, while the field
between the cylinders is the same as that due to two wires with charges ±e passing through A and A' (Fig. 5). The
same method gives

$$1/C = 2\cosh^{-1}[(a^2 + b^2 - c^2)/2ab].$$
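The closed form for two external cylinders can be cross-checked by solving the two conditions $d_1(c - d_2) = a^2$, $d_2(c - d_1) = b^2$ for the wire positions and comparing the logarithmic and inverse-hyperbolic expressions. This is a sketch under stated assumptions: the function names are introduced here, the roots chosen are those with the wires inside the cylinders ($d_1 < a$, $d_2 < b$), and the potential difference is taken in magnitude so that the capacitance is positive.

```python
import math


def wire_positions(a, b, c):
    """Solve d1*(c - d2) = a^2 and d2*(c - d1) = b^2 for d1 < a, d2 < b (needs c > a + b)."""
    s = c * c + a * a - b * b
    d1 = (s - math.sqrt(s * s - 4.0 * a * a * c * c)) / (2.0 * c)
    d2 = c - a * a / d1
    return d1, d2


def inverse_capacitance_from_wires(a, b, c):
    """1/C = 2 log(a*b / (d1*d2)), from the potentials of the two image wires."""
    d1, d2 = wire_positions(a, b, c)
    return 2.0 * math.log(a * b / (d1 * d2))


def inverse_capacitance_closed_form(a, b, c):
    """1/C = 2 arccosh((c^2 - a^2 - b^2) / (2ab)) for cylinders outside each other."""
    return 2.0 * math.acosh((c * c - a * a - b * b) / (2.0 * a * b))


def inverse_capacitance_to_plane(a, h):
    """Limit b -> infinity with c = b + h: 1/C = 2 arccosh(h / a)."""
    return 2.0 * math.acosh(h / a)


d1, d2 = wire_positions(1.0, 2.0, 5.0)
from_wires = inverse_capacitance_from_wires(1.0, 2.0, 5.0)
closed_form = inverse_capacitance_closed_form(1.0, 2.0, 5.0)
# A very large second cylinder approaches the plane geometry (b = 1e6 is an arbitrary choice).
near_plane = inverse_capacitance_closed_form(1.0, 1.0e6, 1.0e6 + 3.0)
```

The last line illustrates the passage to a cylinder above a plane: with $h = 3a$ the two-cylinder formula tends to $2\cosh^{-1}3$.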



Problem 8. The boundary of a conductor is an infinite plane with a hemispherical projection. Determine the 
charge distribution on the surface. 

Solution. In the field determined in Problem 1, whose potential is

$$\phi = \text{constant} \times z\,(1 - R^3/r^3),$$

the plane z = 0 with a projection r = R is an equipotential surface, on which φ = 0. Hence it can be the surface of
a conductor, and the above formula gives the field outside the conductor. The charge distribution on the plane
part of the surface is given by

$$\sigma = -\frac{1}{4\pi}\left[\frac{\partial\phi}{\partial z}\right]_{z=0} = \sigma_0\left(1 - \frac{R^3}{r^3}\right);$$

we have taken the constant in φ as $-4\pi\sigma_0$, so that $\sigma_0$ is the charge density far from the projection. On the surface
of the projection we have

$$\sigma = -\frac{1}{4\pi}\left[\frac{\partial\phi}{\partial r}\right]_{r=R} = 3\sigma_0\,\frac{z}{R}.$$

Problem 9. Determine the dipole moment of a thin conducting cylindrical rod, with length $2l$ and radius
$a \ll l$, in an electric field $G$ parallel to its axis.

Solution. Let $\tau(z)$ be the charge per unit length induced on the surface of the rod, and z the coordinate along
the axis of the rod, measured from its midpoint. The condition of constant potential on the surface of the
conductor is

$$-Gz + \frac{1}{2\pi}\int_0^{2\pi}\!\!\int_{-l}^{l} \frac{\tau(z')\,dz'\,d\phi}{R} = 0, \qquad R^2 = (z'-z)^2 + 4a^2\sin^2\tfrac12\phi,$$

where φ is the angle between planes passing through the axis of the cylinder and through two points on its surface
at a distance R apart. We divide the integral into two parts, putting $\tau(z') = \tau(z) + [\tau(z') - \tau(z)]$. Since $l \gg a$, we
have for points not too near the ends of the rod

$$\frac{\tau(z)}{2\pi}\int\!\!\int\frac{dz'\,d\phi}{R} \approx \frac{\tau(z)}{2\pi}\int_0^{2\pi}\log\frac{l^2 - z^2}{a^2\sin^2\tfrac12\phi}\,d\phi = \tau(z)\log\frac{4(l^2 - z^2)}{a^2},$$

using the result that $\int_0^\pi \log\sin\phi\,d\phi = -\pi\log 2$. In the integral which contains the difference $\tau(z') - \tau(z)$, we can
neglect the term in $a^2$ in R, since it no longer causes the integral to diverge. Thus

$$Gz = \tau(z)\log[4(l^2 - z^2)/a^2] + \int_{-l}^{l}\frac{\tau(z') - \tau(z)}{|z' - z|}\,dz'.$$

The quantity τ is almost proportional to z, and in this approximation the integral gives $-2\tau(z)$, the result being

$$\tau(z) = \frac{Gz}{\log[4(l^2 - z^2)/a^2] - 2}.$$

This expression is invalid near the ends of the rod, but in calculating the dipole moment that region is
unimportant. In the above approximation we have

$$\mathcal{P} = \int_{-l}^{l}\tau(z)\,z\,dz = \frac{Gl^3}{3L}\left[1 + \frac{1}{L}\left(\frac43 - \log 2\right)\right],$$

where $L = \log(2l/a) - 1$ is large, or (with the same accuracy)

$$\mathcal{P} = \frac{Gl^3}{3\log(4l/a) - 7}.$$

Problem 10. Determine the capacitance of a hollow conducting cap of a sphere. 

Solution. We take the origin O at a point on the rim of the cap (Fig. 6), and carry out the inversion
transformation $r = l^2/r'$, where $l$ is the diameter of the cap. The cap then becomes the half-plane shown by the
dashed line in Fig. 6, which is perpendicular to the radius AO of the cap and passes through the point B on its rim.

Fig. 6

The angle $\gamma = \pi - \theta$, where 2θ is the angle subtended by the diameter of the cap at the centre of the sphere.

If the charge on the cap is e and its potential is taken as zero, then as $r \to \infty$ the potential $\phi \to -\phi_0 + e/r$.
Accordingly, in the transformed problem, as $r' \to 0$ the potential is $\phi' \to l\phi/r' = -l\phi_0/r' + e/l$, where the first
term corresponds to a charge $e' = -l\phi_0$ at the origin.

According to formula (3.22) we have

$$\phi' = \frac{e'}{r'} - \frac{e'}{2\pi l}\left(1 + \frac{\theta}{\sin\theta}\right)$$

(the potential near a charge $e'$ at a distance $l$ from the edge of a conducting half-plane at zero potential).
Comparing the two expressions, we have for the required capacitance $C = e/\phi_0$

$$C = \frac{l}{2\pi}\left(1 + \frac{\theta}{\sin\theta}\right) = \frac{R}{\pi}(\sin\theta + \theta),$$

where R is the radius of the cap.

Problem 11. Determine the correction due to edge effects on the value $C = S/4\pi d$ for the capacitance of a
plane capacitor (S being the area of the plates, and d the distance between them).

Solution. Since the plates have free edges, the distribution of charge over them is not uniform. To determine
the required correction in a first approximation, we consider points which are at distances x from the edge such
that $d \ll x \ll \sqrt{S}$. For example, taking the upper plate (at potential $\tfrac12\phi_0$, Fig. 7a) and neglecting its distance
$\tfrac12 d$ from the midplane (the equipotential surface φ = 0), we have the problem of the field near the boundary
between two parts of a plane having different potentials (Fig. 7b). The solution is elementary†, and the excess
charge density (relative to the value of σ far from the edge) is $\Delta\sigma = E_n/4\pi = \phi_0/8\pi^2x$, so that the total excess
charge is $L\int\Delta\sigma\,dx = (\phi_0L/8\pi^2)\log(\sqrt{S}/d)$, where L is the perimeter of the plate. In calculating the
logarithmically divergent integral, we have taken the limits as those of the region $d \ll x \ll \sqrt{S}$. Hence we find the
capacitance

$$C = \frac{S}{4\pi d} + \frac{L}{8\pi^2}\log\frac{\sqrt{S}}{d}.$$

A more exact calculation (determining the coefficient in the argument of the logarithm) demands considerably
more elaborate methods, and the result depends on the shape of the plates. If these are circular, with radius R, we
obtain Kirchhoff's formula

$$C = \frac{R^2}{4d} + \frac{R}{4\pi}\left(\log\frac{16\pi R}{d} - 1\right).$$

† See §23. In formula (23.2) for the potential we must here put $\phi_{ab} = \tfrac12\phi_0$.


§4. A conducting ellipsoid 

The problem of the field of a charged conducting ellipsoid and that of an ellipsoid in a
uniform external field are solved by the use of ellipsoidal coordinates. These are related to
Cartesian coordinates by the equation

$$\frac{x^2}{a^2+u} + \frac{y^2}{b^2+u} + \frac{z^2}{c^2+u} = 1 \qquad (a > b > c). \tag{4.1}$$

This equation, a cubic in u, has three different real roots ξ, η, ζ, which lie in the following
ranges:

$$\xi \ge -c^2, \qquad -c^2 \ge \eta \ge -b^2, \qquad -b^2 \ge \zeta \ge -a^2. \tag{4.2}$$

These three roots are the ellipsoidal coordinates of the point x, y, z. Their geometrical
significance is seen from the fact that the surfaces of constant ξ, η and ζ are respectively
ellipsoids and hyperboloids of one and two sheets, all confocal with the ellipsoid

$$x^2/a^2 + y^2/b^2 + z^2/c^2 = 1. \tag{4.3}$$


One surface of each of the three families passes through each point in space, and the
three surfaces are orthogonal. The formulae for transformation from ellipsoidal to
Cartesian coordinates are given by solving three simultaneous equations of the type (4.1),
and are

$$x = \pm\sqrt{\frac{(\xi+a^2)(\eta+a^2)(\zeta+a^2)}{(b^2-a^2)(c^2-a^2)}}, \quad
y = \pm\sqrt{\frac{(\xi+b^2)(\eta+b^2)(\zeta+b^2)}{(c^2-b^2)(a^2-b^2)}}, \quad
z = \pm\sqrt{\frac{(\xi+c^2)(\eta+c^2)(\zeta+c^2)}{(a^2-c^2)(b^2-c^2)}}. \tag{4.4}$$
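The ranges (4.2) and the inverse formulae (4.4) can be illustrated numerically. The sketch below finds the three roots of the cubic (4.1) by bisection, one in each of the intervals (4.2), and then rebuilds $|x|$, $|y|$, $|z|$ from (4.4). The function names, the semiaxes and the sample point are assumptions made here; the point is taken with all three Cartesian coordinates non-zero so that each interval contains a simple sign change.

```python
import math


def _root_between(f, lo, hi, iterations=200):
    """Bisection for a function with opposite signs at lo and hi."""
    f_lo = f(lo)
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if (f_mid > 0.0) == (f_lo > 0.0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def ellipsoidal_coordinates(x, y, z, a, b, c):
    """Roots (xi, eta, zeta) of equation (4.1); assumes a > b > c and x, y, z non-zero."""
    def f(u):
        return x * x / (a * a + u) + y * y / (b * b + u) + z * z / (c * c + u) - 1.0

    gap_bc = 1e-12 * (b * b - c * c)
    gap_ab = 1e-12 * (a * a - b * b)
    xi = _root_between(f, -c * c + gap_bc, x * x + y * y + z * z)
    eta = _root_between(f, -b * b + gap_bc, -c * c - gap_bc)
    zeta = _root_between(f, -a * a + gap_ab, -b * b - gap_ab)
    return xi, eta, zeta


def cartesian_magnitudes(xi, eta, zeta, a, b, c):
    """|x|, |y|, |z| from formulae (4.4)."""
    a2, b2, c2 = a * a, b * b, c * c
    x = math.sqrt((xi + a2) * (eta + a2) * (zeta + a2) / ((b2 - a2) * (c2 - a2)))
    y = math.sqrt((xi + b2) * (eta + b2) * (zeta + b2) / ((c2 - b2) * (a2 - b2)))
    z = math.sqrt((xi + c2) * (eta + c2) * (zeta + c2) / ((a2 - c2) * (b2 - c2)))
    return x, y, z


semiaxes = (3.0, 2.0, 1.0)
xi, eta, zeta = ellipsoidal_coordinates(1.0, 1.5, 2.0, *semiaxes)
rebuilt = cartesian_magnitudes(xi, eta, zeta, *semiaxes)
```

Since (4.4) fixes the Cartesian coordinates only up to sign, the round trip recovers the absolute values; the octant has to be supplied separately.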

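The element of length in ellipsoidal coordinates is

$$dl^2 = h_1^2\,d\xi^2 + h_2^2\,d\eta^2 + h_3^2\,d\zeta^2, \tag{4.5}$$
$$h_1 = \sqrt{(\xi-\eta)(\xi-\zeta)}/2R_\xi, \quad h_2 = \sqrt{(\eta-\zeta)(\eta-\xi)}/2R_\eta, \quad h_3 = \sqrt{(\zeta-\xi)(\zeta-\eta)}/2R_\zeta,$$
$$R_u^2 = (u+a^2)(u+b^2)(u+c^2), \qquad u = \xi, \eta, \zeta.$$

Accordingly, Laplace's equation in these coordinates is

$$\Delta\phi = \frac{4}{(\xi-\eta)(\zeta-\xi)(\eta-\zeta)}\left[(\eta-\zeta)R_\xi\frac{\partial}{\partial\xi}\left(R_\xi\frac{\partial\phi}{\partial\xi}\right) + (\zeta-\xi)R_\eta\frac{\partial}{\partial\eta}\left(R_\eta\frac{\partial\phi}{\partial\eta}\right) + (\xi-\eta)R_\zeta\frac{\partial}{\partial\zeta}\left(R_\zeta\frac{\partial\phi}{\partial\zeta}\right)\right] = 0. \tag{4.6}$$

If two of the semiaxes a, b, c become equal, the system of ellipsoidal coordinates
degenerates. Let a = b > c. Then the cubic equation (4.1) becomes a quadratic,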


$$\frac{\rho^2}{a^2+u} + \frac{z^2}{c^2+u} = 1, \qquad \rho^2 = x^2 + y^2, \tag{4.7}$$

with two roots whose values lie in the ranges $\xi \ge -c^2$, $-c^2 \ge \eta \ge -a^2$. The coordinate
surfaces of constant ξ and η become respectively confocal oblate spheroids and confocal
hyperboloids of revolution of one sheet (Fig. 8). As the third coordinate we can take the
polar angle φ in the xy-plane ($x = \rho\cos\phi$, $y = \rho\sin\phi$). For a = b the ellipsoidal
coordinate ζ degenerates to a constant, $-a^2$. Its relation to the angle φ is given by the way
in which it tends to $-a^2$ as b tends to a, namely

$$\cos\phi = \sqrt{(a^2+\zeta)/(a^2-b^2)} \quad\text{as}\quad b \to a. \tag{4.8}$$

Fig. 8

This is easily seen from (4.4) or directly from (4.1). The relation between the coordinates
z, ρ and ξ, η is given, according to (4.4), by

$$z = \pm\sqrt{\frac{(\xi+c^2)(\eta+c^2)}{c^2-a^2}}, \qquad \rho = \sqrt{\frac{(\xi+a^2)(\eta+a^2)}{a^2-c^2}}. \tag{4.9}$$

The coordinates ξ, η, φ are called oblate spheroidal coordinates.†

Similarly, for a > b = c ellipsoidal coordinates become prolate spheroidal coordinates.
Two coordinates ξ and ζ are roots of the equation

$$\frac{x^2}{a^2+u} + \frac{\rho^2}{b^2+u} = 1, \qquad \rho^2 = y^2 + z^2, \tag{4.10}$$

where $\xi \ge -b^2$, $-b^2 \ge \zeta \ge -a^2$. The surfaces of constant ξ and ζ are prolate spheroids
and hyperboloids of revolution of two sheets (Fig. 9). The coordinate η degenerates to a
constant, $-b^2$, for $c \to b$, and we have

$$\cos\phi = \sqrt{(b^2+\eta)/(b^2-c^2)}, \tag{4.11}$$

where φ is the polar angle in the yz-plane. The relation between the coordinates x, ρ and
ξ, ζ is given by

$$x = \pm\sqrt{\frac{(\xi+a^2)(\zeta+a^2)}{a^2-b^2}}, \qquad \rho = \sqrt{\frac{(\xi+b^2)(\zeta+b^2)}{b^2-a^2}}. \tag{4.12}$$

† We here define spheroidal coordinates to be the limit of ellipsoidal coordinates. Other definitions are used in
the literature, but are easily related to ours.

In a system of oblate spheroidal coordinates the foci of the spheroids and hyperboloids
lie on a circle of radius $\sqrt{a^2 - c^2}$ in the xy-plane; in Fig. 8 AA' is a diameter of this circle.
Let us draw a plane passing through the z-axis and some point P. It intersects the focal
circle at two points; let their distances from P be $r_1$, $r_2$. If the coordinates of P are ρ, z, then

$$r_1^2 = [\rho - \sqrt{a^2-c^2}]^2 + z^2, \qquad r_2^2 = [\rho + \sqrt{a^2-c^2}]^2 + z^2.$$

Fig. 9

The spheroidal coordinates ξ, η are given in terms of $r_1$, $r_2$ by

$$\xi = \tfrac14(r_1 + r_2)^2 - a^2, \qquad \eta = \tfrac14(r_2 - r_1)^2 - a^2. \tag{4.13}$$

In a system of prolate spheroidal coordinates the foci are the points $x = \pm\sqrt{a^2 - b^2}$
on the x-axis (the points A, A' in Fig. 9). If $r_1$ and $r_2$ are the distances of these foci from P, then

$$r_1^2 = \rho^2 + [x - \sqrt{a^2-b^2}]^2, \qquad r_2^2 = \rho^2 + [x + \sqrt{a^2-b^2}]^2,$$

and the spheroidal coordinates ξ, ζ are given in terms of $r_1$, $r_2$ by the same formulae (4.13)
with ζ in place of η.

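Let us now turn to the problem of the field of a charged ellipsoid whose surface is given
by the equation (4.3). In ellipsoidal coordinates this is the surface ξ = 0. It is therefore clear
that, if we seek the field potential as a function of ξ only, all the ellipsoidal surfaces
ξ = constant, and in particular the surface of the conductor, will be equipotential surfaces.
Laplace's equation (4.6) then becomes

$$\frac{d}{d\xi}\left(R_\xi\frac{d\phi}{d\xi}\right) = 0, \quad\text{whence}\quad \phi(\xi) = A\int_\xi^\infty\frac{d\xi}{R_\xi}.$$

The upper limit of integration is taken so that the field is zero at infinity. The constant
A is most simply determined from the condition that at large distances r the field must
become a Coulomb field and $\phi \cong e/r$, where e is the total charge on the conductor. When
$r \to \infty$, $\xi \to \infty$, and $\xi \cong r^2$, as we see from equation (4.1) with u = ξ. For large ξ we have
$R_\xi \cong \xi^{3/2}$ and $\phi \cong 2A/\sqrt{\xi} = 2A/r$. Hence 2A = e, and therefore

$$\phi(\xi) = \tfrac12 e\int_\xi^\infty\frac{d\xi}{R_\xi}. \tag{4.14}$$

The integral is an elliptic integral of the first kind. The surface of the conductor
corresponds to ξ = 0, and so the capacitance of the conductor is given by

$$\frac{1}{C} = \tfrac12\int_0^\infty\frac{d\xi}{R_\xi}. \tag{4.15}$$

The distribution of charge on the surface of the ellipsoid is determined by the normal
derivative of the potential:

$$\sigma = -\frac{1}{4\pi}\left[\frac{\partial\phi}{\partial n}\right]_{\xi=0} = -\frac{1}{4\pi}\left[\frac{1}{h_1}\frac{\partial\phi}{\partial\xi}\right]_{\xi=0} = \frac{e}{4\pi}\frac{1}{\sqrt{\eta\zeta}}.$$

From equations (4.4) we easily see that for ξ = 0

$$\frac{x^2}{a^4} + \frac{y^2}{b^4} + \frac{z^2}{c^4} = \frac{\eta\zeta}{a^2b^2c^2}.$$

Hence

$$\sigma = \frac{e}{4\pi abc}\left(\frac{x^2}{a^4} + \frac{y^2}{b^4} + \frac{z^2}{c^4}\right)^{-1/2}. \tag{4.16}$$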
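For a spheroid the integrals (4.14), (4.15) degenerate and can be expressed in terms of
elementary functions. For a prolate spheroid (a > b = c) the field potential is

$$\phi = \frac{e}{\sqrt{a^2-b^2}}\tanh^{-1}\sqrt{\frac{a^2-b^2}{\xi+a^2}}, \tag{4.17}$$

and the capacitance is

$$C = \frac{\sqrt{a^2-b^2}}{\cosh^{-1}(a/b)}. \tag{4.18}$$

For an oblate spheroid (a = b > c) we have

$$\phi = \frac{e}{\sqrt{a^2-c^2}}\tan^{-1}\sqrt{\frac{a^2-c^2}{\xi+c^2}}, \qquad C = \frac{\sqrt{a^2-c^2}}{\cos^{-1}(c/a)}. \tag{4.19}$$

In particular, for a circular disc (a = b, c = 0)

$$C = 2a/\pi. \tag{4.20}$$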
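The closed forms (4.18) to (4.20) can be compared with a direct numerical evaluation of the integral (4.15). The following sketch is an illustration only: the function names are introduced here, and the substitution $\xi = [u/(1-u)]^2$, which maps the infinite range onto $0 < u < 1$ and removes the end-point singularity of the disc case, is a choice made for this example.

```python
import math


def capacitance_prolate(a, b):
    """Formula (4.18): prolate spheroid with semiaxes a > b = c."""
    return math.sqrt(a * a - b * b) / math.acosh(a / b)


def capacitance_oblate(a, c):
    """Formula (4.19): oblate spheroid with semiaxes a = b > c; c = 0 gives the disc (4.20)."""
    return math.sqrt(a * a - c * c) / math.acos(c / a)


def capacitance_numeric(a, b, c, steps=20000):
    """Midpoint rule for 1/C = (1/2) int_0^inf d(xi) / R_xi, formula (4.15)."""
    total = 0.0
    for k in range(steps):
        u = (k + 0.5) / steps
        xi = (u / (1.0 - u)) ** 2
        d_xi = 2.0 * u / (1.0 - u) ** 3  # d(xi)/du
        r_xi = math.sqrt((xi + a * a) * (xi + b * b) * (xi + c * c))
        total += d_xi / r_xi
    return 2.0 * steps / total  # C = 1 / (0.5 * total / steps)


disc = capacitance_oblate(1.0, 0.0)
disc_numeric = capacitance_numeric(1.0, 1.0, 0.0)
prolate_numeric = capacitance_numeric(2.0, 1.0, 1.0)
oblate_numeric = capacitance_numeric(2.0, 2.0, 1.0)
sphere_numeric = capacitance_numeric(1.0, 1.0, 1.0)
```

The sphere case serves as a sanity check, since in these units the capacitance of a sphere equals its radius.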


Let us now consider the problem of an uncharged conducting ellipsoid in a uniform 
external electric field G. Without loss of generality we may take the field G to be along one 
of the axes of the ellipsoid. In any other case this field may be resolved into components 
along the three axes, and the resultant field is a superposition of those arising from each 
component separately. 

The potential of a uniform field G along the x-axis (the a-axis of the ellipsoid) is, in
ellipsoidal coordinates,

$$\phi_0 = -Gx = -G\sqrt{\frac{(\xi+a^2)(\eta+a^2)(\zeta+a^2)}{(b^2-a^2)(c^2-a^2)}}. \tag{4.21}$$

We write the field potential outside the ellipsoid as $\phi = \phi_0 + \phi'$, where $\phi'$ gives the required
perturbation of the external field by the ellipsoid, and seek $\phi'$ in the form

$$\phi' = \phi_0F(\xi). \tag{4.22}$$

In this function the factors depending on η and ζ are the same as in $\phi_0$; this enables us to
satisfy the boundary condition at ξ = 0 for arbitrary η, ζ (i.e. on the surface of the
ellipsoid). Substituting (4.22) in Laplace's equation (4.6), we obtain for $F(\xi)$ the equation

$$\frac{d^2F}{d\xi^2} + \frac{dF}{d\xi}\frac{d}{d\xi}\log[R_\xi(\xi+a^2)] = 0.$$

One solution of this equation is F = constant, and the other is

$$F(\xi) = A\int_\xi^\infty\frac{ds}{(s+a^2)R_s}. \tag{4.23}$$

The upper limit of integration is taken so that $\phi' \to 0$ for $\xi \to \infty$. The integral is an elliptic
integral of the second kind.

We must have φ = constant on the surface of the ellipsoid. For this condition to be
satisfied with ξ = 0 and arbitrary η, ζ, the constant value of φ must be zero. Determining
the coefficient A in $F(\xi)$ so that $F(0) = -1$, we obtain the following final expression for the
field potential near the ellipsoid:

$$\phi = \phi_0\left\{1 - \int_\xi^\infty\frac{ds}{(s+a^2)R_s}\bigg/\int_0^\infty\frac{ds}{(s+a^2)R_s}\right\}. \tag{4.24}$$


Let us find the form of the potential $\phi'$ at large distances r from the ellipsoid. For
large r, the coordinate ξ is large, and $\xi \cong r^2$, as follows at once from equation (4.1). Hence

$$\int_\xi^\infty\frac{ds}{(s+a^2)R_s} \cong \int_{r^2}^\infty\frac{ds}{s^{5/2}} = \frac{2}{3r^3},$$

and the potential $\phi' = GxV/4\pi n^{(x)}r^3$, where $V = \tfrac43\pi abc$ is the volume of the ellipsoid and
$n^{(x)}$, $n^{(y)}$, $n^{(z)}$ are defined by

$$n^{(x)} = \tfrac12abc\int_0^\infty\frac{ds}{(s+a^2)R_s}, \quad n^{(y)} = \tfrac12abc\int_0^\infty\frac{ds}{(s+b^2)R_s}, \quad n^{(z)} = \tfrac12abc\int_0^\infty\frac{ds}{(s+c^2)R_s}. \tag{4.25}$$

The expression for $\phi'$ is, as we should expect, the potential of an electric dipole:
$\phi' = x\mathcal{P}_x/r^3$, where the dipole moment of the ellipsoid is

$$\mathcal{P}_x = VG/4\pi n^{(x)}. \tag{4.26}$$

Analogous expressions give the dipole moment when the field G is along the y or z axis. 

The positive constants $n^{(x)}$, $n^{(y)}$, $n^{(z)}$ depend only on the shape of the ellipsoid, and not on
its volume; they are called the depolarizing factors.† If the coordinate axes do not
necessarily coincide with those of the ellipsoid, formula (4.26) must be written in the tensor
form

$$(4\pi/V)\,n_{ik}\mathcal{P}_k = G_i. \tag{4.27}$$

The quantities $n^{(x)}$, $n^{(y)}$, $n^{(z)}$ are the principal values of the symmetrical tensor $n_{ik}$ of rank
two. Comparison with the definition (2.13) shows that $\alpha_{ik} = n^{-1}{}_{ik}/4\pi$ is the polarizability
tensor of the conducting ellipsoid.

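In the general case of arbitrary a, b, c, it follows from the definitions of $n^{(x)}$, $n^{(y)}$, $n^{(z)}$ that

$$n^{(x)} < n^{(y)} < n^{(z)} \quad\text{if}\quad a > b > c. \tag{4.28}$$

Further, by adding the integrals for $n^{(x)}$, $n^{(y)}$, $n^{(z)}$ and using as the variable of integration
$u = R_s^2$, we find

$$n^{(x)} + n^{(y)} + n^{(z)} = \tfrac12abc\int_{(abc)^2}^\infty\frac{du}{u^{3/2}},$$

whence

$$n^{(x)} + n^{(y)} + n^{(z)} = 1. \tag{4.29}$$

The sum of the three depolarizing factors is thus unity; in tensor notation, $n_{ii} = 1$. Since
these coefficients are positive, none can exceed unity.

For a sphere (a = b = c) it is evident from symmetry that

$$n^{(x)} = n^{(y)} = n^{(z)} = \tfrac13. \tag{4.30}$$

For a cylinder with its axis in the x-direction ($a \to \infty$), we have‡

$$n^{(x)} = 0, \qquad n^{(y)} = n^{(z)} = \tfrac12; \tag{4.31}$$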


† The same coefficients occur in problems concerning a dielectric ellipsoid in an external electric field, or a
magnetic ellipsoid in a magnetic field (§8). Tables and graphs of the coefficients for spheroids and ellipsoids have
been given by E. C. Stoner (Philosophical Magazine [7] 36, 803, 1945) and J. A. Osborn (Physical Review 67, 351).

‡ These values for a sphere and a cylinder agree, of course, with those found in §3, Problems 1 and 2.

the limiting case $a, b \to \infty$ (a flat plate) corresponds to the obvious values

$$n^{(x)} = n^{(y)} = 0, \qquad n^{(z)} = 1.$$

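The elliptic integrals (4.25) can be expressed in terms of elementary functions if the
ellipsoid is a spheroid. For a prolate spheroid (a > b = c) of eccentricity $e = \sqrt{1 - b^2/a^2}$,

$$n^{(x)} = \frac{1-e^2}{2e^3}\left(\log\frac{1+e}{1-e} - 2e\right), \qquad n^{(y)} = n^{(z)} = \tfrac12(1 - n^{(x)}). \tag{4.32}$$

If the spheroid is nearly spherical ($e \ll 1$) we have approximately

$$n^{(x)} = \tfrac13 - \tfrac{2}{15}e^2, \qquad n^{(y)} = n^{(z)} = \tfrac13 + \tfrac{1}{15}e^2. \tag{4.33}$$

For an oblate spheroid (a = b > c)

$$n^{(z)} = \frac{1+e^2}{e^3}(e - \tan^{-1}e), \qquad n^{(x)} = n^{(y)} = \tfrac12(1 - n^{(z)}), \tag{4.34}$$

where $e = \sqrt{a^2/c^2 - 1}$. If $e \ll 1$, then

$$n^{(z)} = \tfrac13 + \tfrac{2}{15}e^2, \qquad n^{(x)} = n^{(y)} = \tfrac13 - \tfrac{1}{15}e^2. \tag{4.35}$$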
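The depolarizing factors of a spheroid can be checked against the defining integrals (4.25) and the sum rule (4.29). The sketch below is illustrative: the function names and the substitution $s = [u/(1-u)]^2$ are choices made here, and the closed forms used are the standard spheroid expressions, written with $e = \sqrt{1 - b^2/a^2}$ for the prolate case and $e = \sqrt{a^2/c^2 - 1}$ for the oblate case.

```python
import math


def depolarizing_numeric(a, b, c, axis, steps=20000):
    """n = (abc/2) int_0^inf ds / ((s + axis^2) R_s), formula (4.25), midpoint rule."""
    total = 0.0
    for k in range(steps):
        u = (k + 0.5) / steps
        s = (u / (1.0 - u)) ** 2
        d_s = 2.0 * u / (1.0 - u) ** 3  # ds/du
        r_s = math.sqrt((s + a * a) * (s + b * b) * (s + c * c))
        total += d_s / ((s + axis * axis) * r_s)
    return 0.5 * a * b * c * total / steps


def prolate_factor_along_axis(a, b):
    """n^(x) for a prolate spheroid a > b = c, with e = sqrt(1 - b^2/a^2)."""
    e = math.sqrt(1.0 - b * b / (a * a))
    return (1.0 - e * e) / (2.0 * e ** 3) * (math.log((1.0 + e) / (1.0 - e)) - 2.0 * e)


def oblate_factor_along_axis(a, c):
    """n^(z) for an oblate spheroid a = b > c, with e = sqrt(a^2/c^2 - 1)."""
    e = math.sqrt(a * a / (c * c) - 1.0)
    return (1.0 + e * e) / e ** 3 * (e - math.atan(e))


# Prolate spheroid with semiaxes 2, 1, 1 and oblate spheroid with semiaxes 2, 2, 1.
n_x = depolarizing_numeric(2.0, 1.0, 1.0, axis=2.0)
n_y = depolarizing_numeric(2.0, 1.0, 1.0, axis=1.0)
n_z_oblate = depolarizing_numeric(2.0, 2.0, 1.0, axis=1.0)
```

For the 2:1 prolate spheroid the factor along the long axis is about 0.17, well below the value 1/3 for a sphere, as the ordering (4.28) requires.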


Problem 1. Find the field of a charged conducting circular disc with radius a, expressing it in cylindrical 
coordinates. Find the distribution of charge on the disc. 

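Solution. The charge distribution is obtained by taking the limit of formula (4.16) as $c \to 0$, $z \to 0$, with
$z/c = \sqrt{1 - r^2/a^2}$ (where $r^2 = x^2 + y^2$), in accordance with (4.3). This gives

$$\sigma = \frac{e}{4\pi a\sqrt{a^2 - r^2}}.$$

The field potential is given in all space by formula (4.19), where we put c = 0 and express ξ in terms of r and z by
means of equation (4.1) with c = 0, a = b:

$$\phi = \frac{e}{a}\tan^{-1}\left\{\frac{2a^2}{r^2 + z^2 - a^2 + \sqrt{(r^2 + z^2 - a^2)^2 + 4a^2z^2}}\right\}^{1/2}.$$

Near the edge of the disc, we replace r and z by coordinates ρ and θ such that $z = \rho\sin\theta$, $r = a - \rho\cos\theta$
(Fig. 10; $\rho \ll a$), obtaining $\sqrt{\xi} \cong \sqrt{2a\rho}\,\sin\tfrac12\theta$ and

$$\phi \cong \frac{e}{a}\left[\tfrac12\pi - \sqrt{\frac{2\rho}{a}}\,\sin\tfrac12\theta\right],$$

in agreement with the general result derived in §3, Problem 3.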

Problem 2. Determine the electric quadrupole moment of a charged ellipsoid. 

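Solution. The quadrupole moment tensor of a charged conductor is defined as $D_{ik} = e(3\overline{x_ix_k} - \overline{r^2}\delta_{ik})$, where
e is the total charge, and the bar denotes an average such as

$$\overline{x_ix_k} = \frac{1}{e}\oint x_ix_k\,\sigma\,df.$$

It is evident that the axes of the ellipsoid are also the principal axes of the tensor $D_{ik}$. Using formula (4.16) for σ,
and for the element of surface of the ellipsoid the expression

$$df = \frac{dx\,dy}{\nu_z} = \frac{c^2}{z}\sqrt{\frac{x^2}{a^4} + \frac{y^2}{b^4} + \frac{z^2}{c^4}}\;dx\,dy,$$

Fig. 10

where ν is a unit vector normal to the surface, we obtain

$$\overline{z^2} = \frac{c}{4\pi ab}\cdot 2\int\!\!\int z\,dx\,dy = \tfrac13c^2;$$

the integration over x and y covers twice the area of the cross-section of the ellipsoid by the xy-plane. Thus

$$D_{xx} = \tfrac13e(2a^2 - b^2 - c^2), \qquad D_{yy} = \tfrac13e(2b^2 - c^2 - a^2), \qquad D_{zz} = \tfrac13e(2c^2 - a^2 - b^2).$$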

Problem 3. Determine the distribution of charge on the surface of an uncharged conducting ellipsoid placed 
in a uniform external field. 


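Solution. According to formula (1.9) we have

$$\sigma = -\frac{1}{4\pi}\left[\frac{\partial\phi}{\partial n}\right]_{\xi=0} = -\frac{1}{4\pi}\left[\frac{1}{h_1}\frac{\partial\phi}{\partial\xi}\right]_{\xi=0};$$

by (4.5) the element of length along the normal to the surface of the ellipsoid is $h_1\,d\xi$. Substituting (4.24) and using
the fact that

$$\nu_x = \left[\frac{1}{h_1}\frac{\partial x}{\partial\xi}\right]_{\xi=0} = \frac{x}{2a^2h_1},$$

we have $\sigma = G\nu_x/4\pi n^{(x)}$ when the external field is in the x-direction. When the direction of the external field is
arbitrary this becomes

$$\sigma = \frac{1}{4\pi}\nu_i\,n^{-1}{}_{ik}G_k = \frac{1}{4\pi}\left[\frac{\nu_xG_x}{n^{(x)}} + \frac{\nu_yG_y}{n^{(y)}} + \frac{\nu_zG_z}{n^{(z)}}\right].$$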


Problem 4. The same as Problem 3, but for a plane circular disc with radius a lying parallel to the field.†
Determine also the dipole moment of the disc.

Solution. Let us regard the disc as the limit of a spheroid when the semiaxis c tends to zero. The
depolarizing factor along this axis (the z-axis) tends to 1, and those along the x and y axes tend to zero:
$n^{(z)} = 1 - \pi c/2a$, $n^{(x)} = n^{(y)} = \pi c/4a$, by (4.34). The component $\nu_x$ of the unit vector along the normal to the surface of
the spheroid tends to zero:

$$\nu_x = \frac{x}{a^2}\left[\frac{x^2 + y^2}{a^4} + \frac{z^2}{c^4}\right]^{-1/2} \cong \frac{xc^2}{a^2z} = \frac{xc}{a^2}\left(1 - \frac{\rho^2}{a^2}\right)^{-1/2}.$$

Hence the charge density is

$$\sigma = \frac{G\nu_x}{4\pi n^{(x)}} = \frac{G\rho\cos\phi}{\pi^2\sqrt{a^2 - \rho^2}},$$

where ρ and φ are polar coordinates in the plane of the disc.

† The problem for a disc lying perpendicular to the field is trivial: the field remains uniform in all space, and
charges $\sigma = \pm G/4\pi$ are induced on the two sides of the disc.

The dipole moment of the disc is obtained from formula (4.26), and is $\mathcal{P} = 4a^3G/3\pi$. Thus it is proportional to
$a^3$ and not to the "volume" $a^2c$ of the disc.

Problem 5. Determine the field potential outside an uncharged conducting spheroid with its axis of 
symmetry parallel to a uniform external field. 

Solution. For a prolate spheroid (a > b = c, with the field G in the x-direction) we find, on calculating the
integral in formula (4.24),

$$\phi = -Gx\left\{1 - \frac{\tanh^{-1}\sqrt{(a^2-b^2)/(\xi+a^2)} - \sqrt{(a^2-b^2)/(\xi+a^2)}}{\tanh^{-1}\sqrt{1 - b^2/a^2} - \sqrt{1 - b^2/a^2}}\right\}.$$

The coordinate ξ is related to x and $\rho = \sqrt{y^2 + z^2}$ by

$$\frac{x^2}{\xi + a^2} + \frac{\rho^2}{\xi + b^2} = 1,$$

with $0 \le \xi \le \infty$ in the space outside the ellipsoid.

For an oblate spheroid (a = b > c) the field G is along the z-axis. We must therefore replace $s + a^2$ by $s + c^2$ and
put $\phi_0 = -Gz$ in (4.24). Then

$$\phi = -Gz\left\{1 - \frac{\sqrt{(a^2-c^2)/(\xi+c^2)} - \tan^{-1}\sqrt{(a^2-c^2)/(\xi+c^2)}}{\sqrt{a^2/c^2 - 1} - \tan^{-1}\sqrt{a^2/c^2 - 1}}\right\},$$

where the coordinate ξ is related to z and $\rho = \sqrt{x^2 + y^2}$ by

$$\frac{\rho^2}{\xi + a^2} + \frac{z^2}{\xi + c^2} = 1.$$


Problem 6. The same as Problem 5, but with the axis of symmetry perpendicular to the external field.

Solution. For a prolate spheroid (with the field along the z-axis)

$$\phi = -Gz\left\{1 - \frac{\sqrt{\xi+a^2}/(\xi+b^2) - (a^2-b^2)^{-1/2}\tanh^{-1}\sqrt{(a^2-b^2)/(\xi+a^2)}}{a/b^2 - (a^2-b^2)^{-1/2}\tanh^{-1}\sqrt{1 - b^2/a^2}}\right\}.$$

For an oblate spheroid (with the field along the x-axis)

$$\phi = -Gx\left\{1 - \frac{(a^2-c^2)^{-1/2}\tan^{-1}\sqrt{(a^2-c^2)/(\xi+c^2)} - \sqrt{\xi+c^2}/(\xi+a^2)}{(a^2-c^2)^{-1/2}\tan^{-1}\sqrt{a^2/c^2 - 1} - c/a^2}\right\}.$$

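Problem 7. A uniform field $\mathfrak{E}$ in the $z$-direction (in the half-space $z < 0$) is bounded by an earthed conducting plane at $z = 0$, containing a circular aperture. Determine the field and charge distribution on the plane.

Solution. The $xy$-plane with a circular aperture of radius $a$ and centre at the origin may be regarded as the limit of the hyperboloids of revolution of one sheet

$$\frac{\rho^2}{a^2-|\eta|} - \frac{z^2}{|\eta|} = 1, \qquad \rho^2 = x^2+y^2,$$

as $|\eta| \to 0$. These hyperboloids are one of the families of coordinate surfaces in a system of oblate spheroidal coordinates with $c \to 0$. The Cartesian coordinate $z$, according to (4.9), is given in terms of $\xi$ and $\eta$ by $z = \sqrt{\xi|\eta|}/a$, where $\sqrt{\xi}$ must be taken with the positive and negative sign in the upper and lower half-space respectively.

Let us seek a solution in the form $\phi = -\mathfrak{E}zF(\xi)$. For the function $F(\xi)$ we obtain

$$F(\xi) = \text{constant}\times\int\frac{d\xi}{\xi^{3/2}(\xi+a^2)} = \text{constant}\times\left[\frac{a}{\sqrt{\xi}} - \tan^{-1}\frac{a}{\sqrt{\xi}}\right];$$

the constant of integration is put equal to zero in accordance with the condition $\phi = 0$ for $z \to +\infty$, i.e. $\sqrt{\xi} \to +\infty$. The inverse tangent of a negative quantity must be taken as $\tan^{-1}(a/\sqrt{\xi}) = \pi - \tan^{-1}(a/|\sqrt{\xi}|)$, and not as $-\tan^{-1}(a/|\sqrt{\xi}|)$, since the potential would then be discontinuous at the aperture ($\xi = 0$). The constant coefficient is chosen so that, for $z \to -\infty$ (i.e. for $\sqrt{\xi} \to -\infty$ and $\tan^{-1}(a/\sqrt{\xi}) \to \pi$), $\phi \to -\mathfrak{E}z$, and so we finally have

$$\phi = -\mathfrak{E}\frac{z}{\pi}\left[\tan^{-1}\frac{a}{\sqrt{\xi}} - \frac{a}{\sqrt{\xi}}\right].$$

At large distances $r = \sqrt{\rho^2+z^2}$ from the aperture in the upper half-space, $\xi \approx r^2$ and the potential is $\phi \approx \mathfrak{E}a^3z/3\pi r^3$, i.e. we have a dipole field, the moment of the dipole being $\mathcal{P} = \mathfrak{E}a^3/3\pi$.†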

† The field decreases as $1/r^3$ and therefore the flux of the field through an infinitely remote surface (in the half-space $z > 0$) is zero. This means that all the lines of force passing through the aperture reach the upper side of the conducting plane.

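The distribution of charge on the conducting plane is given by

$$\sigma = \mp\frac{1}{4\pi}\left[\frac{\partial\phi}{\partial z}\right]_{z=0} = \pm\frac{\mathfrak{E}}{4\pi^2}\left[\tan^{-1}\frac{a}{\sqrt{\xi}} - \frac{a}{\sqrt{\xi}}\right],$$

where the upper and lower signs refer to the upper and lower sides of the plane respectively. According to the formula

$$\frac{\rho^2}{\xi+a^2} + \frac{z^2}{\xi} = 1,$$

which relates $\xi$ to $\rho$, $z$, we have $\sqrt{\xi} = \pm\sqrt{\rho^2-a^2}$ on the plane $z = 0$. Thus the charge distribution on the lower side of the conducting plane is given by the formula

$$\sigma = -\frac{\mathfrak{E}}{4\pi^2}\left[\pi - \sin^{-1}\frac{a}{\rho} + \frac{a}{\sqrt{\rho^2-a^2}}\right].$$

As $\rho \to \infty$ we have $\sigma = -\mathfrak{E}/4\pi$, as we should expect. On the upper side

$$\sigma = -\frac{\mathfrak{E}}{4\pi^2}\left[\frac{a}{\sqrt{\rho^2-a^2}} - \sin^{-1}\frac{a}{\rho}\right].$$

The total induced charge on the upper side of the plane is finite:

$$e' = \int_a^\infty \sigma\cdot 2\pi\rho\,d\rho = -\tfrac{1}{8}a^2\mathfrak{E}.$$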
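The total charge on the upper side can be confirmed by direct quadrature. The sketch below assumes the upper-side density $\sigma = -(\mathfrak{E}/4\pi^2)[a/\sqrt{\rho^2-a^2} - \sin^{-1}(a/\rho)]$ and uses the substitution $\rho = a/\sin\theta$ (my choice, not from the text), which removes the integrable singularity at the rim and leaves $e' = -(\mathfrak{E}a^2/2\pi)\int_0^{\pi/2}(\sin\theta - \theta\cos\theta)\sin^{-3}\theta\,d\theta$. The function name is illustrative.

```python
import math

def upper_side_charge(a, field, n=2000):
    """Midpoint-rule estimate of the total charge induced on the upper side of the plane."""
    h = (math.pi / 2.0) / n
    total = 0.0
    for i in range(n):
        t = (i + 0.5) * h  # theta, with rho = a / sin(theta)
        total += (math.sin(t) - t * math.cos(t)) / math.sin(t) ** 3
    return -field * a * a / (2.0 * math.pi) * total * h
```

The integral equals $\pi/4$, so the estimate should agree with $-a^2\mathfrak{E}/8$.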


Problem 8. The same as Problem 7, but for a plane with a slit of width 2b. 

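Solution. The $xy$-plane with a slit along the $x$-axis may be regarded as the limit of the hyperbolic cylinders

$$\frac{y^2}{b^2-|\eta|} - \frac{z^2}{|\eta|} = 1$$

as $|\eta| \to 0$. These hyperbolic cylinders are one of the families of coordinate surfaces in a system of ellipsoidal coordinates with $a \to \infty$, $c \to 0$. The Cartesian coordinate $z$ is given by $z = \sqrt{\xi|\eta|}/b$.

As in Problem 7, we seek a solution in the form $\phi = -\mathfrak{E}zF(\xi)$, obtaining for the function $F$

$$F = \text{constant}\times\int\frac{d\xi}{\xi^{3/2}(\xi+b^2)^{1/2}} = \tfrac{1}{2}\left[1 \mp \sqrt{\frac{\xi+b^2}{\xi}}\right].$$

Here the coefficient and the constant of integration are chosen so that $F = 0$ and $F = 1$ for $z \to +\infty$ and $z \to -\infty$ respectively; we now take $\sqrt{\xi}$ positive, and the two signs $\mp$ correspond to the regions $z > 0$ and $z < 0$.

At large distances in the upper half-space, $\xi \approx y^2+z^2 = r^2$ and the potential is $\phi \approx \mathfrak{E}b^2z/4r^2$, i.e. the field of a two-dimensional dipole of moment $\mathfrak{E}b^2/8$ per unit length of the slit (see the formula in §3, Problem 2).

The distribution of charge on the conducting plane is given by

$$\sigma = -\frac{\mathfrak{E}}{8\pi}\left[\frac{|y|}{\sqrt{y^2-b^2}} \mp 1\right],$$

with the upper and lower signs for the upper and lower sides of the plane. The total induced charge on the upper side of the plane, per unit length of the slit, is

$$e' = 2\int_b^\infty \sigma\,dy = -\mathfrak{E}b/4\pi.$$

Near the edge of the slit we can take $\xi \to 0$ in the expression for $\phi(\xi, \eta)$, obtaining $\phi \approx \tfrac{1}{2}\mathfrak{E}\sqrt{|\eta|}$, with

$$\eta = -2b\rho\sin^2\tfrac{1}{2}\theta,$$

where $\rho$ and $\theta$ are polar coordinates in the $yz$-plane, measured from the slit edge ($y = b + \rho\cos\theta$, $z = \rho\sin\theta$). Then

$$\phi = \mathfrak{E}\sqrt{\tfrac{1}{2}b\rho}\,\sin\tfrac{1}{2}\theta,$$

in agreement with the result in §3, Problem 3, for the limiting case of a very sharp edge.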
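The slit potential lends itself to a quick numerical sanity check. The sketch below assumes the closed form $\phi = \tfrac{1}{2}\mathfrak{E}\,[\sqrt{|\eta|(\xi+b^2)}/b - z]$, which is my own rearrangement of $\phi = -\mathfrak{E}zF(\xi)$ using $z = \sqrt{\xi|\eta|}/b$ and is not quoted from the text. Here $\xi \ge 0$ and $\eta \le 0$ are taken as the two roots $s$ of $y^2/(b^2+s) + z^2/s = 1$. The function name is illustrative.

```python
import math

def slit_potential(y, z, b, field):
    """Potential for an earthed plane z = 0 with a slit |y| < b; uniform field below the plane."""
    B = b * b - y * y - z * z
    root = math.sqrt(B * B + 4.0 * z * z * b * b)
    xi = 0.5 * (-B + root)   # non-negative root
    eta = 0.5 * (-B - root)  # non-positive root
    return 0.5 * field * (math.sqrt(max(0.0, -eta) * (xi + b * b)) / b - z)
```

With this form the potential vanishes on the conductor ($z = 0$, $|y| > b$), tends to $-\mathfrak{E}z$ far below the plane, and close to the edge agrees with $\mathfrak{E}\sqrt{b\rho/2}\,\sin(\theta/2)$ up to corrections of relative order $\sqrt{\rho/b}$.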


§5. The forces on a conductor 

In an electric field certain forces act on the surface of a conductor. These forces are easily 
calculated as follows. 

The momentum flux density in an electric field in a vacuum is given by the Maxwell stress tensor:†

$$-\sigma_{ik} = \frac{1}{4\pi}\left(\tfrac{1}{2}E^2\delta_{ik} - E_iE_k\right).$$

The force on an element $d\mathbf{f}$ of the surface of the body is just the flux of momentum through it from outside, and is therefore $\sigma_{ik}\,df_k = \sigma_{ik}n_k\,df$ (the sign is changed because the normal vector $\mathbf{n}$ is outwards and not inwards). The quantity $\sigma_{ik}n_k$ is thus the force $\mathbf{F}_s$ per unit area of the surface. Since, at the surface of a conductor, the field $\mathbf{E}$ has no tangential component, we obtain

$$\mathbf{F}_s = \mathbf{n}E^2/8\pi, \qquad (5.1)$$

or, introducing the surface charge density $\sigma$,

$$\mathbf{F}_s = 2\pi\sigma^2\mathbf{n} = \tfrac{1}{2}\sigma\mathbf{E}.$$

We therefore conclude that a “negative pressure” acts on the surface of a conductor; it is directed along the outward normal to the surface, and its magnitude is equal to the energy density in the field.

The total force $\mathbf{F}$ on the conductor is obtained by integrating the force (5.1) over the whole surface:‡

$$\mathbf{F} = \oint (E^2/8\pi)\,d\mathbf{f}. \qquad (5.2)$$

† The stress tensor $\sigma_{ik}$ is usually defined with the opposite sign to the momentum flux density tensor. This definition has been used elsewhere in the Course of Theoretical Physics, but by an oversight the definition of $\sigma_{ik}$ in Fields, §33, had the other sign.

‡ In the present case we are applying this formula to a surface which does not precisely coincide with that of the body, but is some distance away, in order to exclude the effect of the field structure near the surface (see §1).
Usually, however, it is more convenient to calculate this quantity from the general laws of mechanics, by differentiating the energy $\mathcal{U}$. The force, in the direction of a coordinate $q$, acting on a conductor is $-\partial\mathcal{U}/\partial q$, where the derivative signifies the rate of change of energy when the body is translated in the $q$-direction. The energy must be expressed in terms of the charges on the conductors (which give rise to the field), and the differentiation is performed with the charges constant. Denoting this by the suffix $e$, we write

$$F_q = -(\partial\mathcal{U}/\partial q)_e. \qquad (5.3)$$

Similarly, the projection, on any axis, of the total moment of the forces on the conductor is

$$K = -(\partial\mathcal{U}/\partial\psi)_e, \qquad (5.4)$$

where $\psi$ is the angle of rotation of the body about that axis.

If, however, the energy is expressed as a function of the potentials of the conductors, and not of their charges, the calculation of the forces from the energy requires special consideration. The reason is that, to maintain constant the potential of a moving conductor, it is necessary to use other bodies. For example, the potential of a conductor can be kept constant by connecting it to another conductor of very large capacitance, a “charge reservoir”. On receiving a charge $e_a$, the conductor takes it from the reservoir, whose potential $\phi_a$ is unchanged on account of its large capacitance, although its energy is reduced by $e_a\phi_a$. When the whole system of conductors receives charges $e_a$, the energy of the reservoirs connected to them changes by a total of $-\sum e_a\phi_a$. Only the energy of the conductors, and not that of the reservoirs, appears in $\mathcal{U}$. In this sense we can say that $\mathcal{U}$ pertains to a system which is not energetically closed. Thus, for a system of conductors whose potentials are kept constant, the part of the mechanical energy is played not by $\mathcal{U}$ but by

$$\tilde{\mathcal{U}} = \mathcal{U} - \sum_a e_a\phi_a. \qquad (5.5)$$

Substituting (2.2), we find that $\mathcal{U}$ and $\tilde{\mathcal{U}}$ differ only in sign:

$$\tilde{\mathcal{U}} = -\mathcal{U}. \qquad (5.6)$$

The force $F_q$ is obtained by differentiating $\tilde{\mathcal{U}}$ with respect to $q$ for constant potentials, i.e.

$$F_q = -(\partial\tilde{\mathcal{U}}/\partial q)_\phi = (\partial\mathcal{U}/\partial q)_\phi. \qquad (5.7)$$

Thus the forces acting on a conductor can be obtained by differentiating $\mathcal{U}$ either for constant charges or for constant potentials, the only difference being that the derivative must be taken with the minus sign in the first case and with the plus sign in the second.

The same result could be obtained more formally by starting from the differential identity

$$d\mathcal{U} = \sum_a \phi_a\,de_a - F_q\,dq, \qquad (5.8)$$

in which $\mathcal{U}$ is regarded as a function of the charges on the conductors and the coordinate $q$. This identity states that $\partial\mathcal{U}/\partial e_a = \phi_a$ and $\partial\mathcal{U}/\partial q = -F_q$. Using the variables $\phi_a$ instead of $e_a$, we have

$$d\tilde{\mathcal{U}} = -\sum_a e_a\,d\phi_a - F_q\,dq, \qquad (5.9)$$

which gives (5.7).
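The sign rule can be illustrated on a simple case. The sketch below is an assumption-laden example rather than anything taken from the text: it uses a parallel-plate capacitor with the standard Gaussian capacitance $C = S/4\pi q$ ($S$ the plate area, $q$ the separation), and compares the force obtained from $-(\partial\mathcal{U}/\partial q)_e$ with $\mathcal{U} = e^2/2C$ against $+(\partial\mathcal{U}/\partial q)_\phi$ with $\mathcal{U} = \tfrac12 C\phi^2$, using central differences. All function names are illustrative.

```python
import math

def capacitance(q, area=1.0):
    """Parallel-plate capacitance in Gaussian units (edge effects neglected)."""
    return area / (4.0 * math.pi * q)

def energy_from_charge(e, q):
    return e * e / (2.0 * capacitance(q))

def energy_from_potential(phi, q):
    return 0.5 * capacitance(q) * phi * phi

def d_dq(f, q, h=1e-6):
    """Central-difference derivative with respect to the separation q."""
    return (f(q + h) - f(q - h)) / (2.0 * h)

def force_constant_charge(e, q):
    # Minus sign: energy differentiated at constant charge.
    return -d_dq(lambda s: energy_from_charge(e, s), q)

def force_constant_potential(phi, q):
    # Plus sign: energy differentiated at constant potential.
    return d_dq(lambda s: energy_from_potential(phi, s), q)
```

For the same physical state ($e = C\phi$) both routes should give the same attractive force, $-2\pi e^2/S$.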
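At the end of §2 we have discussed the energy of a conductor in a uniform external electric field. The total force on an uncharged conductor in a uniform field is, of course, zero. The expression for the energy (2.14) can, however, be used to determine the force acting on a conductor in a quasi-uniform field $\boldsymbol{\mathfrak{E}}$, i.e. a field which varies only slightly over the dimensions of the conductor. In such a field the energy can still be calculated, to a first approximation, from formula (2.14), and the force $\mathbf{F}$ is the gradient of this energy:

$$\mathbf{F} = -\operatorname{grad}\mathcal{U} = \tfrac{1}{2}\alpha_{ik}V\operatorname{grad}(\mathfrak{E}_i\mathfrak{E}_k). \qquad (5.10)$$

The total torque $\mathbf{K}$ is in general non-zero even in a uniform external field. By the general laws of mechanics $\mathbf{K}$ can be determined by considering an infinitesimal virtual rotation of the body. The change in energy in such a rotation is related to $\mathbf{K}$ by $\delta\mathcal{U} = -\mathbf{K}\cdot\delta\boldsymbol{\psi}$, $\delta\boldsymbol{\psi}$ being the angle of the rotation. A rotation through an angle $\delta\boldsymbol{\psi}$ in a uniform field is equivalent to a rotation of the field through an angle $-\delta\boldsymbol{\psi}$ relative to the body. The change in the field is $\delta\boldsymbol{\mathfrak{E}} = -\delta\boldsymbol{\psi}\times\boldsymbol{\mathfrak{E}}$, and the change in energy is

$$\delta\mathcal{U} = (\partial\mathcal{U}/\partial\boldsymbol{\mathfrak{E}})\cdot\delta\boldsymbol{\mathfrak{E}} = -\delta\boldsymbol{\psi}\cdot\boldsymbol{\mathfrak{E}}\times\partial\mathcal{U}/\partial\boldsymbol{\mathfrak{E}}.$$

But $\partial\mathcal{U}/\partial\boldsymbol{\mathfrak{E}} = -\boldsymbol{\mathcal{P}}$, as we see from a comparison of formulae (2.13) and (2.14). Hence $\delta\mathcal{U} = -\boldsymbol{\mathcal{P}}\times\boldsymbol{\mathfrak{E}}\cdot\delta\boldsymbol{\psi}$, whence

$$\mathbf{K} = \boldsymbol{\mathcal{P}}\times\boldsymbol{\mathfrak{E}}, \qquad (5.11)$$

in accordance with the usual expression given by the theory of fields in a vacuum.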
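The virtual-rotation argument can be mirrored numerically in two dimensions. This sketch assumes that formulae (2.13) and (2.14) have the forms $\mathcal{P}_i = V\alpha_{ik}\mathfrak{E}_k$ and $\mathcal{U} = -\tfrac12 V\alpha_{ik}\mathfrak{E}_i\mathfrak{E}_k$, takes the polarizability tensor diagonal in the body axes with principal values `alpha1`, `alpha2` (hypothetical names), and rotates the body by an angle `psi` about the $z$-axis in a field making an angle `beta` with the laboratory $x$-axis.

```python
import math

def energy(psi, alpha1, alpha2, volume, field, beta):
    """Energy of the conductor when the body is rotated by psi about the z-axis."""
    g = beta - psi  # angle of the field relative to the body axes
    return -0.5 * volume * field ** 2 * (alpha1 * math.cos(g) ** 2 + alpha2 * math.sin(g) ** 2)

def torque_from_energy(psi, alpha1, alpha2, volume, field, beta, h=1e-6):
    """K_z = -dU/dpsi, estimated by a central difference."""
    up = energy(psi + h, alpha1, alpha2, volume, field, beta)
    down = energy(psi - h, alpha1, alpha2, volume, field, beta)
    return -(up - down) / (2.0 * h)

def torque_from_moment(psi, alpha1, alpha2, volume, field, beta):
    """z-component of P x E, evaluated in the body axes."""
    g = beta - psi
    ex, ey = field * math.cos(g), field * math.sin(g)
    px, py = volume * alpha1 * ex, volume * alpha2 * ey
    return px * ey - py * ex
```

Both routes should give $\tfrac12 V\mathfrak{E}^2(\alpha_1-\alpha_2)\sin 2(\beta-\psi)$, which vanishes for an isotropic polarizability.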

If the total force and torque on a conductor are zero, the conductor remains at rest in the 
field, and effects involving the deformation of the body (called electrostriction) become 
important. The forces (5.1) on the surface of the conductor result in changes in its shape 
and volume. Because the force is an extending one, the volume of the body increases. A 
complete determination of the deformation requires a solution of the equations of the 
theory of elasticity, with the given distribution of forces (5.1) on the surface of the body. If, 
however, we are interested only in the change in volume, the problem can be solved very 
simply. 

To do so, we must bear in mind that, if the deformation is slight (as in fact is true for 
electrostriction), the effect of the change of shape on the change of volume is of the second 
order of smallness. In the first approximation, therefore, the change in volume can be 
regarded as the result of deformation without change in shape, i.e. as a volume expansion 
under the action of some effective excess pressure $\Delta P$ which is uniformly distributed over the surface of the body and replaces the exact distribution given by (5.1). The relative change in volume is obtained by multiplying $\Delta P$ by the coefficient of uniform expansion of the substance. The pressure $\Delta P$ is given, according to a well-known formula, by the derivative of the electric energy $\mathcal{U}$ of the body with respect to its volume: $\Delta P = -\partial\mathcal{U}/\partial V$.†

Let the deforming field be due to the charged conductor itself. Then the energy $\mathcal{U} = \tfrac{1}{2}e^2/C$, and the pressure is $\Delta P = -\tfrac{1}{2}e^2\,\partial C^{-1}/\partial V$. For a given shape, the capacitance of the body (having the dimensions of length) is proportional to the linear dimension, i.e. to $V^{1/3}$. Hence

$$\Delta P = e^2/6CV = e\phi/6V. \qquad (5.12)$$

† The quantity thus determined is the pressure exerted on the surface by the body itself; the pressure acting on the surface from outside is obtained by changing the sign.

If an uncharged conductor is situated in a uniform external field $\boldsymbol{\mathfrak{E}}$, its energy is given by formula (2.14). The extending pressure is therefore

$$\Delta P = \tfrac{1}{2}\alpha_{ik}\mathfrak{E}_i\mathfrak{E}_k. \qquad (5.13)$$


Problem 1. A small conductor with capacitance $c$ (equal in order of magnitude to its dimension) is at a distance $r$ from the centre of a spherical conductor with large radius $a$ ($\gg c$). The distance $r - a$ from the conductor to the surface of the sphere is supposed large compared with $c$, but not large compared with $a$. The two conductors are joined by a thin wire, so that they are at the same potential $\phi$. Determine the force of their mutual repulsion.

Solution. Since the conductor $c$ is small, we can suppose that its potential is the sum of the potential $\phi a/r$ at a distance $r$ from the centre of the large sphere and the potential $e/c$ due to the charge $e$ on the conductor itself. Hence $\phi = \phi a/r + e/c$, or $e = c\phi(1 - a/r)$. The required force of interaction $F$ is the Coulomb repulsion between the charge $e$ on the conductor and the charge $a\phi$ on the sphere:

$$F = \frac{ac\phi^2}{r^2}\left(1 - \frac{a}{r}\right).$$

This expression is correct to within terms of higher order in $c$. The force is greatest when $r = 3a/2$, and its value there is $F_{\max} = 4c\phi^2/27a$, decreasing on either side of this distance.
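The position and size of the maximum are easy to confirm by a brute-force scan. The sketch assumes the force is the Coulomb repulsion between the charges $c\phi(1 - a/r)$ and $a\phi$, i.e. $F = ac\phi^2(1 - a/r)/r^2$; the function names and the scan range $a < r \le 3a$ are illustrative choices.

```python
def repulsion(r, a, c, phi):
    """Coulomb repulsion between the small conductor and the sphere (Gaussian units)."""
    return a * c * phi ** 2 * (1.0 - a / r) / r ** 2

def strongest_repulsion(a, c, phi, steps=20000):
    """Scan a < r <= 3a on a uniform grid and return (r, F) at the largest force found."""
    best_r, best_f = a, 0.0
    for i in range(1, steps + 1):
        r = a * (1.0 + 2.0 * i / steps)
        f = repulsion(r, a, c, phi)
        if f > best_f:
            best_r, best_f = r, f
    return best_r, best_f
```

For any choice of $a$, $c$ and $\phi$ the scan should return $r \approx 3a/2$ and $F \approx 4c\phi^2/27a$.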


Problem 2. A charged conducting sphere is cut in half. Determine the force of repulsion between the hemispheres.†

Solution. We imagine the hemispheres separated by an infinitely narrow slit, and determine the force $F$ on each of them by integrating over the surface the force $(E^2/8\pi)\cos\theta$, which is the component of (5.1) in a direction perpendicular to the plane of separation of the hemispheres. In the slit $E = 0$, and on the outer surface $E = e/a^2$, where $a$ is the radius of the sphere and $e$ the total charge on it. The result is $F = e^2/8a^2$.

Problem 3. The same as Problem 2, but for an uncharged sphere in a uniform external field $\boldsymbol{\mathfrak{E}}$ perpendicular to the plane of separation.

Solution. As in Problem 2, except that the field on the surface of the sphere is $E = 3\mathfrak{E}\cos\theta$ (§3, Problem 1). The required force is $F = 9a^2\mathfrak{E}^2/16$.
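The surface integrals in Problems 2 and 3 can be evaluated numerically in the same way. The sketch integrates $(E^2/8\pi)\cos\theta$ over one hemisphere with the area element $2\pi a^2\sin\theta\,d\theta$; the surface field is passed in as a function of the polar angle. The function name is illustrative.

```python
import math

def hemisphere_force(surface_field, a, n=20000):
    """Midpoint-rule integral of (E^2 / 8 pi) cos(theta) over a hemisphere of radius a."""
    h = (math.pi / 2.0) / n
    total = 0.0
    for i in range(n):
        t = (i + 0.5) * h
        e_field = surface_field(t)
        total += e_field ** 2 / (8.0 * math.pi) * math.cos(t) * 2.0 * math.pi * a * a * math.sin(t)
    return total * h
```

With $E = e/a^2$ this should reproduce $e^2/8a^2$, and with $E = 3\mathfrak{E}\cos\theta$ it should reproduce $9a^2\mathfrak{E}^2/16$.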

Problem 4. Determine the change in volume and in shape of a conducting sphere in a uniform external electric field.

Solution. The change in volume $\Delta V/V = \Delta P/K$, where $K$ is the modulus of volume expansion of the material, and $\Delta P$ is given by formula (5.13). For a sphere, $\alpha_{ik} = (3/4\pi)\delta_{ik}$ (§3, Problem 1), so that $\Delta V/V = 3\mathfrak{E}^2/8\pi K$.

As a result of the deformation, the sphere is changed into a prolate spheroid. To determine the eccentricity, we may regard the deformation as a uniform pure shear in the volume of the body, just as, to determine the change in the total volume, we regarded it as a uniform volume expansion.

The condition of equilibrium for a deformed body may be formulated as requiring that the sum of the electrostatic and elastic energies should be a minimum. The former is, by (2.12) and (4.20),

$$\mathcal{U}_{\rm el} = -\frac{V\mathfrak{E}^2}{8\pi n} \approx -\frac{3V\mathfrak{E}^2}{8\pi} - \frac{3V}{10\pi}\frac{a-b}{R}\mathfrak{E}^2,$$

where $R$ is the original radius of the sphere, $a$ and $b$ the semiaxes of the spheroid, and $n \approx \tfrac{1}{3} - 4(a-b)/15R$ is the depolarizing factor along the axis of symmetry. Taking the $x$-axis along the field, only the components $u_{xx}$ and $u_{yy} = u_{zz}$ of the strain tensor are non-zero. Since we are considering equilibrium with respect to a change in shape, we may regard the volume as unchanged, i.e. $u_{ii} = 0$. Hence the elastic energy may be written

$$\mathcal{U}_{\rm elas} = \tfrac{1}{2}\sigma_{ik}u_{ik}V = \tfrac{1}{3}(\sigma_{xx}-\sigma_{yy})(u_{xx}-u_{yy})V,$$




† In Problems 2 and 3 we assume that the hemispheres are at the same potential.
where $\sigma_{ik}$ is the elastic stress tensor (TE, §4). We have $\sigma_{xx} - \sigma_{yy} = 2\mu(u_{xx} - u_{yy})$, where $\mu$ is the modulus of rigidity of the material, and $u_{xx} - u_{yy} = (a-b)/R$. Hence

$$\mathcal{U}_{\rm elas} = \tfrac{2}{3}\mu(a-b)^2V/R^2.$$

Making the sum $\mathcal{U}_{\rm el} + \mathcal{U}_{\rm elas}$ a minimum, we have $(a-b)/R = 9\mathfrak{E}^2/40\pi\mu$.
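The minimization can be repeated numerically. The sketch below assumes the energy per unit volume, as a function of $x = (a-b)/R$, to be $-3\mathfrak{E}^2/8\pi - 3\mathfrak{E}^2x/10\pi + \tfrac23\mu x^2$ (the electrostatic energy expanded to first order in $x$, plus the shear energy), and locates the minimum by a ternary search. The function names and the search bracket are illustrative.

```python
import math

def energy_per_volume(x, field, mu):
    """Electrostatic plus elastic energy per unit volume, x = (a - b)/R."""
    electrostatic = -3.0 * field ** 2 / (8.0 * math.pi) - 3.0 * field ** 2 * x / (10.0 * math.pi)
    elastic = 2.0 * mu * x * x / 3.0
    return electrostatic + elastic

def equilibrium_strain(field, mu, lo=0.0, hi=1.0, iterations=200):
    """Ternary search for the minimum of the (convex) total energy."""
    for _ in range(iterations):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if energy_per_volume(m1, field, mu) < energy_per_volume(m2, field, mu):
            hi = m2
        else:
            lo = m1
    return 0.5 * (lo + hi)
```

The search should return $x = 9\mathfrak{E}^2/40\pi\mu$ to within the resolution allowed by the flatness of the minimum.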

Problem 5. Find the relation between frequency and wavelength for waves propagated on a charged plane surface of a liquid conductor (in a gravitational field). Obtain the condition for this surface to be stable (Ya. I. Frenkel', 1935).

Solution. Let the wave be propagated along the $x$-axis, with the $z$-axis vertically upwards. The vertical displacement of points on the surface of the liquid is $\zeta = ae^{i(kx-\omega t)}$. When the surface is at rest, the field above it is $E_z = E = 4\pi\sigma_0$, and its potential $\phi = -4\pi\sigma_0z$, where $\sigma_0$ is the surface charge density. The potential of the field above the oscillating surface can be written as $\phi = -4\pi\sigma_0z + \phi_1$, with $\phi_1 = \text{constant}\times e^{i(kx-\omega t)}e^{-kz}$, $\phi_1$ being a small correction which satisfies the equation $\Delta\phi_1 = 0$ and vanishes for $z \to \infty$. On the surface itself the potential must have a constant value, which we take to be zero, and so $\phi_1 = 4\pi\sigma_0\zeta$ for $z = 0$.

According to (5.1), an additional negative pressure acts on the charged surface of the liquid; this pressure is, as far as terms of the first order in $\phi_1$, $E^2/8\pi \approx E_z^2/8\pi = 2\pi\sigma_0^2 + [k\sigma_0\phi_1]_{z=0} = 2\pi\sigma_0^2 + 4\pi\sigma_0^2k\zeta$. The constant term $2\pi\sigma_0^2$ is of no importance, since it can be included in the constant external pressure.

The consideration of the hydrodynamical motion in the wave is entirely analogous to the theory of capillary waves (FM, §61), differing only by the presence of the additional pressure mentioned above. At the surface of the liquid we have the boundary condition $\rho g\zeta + \rho[\partial\Phi/\partial t]_{z=0} - \alpha\,\partial^2\zeta/\partial x^2 - 4\pi\sigma_0^2k\zeta = 0$, where $\alpha$ is the surface-tension coefficient, $\rho$ the density of the liquid, and $\Phi$ its velocity potential. $\Phi$ and $\zeta$ are also related by $\partial\zeta/\partial t = [\partial\Phi/\partial z]_{z=0}$. Substituting in these two relations $\zeta = ae^{i(kx-\omega t)}$ and $\Phi = Ae^{i(kx-\omega t)}e^{kz}$ ($\Phi$ satisfies the equation $\Delta\Phi = 0$) and eliminating $a$ and $A$, we find the required relation between $k$ and $\omega$:

$$\omega^2 = k(g\rho - 4\pi\sigma_0^2k + \alpha k^2)/\rho. \qquad (1)$$

If the surface of the liquid is to be stable, the frequency $\omega$ must be real for all values of $k$ (since otherwise there would be complex $\omega$ with a positive imaginary part, and the factor $e^{-i\omega t}$ would increase indefinitely). The condition for the right-hand side of (1) to be positive is $(4\pi\sigma_0^2)^2 - 4g\rho\alpha < 0$, or $\sigma_0^4 < g\rho\alpha/4\pi^2$. This is the condition for stability.
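The stability threshold can be explored numerically. The sketch assumes the dispersion relation $\omega^2 = k(g\rho - 4\pi\sigma_0^2k + \alpha k^2)/\rho$ and the critical density $\sigma_0^4 = g\rho\alpha/4\pi^2$; the function names are illustrative, and the numbers used in any call (for example CGS values of the order of those for water) are for illustration only and are not from the text.

```python
import math

def omega_squared(k, sigma0, g, rho, alpha):
    """Right-hand side of the dispersion relation for waves on a charged liquid surface."""
    return k * (g * rho - 4.0 * math.pi * sigma0 ** 2 * k + alpha * k ** 2) / rho

def critical_sigma0(g, rho, alpha):
    """Surface charge density at which the surface first becomes unstable."""
    return (g * rho * alpha / (4.0 * math.pi ** 2)) ** 0.25

def most_dangerous_k(sigma0, alpha):
    """Wave number at which the bracket g*rho - 4*pi*sigma0^2*k + alpha*k^2 is smallest."""
    return 2.0 * math.pi * sigma0 ** 2 / alpha
```

Just below the critical charge density $\omega^2$ should be positive for every $k$; just above it, $\omega^2$ should become negative near $k = 2\pi\sigma_0^2/\alpha$, which at threshold equals the capillary wave number $\sqrt{g\rho/\alpha}$.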

Problem 6. Find the condition of stability for a charged spherical drop (Rayleigh, 1882).

Solution. The sum of the electrostatic and surface energies of the drop is $\mathcal{U} = e^2/2C + \alpha S$, where $\alpha$ is the surface-tension coefficient of the liquid, $C$ the capacitance of the drop and $S$ its surface area. Instability occurs (with increasing $e$) with respect to deformation of the sphere into a spheroid, and does so when $\mathcal{U}$ becomes a decreasing function of the eccentricity (for a given volume). The spherical shape always corresponds to an extremum of $\mathcal{U}$; the stability condition is therefore $[\partial^2\mathcal{U}/\partial(a-b)^2]_{a=b} > 0$, where $a$ and $b$ are the semiaxes of the spheroid, and the differentiation is carried out with $ab^2 = $ constant. Using the formula for the surface of a spheroid and (4.18) for its capacitance, we find after a somewhat lengthy calculation $e^2 < 16\pi a^3\alpha$.

This condition ensures stability of the drop with respect to small deformations. It is found to be weaker than the condition for stability with respect to large deformations that divide the drop into two equal drops with charge $\tfrac{1}{2}e$ and radius $a/2^{1/3}$:

$$e^2 < 16\pi a^3\alpha\,(2^{1/3} - 1)/(2 - 2^{1/3}) = 0.35\times 16\pi a^3\alpha.$$
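The numerical factor in the large-deformation condition follows from comparing the energy of one drop with that of two equal, widely separated drops. A minimal sketch, assuming each drop is a sphere with capacitance equal to its radius and neglecting the interaction between the two daughter drops; the function names are illustrative.

```python
import math

def energy_one_drop(e, a, alpha):
    """Electrostatic plus surface energy of a single spherical drop of radius a."""
    return e * e / (2.0 * a) + 4.0 * math.pi * a * a * alpha

def energy_two_drops(e, a, alpha):
    """Energy of two equal drops (charge e/2, radius a / 2^(1/3)) far apart."""
    r = a / 2.0 ** (1.0 / 3.0)
    return 2.0 * ((0.5 * e) ** 2 / (2.0 * r) + 4.0 * math.pi * r * r * alpha)

def fission_factor():
    """Ratio of the critical e^2 for division into two drops to 16 pi a^3 alpha."""
    return (2.0 ** (1.0 / 3.0) - 1.0) / (2.0 - 2.0 ** (1.0 / 3.0))
```

At $e^2 = $ `fission_factor()` $\times\,16\pi a^3\alpha$ the two energies should coincide, and the factor should round to 0.35.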
CHAPTER II


ELECTROSTATICS OF DIELECTRICS


§6. The electric field in dielectrics 

WE SHALL now go on to consider a static electric field in another class of substances, namely dielectrics. The fundamental property of dielectrics is that a steady current cannot flow in them. Hence the static electric field need not be zero, as in conductors, and we have to derive the equations which describe this field. One equation is obtained by averaging equation (1.3), and is again

$$\operatorname{curl}\mathbf{E} = 0. \qquad (6.1)$$

A second equation is obtained by averaging the equation $\operatorname{div}\mathbf{e} = 4\pi\rho$:

$$\operatorname{div}\mathbf{E} = 4\pi\bar\rho. \qquad (6.2)$$

Let us suppose that no charges are brought into the dielectric from outside, which is the most usual and important case. Then the total charge in the volume of the dielectric is zero; even if it is placed in an electric field we have $\int\bar\rho\,dV = 0$. This integral equation, which must be valid for a body of any shape, means that the average charge density can be written as the divergence of a certain vector, which is usually denoted by $-\mathbf{P}$:

$$\bar\rho = -\operatorname{div}\mathbf{P}, \qquad (6.3)$$

while outside the body $\mathbf{P} = 0$. For, on integrating over the volume bounded by a surface which encloses the body but nowhere enters it, we find $\int\bar\rho\,dV = -\int\operatorname{div}\mathbf{P}\,dV = -\oint\mathbf{P}\cdot d\mathbf{f} = 0$. $\mathbf{P}$ is called the dielectric polarization, or simply the polarization, of the body. A dielectric in which $\mathbf{P}$ differs from zero is said to be polarized. The vector $\mathbf{P}$ determines not only the volume charge density (6.3), but also the density $\sigma$ of the charges on the surface of the polarized dielectric. If we integrate formula (6.3) over an element of volume lying between two neighbouring unit areas, one on each side of the dielectric surface, we have, since $\mathbf{P} = 0$ on the outer area (cf. the derivation of formula (1.9)),

$$\sigma = P_n, \qquad (6.4)$$

where $P_n$ is the component of the vector $\mathbf{P}$ along the outward normal to the surface.

To see the physical significance of the quantity $\mathbf{P}$ itself, let us consider the total dipole moment of all the charges within the dielectric; unlike the total charge, the total dipole moment need not be zero. By definition, it is the integral $\int\mathbf{r}\bar\rho\,dV$. Substituting $\bar\rho$ from (6.3) and again integrating over a volume which includes the whole body, we have

$$\int\mathbf{r}\bar\rho\,dV = -\int\mathbf{r}\operatorname{div}\mathbf{P}\,dV = -\oint\mathbf{r}(d\mathbf{f}\cdot\mathbf{P}) + \int(\mathbf{P}\cdot\operatorname{grad})\mathbf{r}\,dV.$$

The integral over the surface is zero, and in the second term we have $(\mathbf{P}\cdot\operatorname{grad})\mathbf{r} = \mathbf{P}$, so that

$$\int\mathbf{r}\bar\rho\,dV = \int\mathbf{P}\,dV. \qquad (6.5)$$
Thus the polarization vector is the dipole moment (or electric moment) per unit volume of the dielectric.†

Substituting (6.3) in (6.2), we obtain the second equation of the electrostatic field in the form

$$\operatorname{div}\mathbf{D} = 0, \qquad (6.6)$$

where we have introduced a quantity $\mathbf{D}$ defined by

$$\mathbf{D} = \mathbf{E} + 4\pi\mathbf{P}, \qquad (6.7)$$

called the electric induction. The equation (6.6) has been derived by averaging the density of charges in the dielectric. If, however, charges not belonging to the dielectric are brought in from outside (we shall call these extraneous charges), then their density must be added to the right-hand side of equation (6.6):

$$\operatorname{div}\mathbf{D} = 4\pi\rho_{\rm ex}. \qquad (6.8)$$

On the surface of separation between two different dielectrics, certain boundary conditions must be satisfied. One of these follows from the equation $\operatorname{curl}\mathbf{E} = 0$. If the surface of separation is uniform as regards physical properties,‡ this condition requires the continuity of the tangential component of the field:

$$\mathbf{E}_{t1} = \mathbf{E}_{t2}; \qquad (6.9)$$

cf. the derivation of the condition (1.7). The second condition follows from the equation $\operatorname{div}\mathbf{D} = 0$, and requires the continuity of the normal component of the induction:

$$D_{n1} = D_{n2}. \qquad (6.10)$$

For a discontinuity in the normal component $D_n = D_z$ would involve an infinity of the derivative $\partial D_z/\partial z$, and therefore of $\operatorname{div}\mathbf{D}$.

At a boundary between a dielectric and a conductor, $\mathbf{E}_t = 0$, and the condition on the normal component is obtained from (6.8):

$$\mathbf{E}_t = 0, \qquad D_n = 4\pi\sigma_{\rm ex}, \qquad (6.11)$$

where $\sigma_{\rm ex}$ is the charge density on the surface of the conductor; cf. (1.8), (1.9).

§7. The permittivity 

In order that equations (6.1) and (6.6) should form a complete set of equations 
determining the electrostatic field, they must be supplemented by a relation between the 
induction D and the field E. In the great majority of cases this relation may be supposed 
linear. It corresponds to the first terms in an expansion of D in powers of E, and its
correctness is due to the smallness of the external electric fields in comparison with the 
internal molecular fields. 

The linear relation between D and E is especially simple in the most important case, that 


† It should be noticed that the relation (6.3) inside the dielectric and the condition $\mathbf{P} = 0$ outside do not in themselves determine $\mathbf{P}$ uniquely; inside the dielectric we could add to $\mathbf{P}$ any vector of the form $\operatorname{curl}\mathbf{f}$. The exact form of $\mathbf{P}$ can be completely determined only by establishing its connection with the dipole moment.

‡ That is, as regards composition of the adjoining media, temperature, etc. If the dielectric is a crystal, the surface must be a crystallographic plane.
of an isotropic dielectric. It is evident that, in an isotropic dielectric, the vectors $\mathbf{D}$ and $\mathbf{E}$ must be in the same direction. The linear relation between them is therefore a simple proportionality:†

$$\mathbf{D} = \varepsilon\mathbf{E}. \qquad (7.1)$$

The coefficient $\varepsilon$ is the permittivity or dielectric permeability or dielectric constant of the substance and is a function of its thermodynamic state.

As well as the induction, the polarization also is proportional to the field:

$$\mathbf{P} = \kappa\mathbf{E} \equiv (\varepsilon - 1)\mathbf{E}/4\pi. \qquad (7.2)$$

The quantity $\kappa$ is called the polarization coefficient of the substance, or its dielectric susceptibility. Later (§14) we shall show that the permittivity always exceeds unity; the polarizability, accordingly, is always positive. The polarizability of a rarefied medium (a gas) may be regarded as proportional to its density.

The boundary conditions (6.9) and (6.10) on the surface separating two isotropic dielectrics become

$$\mathbf{E}_{t1} = \mathbf{E}_{t2}, \qquad \varepsilon_1E_{n1} = \varepsilon_2E_{n2}. \qquad (7.3)$$

Thus the normal component of the field is discontinuous, changing in inverse proportion to the permittivity of the medium.

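In a homogeneous dielectric, $\varepsilon$ = constant, and then it follows from $\operatorname{div}\mathbf{D} = 0$ that $\operatorname{div}\mathbf{P} = 0$. By the definition (6.3) this means that the volume charge density in such a body is zero (but the surface density (6.4) is in general not zero). On the other hand, in an inhomogeneous dielectric we have a non-zero volume charge density

$$\bar\rho = -\operatorname{div}\mathbf{P} = -\operatorname{div}\frac{(\varepsilon-1)\mathbf{D}}{4\pi\varepsilon} = -\frac{1}{4\pi}\mathbf{D}\cdot\operatorname{grad}\frac{\varepsilon-1}{\varepsilon} = -\frac{1}{4\pi\varepsilon}\mathbf{E}\cdot\operatorname{grad}\varepsilon.$$

If we introduce the electric field potential by $\mathbf{E} = -\operatorname{grad}\phi$, then equation (6.1) is automatically satisfied, and the equation $\operatorname{div}\mathbf{D} = \operatorname{div}\varepsilon\mathbf{E} = 0$ gives

$$\operatorname{div}(\varepsilon\operatorname{grad}\phi) = 0. \qquad (7.4)$$

This equation becomes the ordinary Laplace's equation only in a homogeneous dielectric medium. The boundary conditions (7.3) can be rewritten as the following conditions on the potential:

$$\phi_1 = \phi_2, \qquad \varepsilon_1\,\partial\phi_1/\partial n = \varepsilon_2\,\partial\phi_2/\partial n; \qquad (7.5)$$

the continuity of the tangential derivatives of the potential is equivalent to the continuity of $\phi$ itself.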

In a dielectric medium which is piecewise homogeneous, equation (7.4) reduces in each homogeneous region to Laplace's equation $\Delta\phi = 0$, so that the permittivity appears in the solution of the problem only through the conditions (7.5). These conditions, however, involve only the ratio of the permittivities of two adjoining media. In particular, the solution of an electrostatic problem for a dielectric body with permittivity $\varepsilon_2$, surrounded by a medium with permittivity $\varepsilon_1$, is the same as for a body with permittivity $\varepsilon_2/\varepsilon_1$, surrounded by a vacuum.

Let us consider how the results obtained in Chapter I for the electrostatic field of conductors will be modified if these conductors are not in a vacuum but in a homogeneous and isotropic dielectric medium. In both cases the potential distribution satisfies the equation $\Delta\phi = 0$, with the boundary condition that $\phi$ is constant on the surface of the conductor, and the only difference is that, instead of $E_n = -\partial\phi/\partial n = 4\pi\sigma$, we have

$$D_n = -\varepsilon\,\partial\phi/\partial n = 4\pi\sigma, \qquad (7.6)$$

giving the relation between the potential and the surface charge. Hence it is clear that the solution of the problem of the field of a charged conductor in a vacuum gives the solution of the same problem with a dielectric in place of the vacuum if we make the formal substitution $\phi \to \varepsilon\phi$, $e \to e$ or $\phi \to \phi$, $e \to e/\varepsilon$. For given charges on the conductors, the potential and the field are reduced by a factor $\varepsilon$ in comparison with their values in a vacuum. This reduction in the field can be explained as the result of a partial “screening” of the charge on the conductor by the surface charges on the adjoining polarized dielectric. If, on the other hand, the potentials of the conductors are maintained, then the field is unchanged but the charges are increased by a factor $\varepsilon$.†

Finally, it may be noted that in electrostatics we may formally regard a conductor (uncharged) as a body of infinite permittivity, in the sense that its effect on an external electric field is the same as that of a dielectric (of the same form) as $\varepsilon \to \infty$. For, since the boundary condition on the induction $\mathbf{D}$ is finite, $\mathbf{D}$ must remain finite in the body even for $\varepsilon \to \infty$. This means that $\mathbf{E} \to 0$, in accordance with the properties of conductors.

PROBLEMS 

Problem 1. Determine the field due to a point charge $e$ at a distance $h$ from a plane boundary separating two different dielectric media.

Solution. Let $O$ be the position of the charge $e$ in medium 1, and $O'$ its image in the plane of separation, situated in medium 2 (Fig. 11, p. 38). We shall seek the field in medium 1 in the form of the field of two point charges, $e$ and a fictitious charge $e'$ at $O'$ (cf. the method of images, §3): $\phi_1 = e/\varepsilon_1r + e'/\varepsilon_1r'$, where $r$ and $r'$ are the distances from $O$ and $O'$ respectively. In medium 2 we seek the field as that of a fictitious charge $e''$ at $O$: $\phi_2 = e''/\varepsilon_2r$. On the boundary plane ($r = r'$) the conditions (7.5) must hold, leading to the equations $e - e' = e''$, $(e + e')/\varepsilon_1 = e''/\varepsilon_2$, whence

$$e' = e(\varepsilon_1 - \varepsilon_2)/(\varepsilon_1 + \varepsilon_2), \qquad e'' = 2\varepsilon_2e/(\varepsilon_1 + \varepsilon_2). \qquad (1)$$

For $\varepsilon_2 \to \infty$ we have $e' = -e$, $\phi_2 = 0$, i.e. the result obtained in §3 for the field of a point charge near a conducting plane.

The force acting on the charge $e$ (the image force) is

$$F = \frac{ee'}{(2h)^2\varepsilon_1} = \left(\frac{e}{2h}\right)^2\frac{\varepsilon_1 - \varepsilon_2}{\varepsilon_1(\varepsilon_1 + \varepsilon_2)};$$

$F > 0$ corresponds to repulsion.
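The image charges and the image force are simple enough to tabulate. The sketch assumes the two matching conditions $e - e' = e''$ and $(e + e')/\varepsilon_1 = e''/\varepsilon_2$ and the force $F = ee'/\varepsilon_1(2h)^2$ in Gaussian units; the function names are illustrative.

```python
def image_charges(e, eps1, eps2):
    """Fictitious charges (e', e'') for a point charge e in medium 1 near a plane boundary."""
    e_image = e * (eps1 - eps2) / (eps1 + eps2)
    e_transmitted = 2.0 * eps2 * e / (eps1 + eps2)
    return e_image, e_transmitted

def image_force(e, h, eps1, eps2):
    """Force on the charge at distance h from the boundary; positive means repulsion."""
    e_image, _ = image_charges(e, eps1, eps2)
    return e * e_image / (eps1 * (2.0 * h) ** 2)
```

The charge is attracted towards the medium of higher permittivity ($F < 0$ for $\varepsilon_2 > \varepsilon_1$), and for very large $\varepsilon_2$ the image charge tends to $-e$, as for a conducting plane.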

Problem 2. The same as Problem 1, but for an infinite charged straight wire parallel to a plane boundary surface at a distance $h$.

† From this it follows, in particular, that when a capacitor is filled with a dielectric its capacitance increases by a factor $\varepsilon$.

Fig. 11

Solution. As in Problem 1, except that the field potentials in the two media are $\phi_1 = -(2e/\varepsilon_1)\log r - (2e'/\varepsilon_1)\log r'$, $\phi_2 = -(2e''/\varepsilon_2)\log r$, where $e$, $e'$, $e''$ are the charges per unit length of the wire and of its images, and $r$, $r'$ are the distances in a plane perpendicular to the wire. The same expressions (1) are obtained for $e'$, $e''$, and the force on unit length of the wire is $F = 2ee'/2h\varepsilon_1 = e^2(\varepsilon_1 - \varepsilon_2)/h\varepsilon_1(\varepsilon_1 + \varepsilon_2)$.

Problem 3. Determine the field due to an infinite charged straight wire in a medium with permittivity $\varepsilon_1$, lying parallel to a cylinder with radius $a$ and permittivity $\varepsilon_2$, at a distance $b$ ($> a$) from its axis.†

Solution. We seek the field in medium 1 as that produced in a homogeneous dielectric (with $\varepsilon_1$) by the actual wire (passing through $O$ in Fig. 12), with charge $e$ per unit length, and two fictitious wires with charges $e'$ and $-e'$ per unit length, passing through $A$ and $O'$ respectively. The point $A$ is at a distance $a^2/b$ from the axis of the cylinder. Then, for all points on the circumference, the distances $r$ and $r'$ from $O$ and $A$ are in a constant ratio $r'/r = a/b$, and so it is possible to satisfy the boundary conditions on this circumference. In medium 2 we seek the field as that produced in a homogeneous medium (with $\varepsilon_2$) by a fictitious charge $e''$ on the wire passing through $O$.

The boundary conditions on the surface of separation are conveniently formulated in terms of the potential $\phi$ ($\mathbf{E} = -\operatorname{grad}\phi$) and the vector potential $\mathbf{A}$ (cf. §3), defined by $\mathbf{D} = \operatorname{curl}\mathbf{A}$ (in accordance with the equation $\operatorname{div}\mathbf{D} = 0$). In a two-dimensional problem, $\mathbf{A}$ is in the $z$-direction (perpendicular to the plane of the figure). The conditions of continuity for the tangential components of $\mathbf{E}$ and the normal component of $\mathbf{D}$ are equivalent to $\phi_1 = \phi_2$, $A_1 = A_2$.

For the field of a charged wire we have in polar coordinates $r$, $\theta$ the equation $\phi = -(2e/\varepsilon)\log r$ + constant, $A = 2e\theta$ + constant; cf. (3.18). Hence the boundary conditions are

$$\frac{2}{\varepsilon_1}(-e\log r - e'\log r' + e'\log a) = -\frac{2e''}{\varepsilon_2}\log r + \text{constant},$$

$$2[e\theta + e'\theta' - e'(\theta + \theta')] = 2e''\theta,$$

where the angles are as shown in Fig. 12, and we have used the fact that $OO'B$ and $BO'A$ are similar triangles. Hence $\varepsilon_2(e + e') = \varepsilon_1e''$, $e - e' = e''$, and the expressions for $e'$ and $e''$ are again formulae (1) of Problem 1.

The force acting on unit length of the charged wire is parallel to $OO'$, and is

$$F = eE = \frac{2ee'}{\varepsilon_1}\left(\frac{1}{OA} - \frac{1}{OO'}\right) = \frac{2e^2(\varepsilon_1 - \varepsilon_2)a^2}{\varepsilon_1(\varepsilon_1 + \varepsilon_2)b(b^2 - a^2)};$$

$F > 0$ corresponds to repulsion. In the limit $a, b \to \infty$, $b - a \to h$, this gives the result for a wire near a plane boundary (Problem 2).

† The corresponding problem of a point charge near a dielectric sphere cannot be solved in closed form.

Problem 4. The same as Problem 3, but for the case where the wire is inside a cylinder with permittivity $\varepsilon_2$.

Solution. We seek the field in medium 2 as that due to the actual wire, with charge $e$ per unit length ($O$ in Fig. 13), and a fictitious wire with charge $e'$ per unit length passing through $A$, which is now outside the cylinder. In medium 1 we seek the field as that of wires with charges $e''$ and $e - e''$ passing through $O$ and $O'$ respectively. By the same method as in the preceding problem we find $e' = -e(\varepsilon_1 - \varepsilon_2)/(\varepsilon_1 + \varepsilon_2)$, $e'' = 2\varepsilon_1e/(\varepsilon_1 + \varepsilon_2)$. For $\varepsilon_2 > \varepsilon_1$ the wire is repelled from the surface of the cylinder by a force

$$F = \frac{2ee'}{\varepsilon_2\,OA} = \frac{2e^2(\varepsilon_2 - \varepsilon_1)b}{\varepsilon_2(\varepsilon_1 + \varepsilon_2)(a^2 - b^2)}.$$

Fig. 13


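Problem 5. Show that the field potential $\phi_A(\mathbf{r}_B)$ at a point $\mathbf{r}_B$ in an arbitrary inhomogeneous dielectric medium, due to a point charge $e$ at $\mathbf{r}_A$, is equal to the potential $\phi_B(\mathbf{r}_A)$ at $\mathbf{r}_A$ due to the same charge at $\mathbf{r}_B$.

Solution. The potentials $\phi_A(\mathbf{r})$ and $\phi_B(\mathbf{r})$ satisfy the equations

$$\operatorname{div}(\varepsilon\operatorname{grad}\phi_A) = -4\pi e\,\delta(\mathbf{r} - \mathbf{r}_A), \qquad \operatorname{div}(\varepsilon\operatorname{grad}\phi_B) = -4\pi e\,\delta(\mathbf{r} - \mathbf{r}_B).$$

Multiplying the first by $\phi_B$ and the second by $\phi_A$ and subtracting, we have

$$\operatorname{div}(\phi_B\,\varepsilon\operatorname{grad}\phi_A) - \operatorname{div}(\phi_A\,\varepsilon\operatorname{grad}\phi_B) = -4\pi e\,\delta(\mathbf{r} - \mathbf{r}_A)\,\phi_B(\mathbf{r}) + 4\pi e\,\delta(\mathbf{r} - \mathbf{r}_B)\,\phi_A(\mathbf{r}).$$

Integration of this equation over all space gives the required relation:

$$\phi_A(\mathbf{r}_B) = \phi_B(\mathbf{r}_A).$$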
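The integration step can be spelled out as follows. This is only a sketch: it assumes that both potentials fall off at infinity fast enough for the surface integral over an infinitely distant surface to vanish.

```latex
\int \operatorname{div}\bigl(\phi_B\,\varepsilon\operatorname{grad}\phi_A
      - \phi_A\,\varepsilon\operatorname{grad}\phi_B\bigr)\,dV
  = \oint \varepsilon\bigl(\phi_B\operatorname{grad}\phi_A
      - \phi_A\operatorname{grad}\phi_B\bigr)\cdot d\mathbf{f} = 0,
\qquad
\int \bigl[-4\pi e\,\delta(\mathbf{r}-\mathbf{r}_A)\,\phi_B(\mathbf{r})
      + 4\pi e\,\delta(\mathbf{r}-\mathbf{r}_B)\,\phi_A(\mathbf{r})\bigr]\,dV
  = 4\pi e\,\bigl[\phi_A(\mathbf{r}_B) - \phi_B(\mathbf{r}_A)\bigr].
```

Equating the two results gives the reciprocity relation.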


§8. A dielectric ellipsoid 

The polarization of a dielectric ellipsoid in a uniform external electric field has some unusual properties which render this example particularly interesting.

Let us first consider a simple special case, that of a dielectric sphere in an external field $\boldsymbol{\mathfrak{E}}$. We denote its permittivity by $\varepsilon^{(i)}$, and that of the medium surrounding it by $\varepsilon^{(e)}$. We take the origin of spherical polar coordinates at the centre of the sphere, and the direction of $\boldsymbol{\mathfrak{E}}$ as the axis from which the polar angle $\theta$ is measured, and seek the field potential outside the sphere in the form $\phi^{(e)} = -\boldsymbol{\mathfrak{E}}\cdot\mathbf{r} + A\,\boldsymbol{\mathfrak{E}}\cdot\mathbf{r}/r^3$; the first term is the potential of the external field imposed, and the second, which vanishes at infinity, gives the required change in potential due to the sphere (cf. §3, Problem 1, solution). Inside the sphere, we seek the field potential in the form $\phi^{(i)} = -B\,\boldsymbol{\mathfrak{E}}\cdot\mathbf{r}$, the only function which satisfies Laplace's equation, remains finite at the centre of the sphere, and depends only on the constant vector $\boldsymbol{\mathfrak{E}}$ (which is the only parameter of the problem).

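The constants $A$ and $B$ are determined by the boundary conditions on the surface of the sphere. It may be seen at once, however, that the field in the sphere $\mathbf{E}^{(i)} = B\boldsymbol{\mathfrak{E}}$ is uniform and differs only in magnitude from the applied field $\boldsymbol{\mathfrak{E}}$.

The boundary condition of continuity of the potential gives $\mathbf{E}^{(i)} = \boldsymbol{\mathfrak{E}}(1 - A/R^3)$, where $R$ is the radius of the sphere, and the condition of continuity of the normal component of the induction gives

$$\mathbf{D}^{(i)} = \varepsilon^{(e)}\boldsymbol{\mathfrak{E}}(1 + 2A/R^3).$$

Eliminating $A$ from these two equations, we obtain

$$\tfrac13\left(\mathbf{D}^{(i)} + 2\varepsilon^{(e)}\mathbf{E}^{(i)}\right) = \varepsilon^{(e)}\boldsymbol{\mathfrak{E}} \tag{8.1}$$

or, substituting $\mathbf{D}^{(i)} = \varepsilon^{(i)}\mathbf{E}^{(i)}$,

$$\mathbf{E}^{(i)} = 3\varepsilon^{(e)}\boldsymbol{\mathfrak{E}}/(2\varepsilon^{(e)} + \varepsilon^{(i)}). \tag{8.2}$$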
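The elimination of $A$ that leads to (8.1) and (8.2) takes one line. In this sketch $\mathfrak{E}$ is the magnitude of the external field and $R$ the radius of the sphere; all vectors are parallel, so magnitudes suffice.

```latex
\frac{A}{R^3} = 1 - \frac{E^{(i)}}{\mathfrak{E}}, \qquad
D^{(i)} = \varepsilon^{(e)}\mathfrak{E}\Bigl(3 - \frac{2E^{(i)}}{\mathfrak{E}}\Bigr)
        = 3\varepsilon^{(e)}\mathfrak{E} - 2\varepsilon^{(e)}E^{(i)}
\;\Longrightarrow\;
\tfrac13\bigl(D^{(i)} + 2\varepsilon^{(e)}E^{(i)}\bigr) = \varepsilon^{(e)}\mathfrak{E},
\qquad
D^{(i)} = \varepsilon^{(i)}E^{(i)}
\;\Longrightarrow\;
E^{(i)} = \frac{3\varepsilon^{(e)}\mathfrak{E}}{2\varepsilon^{(e)} + \varepsilon^{(i)}}.
```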

The problem of an infinite dielectric cylinder in an external field perpendicular to its axis is solved in an entirely similar manner (cf. §3, Problem 2). The field inside the cylinder, like that inside the sphere in the above example, is uniform. It satisfies the relation

$$\tfrac12\left(\mathbf{D}^{(i)} + \varepsilon^{(e)}\mathbf{E}^{(i)}\right) = \varepsilon^{(e)}\boldsymbol{\mathfrak{E}} \tag{8.3}$$

or, when $\mathbf{D}^{(i)} = \varepsilon^{(i)}\mathbf{E}^{(i)}$,

$$\mathbf{E}^{(i)} = 2\varepsilon^{(e)}\boldsymbol{\mathfrak{E}}/(\varepsilon^{(e)} + \varepsilon^{(i)}). \tag{8.4}$$

The relations (8.1) and (8.3), in which the permittivity $\varepsilon^{(i)}$ of the sphere or cylinder does not appear explicitly, are particularly important because their validity does not depend on a linear relation between $\mathbf{E}$ and $\mathbf{D}$ within the body; they hold whatever the form of this relation (e.g. for anisotropic bodies). The analogous relations

$$\mathbf{E}^{(i)} = \boldsymbol{\mathfrak{E}} \tag{8.5}$$

for a cylinder in a longitudinal field and

$$\mathbf{D}^{(i)} = \varepsilon^{(e)}\boldsymbol{\mathfrak{E}} \tag{8.6}$$

for a flat plate in a field perpendicular to it are similarly valid; these relations are evident at once from the boundary conditions.

The property of causing a uniform field within itself on being placed in a uniform external field is found to pertain to any ellipsoid, whatever the ratio of the semiaxes $a$, $b$, $c$. The problem of the polarization of a dielectric ellipsoid is solved by the use of ellipsoidal coordinates, in the same way as the corresponding problem for a conducting ellipsoid in §4.

Let the external field be again in the $x$-direction. The field potential outside the ellipsoid may again be sought in the form (4.22): $\phi'_e = \phi_0 F(\xi)$, with the function $F(\xi)$ given by (4.23).

Such a function cannot, however, appear in the field potential inside the ellipsoid, since it does not satisfy the condition that the field must be finite everywhere inside the ellipsoid. For let us consider the surface $\xi = -c^2$, which is an ellipse in the $xy$-plane, with semiaxes $\sqrt{a^2 - c^2}$ and $\sqrt{b^2 - c^2}$, lying within the ellipsoid. For $\xi \to -c^2$, the integral (4.23) behaves as $\sqrt{\xi + c^2}$. The field, i.e. the potential gradient, therefore behaves as $1/\sqrt{\xi + c^2}$, and becomes infinite at $\xi = -c^2$. Thus the only solution suitable for the field inside the ellipsoid is $F(\xi)$ = constant, so that $\phi_i$ must be sought in the form $\phi_i = B\phi_0$. We see that the potential $\phi_i$ differs only by a constant factor from the potential $\phi_0$ of the uniform field. In other words, the field inside the ellipsoid is also uniform.

We shall not pause to write out the formulae for the field outside the ellipsoid. The 
uniform field inside the ellipsoid can be found without actually writing out the boundary 
conditions, by using some results already known. 

Let us first suppose that the ellipsoid is in a vacuum ($\varepsilon^{(e)} = 1$). Then there must be a linear relation between the vectors $\mathbf{E}^{(i)}$, $\mathbf{D}^{(i)}$ and $\boldsymbol{\mathfrak{E}}$ (which are all in the $x$-direction), of the form $aE^{(i)}_x + bD^{(i)}_x = \mathfrak{E}_x$, where the coefficients $a$, $b$ depend only on the shape of the ellipsoid, and not on its permittivity $\varepsilon^{(i)}$. The existence of such a relation follows from the form of the boundary conditions, as we saw above in the examples of the sphere and the cylinder.

To determine $a$ and $b$ we notice that, in the trivial particular case $\varepsilon^{(i)} = 1$, we have simply $\mathbf{E} = \mathbf{D} = \boldsymbol{\mathfrak{E}}$, and so $a + b = 1$. Another particular case for which the solution is known is that of a conducting ellipsoid. In a conductor $\mathbf{E}^{(i)} = 0$, and the induction $\mathbf{D}^{(i)}$, though it has no direct physical significance, may be regarded formally as being related to the total dipole moment of the ellipsoid by $\mathbf{D}^{(i)} = 4\pi\mathbf{P} = 4\pi\boldsymbol{\mathscr{P}}/V$. According to (4.26) we then have $n^{(x)}D^{(i)}_x = \mathfrak{E}_x$, i.e. $b = n^{(x)}$, and so $a = 1 - n^{(x)}$. Thus we conclude that

$$(1 - n^{(x)})E^{(i)}_x + n^{(x)}D^{(i)}_x = \mathfrak{E}_x, \tag{8.7}$$

or

$$E^{(i)}_x = \mathfrak{E}_x - 4\pi n^{(x)}P_x. \tag{8.8}$$

The quantity $4\pi n^{(x)}P_x$ is called the depolarizing field.† Similar relations, but with coefficients $n^{(y)}$, $n^{(z)}$, hold for the fields in the $y$ and $z$ directions. Like the particular formulae (8.1) and (8.3), they are valid whatever the relation between $\mathbf{E}$ and $\mathbf{D}$ inside the ellipsoid.

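The field inside the ellipsoid is found from (8.7) by putting $D^{(i)}_x = \varepsilon^{(i)}E^{(i)}_x$:

$$E^{(i)}_x = \mathfrak{E}_x/[1 + (\varepsilon^{(i)} - 1)n^{(x)}], \tag{8.9}$$

and the total dipole moment of the ellipsoid is

$$\mathscr{P}_x = VP_x = (\varepsilon^{(i)} - 1)VE^{(i)}_x/4\pi = \tfrac13 abc\,(\varepsilon^{(i)} - 1)\mathfrak{E}_x/[1 + (\varepsilon^{(i)} - 1)n^{(x)}]. \tag{8.10}$$

If the field $\boldsymbol{\mathfrak{E}}$ has components along all three axes, then the field inside the ellipsoid is still uniform, but in general not parallel to $\boldsymbol{\mathfrak{E}}$. For an arbitrary choice of coordinate axes we can write the relation (8.7) in the general form

$$E^{(i)}_i + n_{ik}\left(D^{(i)}_k - E^{(i)}_k\right) = \mathfrak{E}_i. \tag{8.11}$$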

The transition to the case where the permittivity of the medium differs from unity is effected by simply replacing $\varepsilon^{(i)}$ by $\varepsilon^{(i)}/\varepsilon^{(e)}$. Then formula (8.7) becomes

$$(1 - n^{(x)})\varepsilon^{(e)}E^{(i)}_x + n^{(x)}D^{(i)}_x = \varepsilon^{(e)}\mathfrak{E}_x. \tag{8.12}$$

This formula can be applied, in particular, to the field inside an ellipsoidal cavity in an infinite dielectric medium. In this case $\varepsilon^{(i)} = 1$.

† Similar formulae hold for a magnetized ellipsoid in a uniform external magnetic field (see §29). In that case the coefficients $n^{(x)}$, $n^{(y)}$, $n^{(z)}$ are called demagnetizing factors.
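The special cases (8.2), (8.4), (8.5) and (8.6) all follow from (8.12) with a linear relation between the induction and the field inside the body. A minimal sketch, assuming the field lies along a principal axis with depolarizing factor `n`; the function names are illustrative only.

```python
import math


def internal_field(G, n, eps_i, eps_e=1.0):
    """Field inside an ellipsoid along a principal axis, from (8.12) with
    D = eps_i * E inside the body. G is the external field component,
    n the depolarizing factor for that axis (0 <= n <= 1)."""
    if not 0.0 <= n <= 1.0:
        raise ValueError("depolarizing factor must lie in [0, 1]")
    return eps_e * G / ((1.0 - n) * eps_e + n * eps_i)


def dipole_moment(G, n, eps_i, a, b, c):
    """Total dipole moment component of an ellipsoid in vacuum, formula (8.10)."""
    return a * b * c * (eps_i - 1.0) * G / (3.0 * (1.0 + (eps_i - 1.0) * n))


if __name__ == "__main__":
    G, eps_i, eps_e = 1.0, 5.0, 2.0
    print("sphere   (n = 1/3):", internal_field(G, 1.0 / 3.0, eps_i, eps_e))
    print("cylinder (n = 1/2):", internal_field(G, 0.5, eps_i, eps_e))
    print("cavity in medium  :", internal_field(G, 1.0 / 3.0, 1.0, eps_e))
```

Setting `eps_i = 1` gives the field in an ellipsoidal cavity, as noted above; for a spherical cavity this is $3\varepsilon^{(e)}\mathfrak{E}/(2\varepsilon^{(e)} + 1)$.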


PROBLEMS†

Problem 1. Determine the torque on a spheroid in a uniform electric field.

Solution. According to the general formula (16.13), the torque on an ellipsoid is $\mathbf{K} = \boldsymbol{\mathscr{P}} \times \boldsymbol{\mathfrak{E}}$, where $\boldsymbol{\mathscr{P}}$ is the dipole moment of the ellipsoid. In a spheroid, the vector $\boldsymbol{\mathscr{P}}$ is in a plane passing through the axis of symmetry and the direction of $\boldsymbol{\mathfrak{E}}$. The torque is perpendicular to this plane, and a calculation of its magnitude from formulae (8.10) gives

$$K = \frac{(\varepsilon - 1)^2\,|1 - 3n|\,\mathfrak{E}^2 V \sin 2\alpha}{8\pi\,[n\varepsilon + 1 - n]\,[(1 - n)\varepsilon + 1 + n]},$$

where $\alpha$ is the angle between the direction of $\boldsymbol{\mathfrak{E}}$ and the axis of symmetry of the spheroid, and $n$ is the depolarizing factor along the axis (so that the depolarizing factors in the directions perpendicular to the axis are $\tfrac12(1 - n)$). The torque is directed so that it tends to turn the axis of symmetry of a prolate ($n < \tfrac13$) or oblate ($n > \tfrac13$) ellipsoid parallel or perpendicular to the field respectively.

For a conducting ellipsoid ($\varepsilon \to \infty$) we have

$$K = \frac{|1 - 3n|}{8\pi n(1 - n)}\,V\mathfrak{E}^2 \sin 2\alpha.$$
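The closed form for the torque can be cross-checked against a direct evaluation of the cross product, using (8.10) for the dipole-moment components along and perpendicular to the axis of symmetry. This is a sketch for a spheroid in vacuum with $0 \le \alpha \le \pi/2$; the function names are illustrative.

```python
import math


def spheroid_dipole(eps, n, V, G, alpha):
    """Dipole-moment components (along, perpendicular to the symmetry axis)
    from (8.10); the perpendicular depolarizing factor is (1 - n) / 2."""
    n_perp = 0.5 * (1.0 - n)
    chi = (eps - 1.0) * V / (4.0 * math.pi)
    p_par = chi * G * math.cos(alpha) / (1.0 + (eps - 1.0) * n)
    p_perp = chi * G * math.sin(alpha) / (1.0 + (eps - 1.0) * n_perp)
    return p_par, p_perp


def torque_from_cross_product(eps, n, V, G, alpha):
    """Magnitude of the cross product of the dipole moment and the field."""
    p_par, p_perp = spheroid_dipole(eps, n, V, G, alpha)
    return abs(p_par * G * math.sin(alpha) - p_perp * G * math.cos(alpha))


def torque_closed_form(eps, n, V, G, alpha):
    """Closed-form magnitude of the torque quoted in the solution."""
    num = (eps - 1.0) ** 2 * abs(1.0 - 3.0 * n) * G**2 * V * math.sin(2.0 * alpha)
    den = 8.0 * math.pi * (n * eps + 1.0 - n) * ((1.0 - n) * eps + 1.0 + n)
    return num / den


def torque_conductor(n, V, G, alpha):
    """Limit eps -> infinity of the closed form."""
    return abs(1.0 - 3.0 * n) * V * G**2 * math.sin(2.0 * alpha) / (
        8.0 * math.pi * n * (1.0 - n)
    )


if __name__ == "__main__":
    args = (4.0, 0.2, 1.0, 1.0, math.radians(30.0))
    print(torque_from_cross_product(*args), torque_closed_form(*args))
```

For a sphere ($n = \tfrac13$) both expressions vanish, as they must by symmetry.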


Problem 2. A hollow dielectric sphere (with permittivity $\varepsilon$ and internal and external radii $b$ and $a$) is in a uniform external field $\boldsymbol{\mathfrak{E}}$. Determine the field in the cavity.

Solution. As above in the problem of a solid sphere, we seek the field potentials in the vacuum outside the sphere (region 1) and in the cavity (region 3) in the forms $\phi_1 = -\mathfrak{E}\cos\theta\,(r - A/r^2)$, $\phi_3 = -B\mathfrak{E}r\cos\theta$, and that in the dielectric (region 2) as $\phi_2 = -C\mathfrak{E}\cos\theta\,(r - D/r^2)$, where $A$, $B$, $C$, $D$ are constants determined from the conditions of continuity of $\phi$ and $\varepsilon\,\partial\phi/\partial r$ at the boundaries 1-2 and 2-3. Thus the field $\mathbf{E}_3 = B\boldsymbol{\mathfrak{E}}$ in the cavity is uniform, but the field $\mathbf{E}_2$ in the sphere is not. A calculation of the constants gives the result

$$\mathbf{E}_3 = 9\varepsilon\boldsymbol{\mathfrak{E}}/[(\varepsilon + 2)(2\varepsilon + 1) - 2(\varepsilon - 1)^2(b/a)^3].$$

Problem 3. The same as Problem 2, but for a hollow cylinder in a uniform transverse field.‡

Solution. As in Problem 2, with the result

$$\mathbf{E}_3 = 4\varepsilon\boldsymbol{\mathfrak{E}}/[(\varepsilon + 1)^2 - (\varepsilon - 1)^2(b/a)^2].$$
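The two cavity-field results have simple limits that serve as a check: no shell at all ($\varepsilon = 1$ or $b = a$) must leave the field unchanged, and a vanishingly small cavity ($b \to 0$) must reproduce the field of the solid body from (8.2) or (8.4), followed by the field in a small cavity inside that uniformly polarized dielectric. A sketch with illustrative function names, for a body in vacuum:

```python
def cavity_field_hollow_sphere(G, eps, a, b):
    """Field in the cavity of a hollow sphere (outer radius a, inner radius b)."""
    if not 0.0 <= b <= a:
        raise ValueError("requires 0 <= b <= a")
    return 9.0 * eps * G / (
        (eps + 2.0) * (2.0 * eps + 1.0) - 2.0 * (eps - 1.0) ** 2 * (b / a) ** 3
    )


def cavity_field_hollow_cylinder(G, eps, a, b):
    """Field in the bore of a hollow cylinder in a transverse field."""
    if not 0.0 <= b <= a:
        raise ValueError("requires 0 <= b <= a")
    return 4.0 * eps * G / ((eps + 1.0) ** 2 - (eps - 1.0) ** 2 * (b / a) ** 2)


if __name__ == "__main__":
    for ratio in (0.0, 0.5, 0.9, 1.0):
        print(ratio,
              cavity_field_hollow_sphere(1.0, 4.0, 1.0, ratio),
              cavity_field_hollow_cylinder(1.0, 4.0, 1.0, ratio))
```

The screening of the cavity is strongest for a thick shell of large permittivity and disappears as the shell becomes infinitely thin.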


§9. The permittivity of a mixture 

If a substance is a finely dispersed mixture (an emulsion, powder mixture, etc.), we can consider the electric field averaged over volumes which are large compared with the scale of the inhomogeneities. The mixture is a homogeneous and isotropic medium with respect to such an average field, and so may be characterized by an effective permittivity, which we denote by $\varepsilon_{\rm mix}$. If $\bar{\mathbf{E}}$ and $\bar{\mathbf{D}}$ are the field and induction averaged in this way, then, by the definition of $\varepsilon_{\rm mix}$,

$$\bar{\mathbf{D}} = \varepsilon_{\rm mix}\bar{\mathbf{E}}. \tag{9.1}$$

If all the particles in the mixture are isotropic, and the differences in their permittivities are small in comparison with $\varepsilon$ itself, it is possible to calculate $\varepsilon_{\rm mix}$ in a general form which is correct as far as terms of the second order in these differences.


† In these three Problems the body is assumed to be in a vacuum.
‡ In a longitudinal field the solution is clearly $\mathbf{E}_3 = \boldsymbol{\mathfrak{E}}$.
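We write the local field as $\mathbf{E} = \bar{\mathbf{E}} + \delta\mathbf{E}$, and the local permittivity as $\bar\varepsilon + \delta\varepsilon$, where

$$\bar\varepsilon = (1/V)\int\varepsilon\,dV \tag{9.2}$$

is obtained by averaging over the volume. Then the mean induction is

$$\bar{\mathbf{D}} = \overline{(\bar\varepsilon + \delta\varepsilon)(\bar{\mathbf{E}} + \delta\mathbf{E})} = \bar\varepsilon\bar{\mathbf{E}} + \overline{\delta\varepsilon\,\delta\mathbf{E}}, \tag{9.3}$$

since the mean values of $\delta\varepsilon$ and $\delta\mathbf{E}$ are zero by definition. In the zero-order approximation $\varepsilon_{\rm mix} = \bar\varepsilon$; the first non-zero correction term will, of course, be of the second order in $\delta\varepsilon$, as we see from (9.3).

From the non-averaged equation $\operatorname{div}\mathbf{D} = 0$ we have, as far as small terms of the first order,

$$\operatorname{div}[(\bar\varepsilon + \delta\varepsilon)(\bar{\mathbf{E}} + \delta\mathbf{E})] = \bar\varepsilon\operatorname{div}\delta\mathbf{E} + \bar{\mathbf{E}}\cdot\operatorname{grad}\delta\varepsilon = 0. \tag{9.4}$$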


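The averaging of the product $\delta\varepsilon\,\delta\mathbf{E}$ in (9.3) is done in two stages. We first average over the volume of particles of a given kind, i.e. for a given $\delta\varepsilon$. The value of $\delta\mathbf{E}$ thus averaged is easily obtained from equation (9.4): on account of the isotropy of the mixture as a whole,

$$\overline{\frac{\partial\,\delta E_x}{\partial x}} = \overline{\frac{\partial\,\delta E_y}{\partial y}} = \overline{\frac{\partial\,\delta E_z}{\partial z}} = \tfrac13\,\overline{\operatorname{div}\delta\mathbf{E}}.$$

If $\bar{\mathbf{E}}$ is in the $x$-direction, say, we have from (9.4)

$$3\bar\varepsilon\,\overline{\frac{\partial\,\delta E_x}{\partial x}} = -\bar E_x\,\frac{\partial\,\delta\varepsilon}{\partial x},$$

whence

$$\overline{\delta E_x} = -(\bar E_x/3\bar\varepsilon)\,\delta\varepsilon.$$

Since the direction of the $x$-axis is chosen arbitrarily, this equation may be written in the vector form $\overline{\delta\mathbf{E}} = -(1/3\bar\varepsilon)\bar{\mathbf{E}}\,\delta\varepsilon$. Multiplying by $\delta\varepsilon$ and effecting the final averaging over all components of the mixture, we obtain $\overline{\delta\varepsilon\,\delta\mathbf{E}} = -(1/3\bar\varepsilon)\bar{\mathbf{E}}\,\overline{(\delta\varepsilon)^2}$. Finally, substituting this expression in (9.3) and comparing with (9.1), we have the required result: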


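$$\varepsilon_{\rm mix} = \bar\varepsilon - (1/3\bar\varepsilon)\,\overline{(\delta\varepsilon)^2}. \tag{9.5}$$

This formula can be written in another manner if we put

$$\overline{\varepsilon^{1/3}} = \bar\varepsilon^{1/3}\left[1 - \overline{(\delta\varepsilon)^2}/9\bar\varepsilon^2\right];$$

this is accurate to terms of the second order. Then

$$\varepsilon_{\rm mix}^{1/3} = \overline{\varepsilon^{1/3}}. \tag{9.6}$$

Thus we can say that, in this approximation, the cube root of $\varepsilon$ is additive.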

Another limiting case for which an exact treatment is possible concerns the permittivity of an emulsion having an arbitrary difference between the permittivities $\varepsilon_1$ of the medium and $\varepsilon_2$ of the disperse phase but only a small concentration of the latter, whose particles are assumed spherical.

In the integral

$$\frac1V\int(\mathbf{D} - \varepsilon_1\mathbf{E})\,dV = \bar{\mathbf{D}} - \varepsilon_1\bar{\mathbf{E}}$$

the integrand is zero except within particles of the emulsion. It is therefore proportional to the volume concentration $c$ of the emulsion, and in calculating it we can assume that the particles are in an external field which equals the mean field $\bar{\mathbf{E}}$. Assuming the particles spherical and using formula (8.2), we obtain for the proportionality coefficient between $\bar{\mathbf{D}}$ and $\bar{\mathbf{E}}$

$$\varepsilon_{\rm mix} = \varepsilon_1 + 3c\,\varepsilon_1(\varepsilon_2 - \varepsilon_1)/(\varepsilon_2 + 2\varepsilon_1). \tag{9.7}$$

This formula is correct to terms of the first order in $c$. When $\varepsilon_1$ and $\varepsilon_2$ are nearly equal it is the same (to the first order in $c$ and the second in $\varepsilon_2 - \varepsilon_1$) as the result given by formula (9.5) for small $c$.
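The agreement between the three mixture formulae in their common range of validity is easy to verify numerically. The sketch below assumes a mixture described by volume fractions summing to one; the function names and the sample values are illustrative.

```python
def eps_mix_small_contrast(fractions, eps_values):
    """Formula (9.5): mean permittivity minus variance / (3 * mean)."""
    mean = sum(c * e for c, e in zip(fractions, eps_values))
    var = sum(c * (e - mean) ** 2 for c, e in zip(fractions, eps_values))
    return mean - var / (3.0 * mean)


def eps_mix_cube_root(fractions, eps_values):
    """Formula (9.6): the cube root of the permittivity is additive."""
    return sum(c * e ** (1.0 / 3.0) for c, e in zip(fractions, eps_values)) ** 3


def eps_mix_dilute_spheres(c, eps1, eps2):
    """Formula (9.7): small volume concentration c of spheres (eps2)
    dispersed in a medium (eps1); the contrast may be arbitrary."""
    return eps1 + 3.0 * c * eps1 * (eps2 - eps1) / (eps2 + 2.0 * eps1)


if __name__ == "__main__":
    c, eps1, eps2 = 0.05, 2.0, 2.1
    print(eps_mix_small_contrast([1.0 - c, c], [eps1, eps2]))
    print(eps_mix_cube_root([1.0 - c, c], [eps1, eps2]))
    print(eps_mix_dilute_spheres(c, eps1, eps2))
```

For a small concentration and a small contrast the three values differ only in terms of higher order, which is the statement made in the text.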


§10. Thermodynamic relations for dielectrics in an electric field 

The question of the change in thermodynamic properties owing to the presence of an 
electric field does not arise for conductors. Since there is no electric field inside a 
conductor, any change in its thermodynamic properties amounts simply to an increase in 
its total energy by the energy of the field which it produces in the surrounding space.† This
quantity is quite independent of the thermodynamic state (and, in particular, of the 
temperature) of the body, and so does not affect the entropy, for example. 

On the other hand, an electric field penetrates into a dielectric and so has a great effect 
on its thermodynamic properties. To investigate this effect, let us first determine the work 
done on a thermally insulated dielectric when the field in it undergoes an infinitesimal 
change. 

The electric field in which the dielectric is placed must be imagined as due to various external charged conductors, and the change in the field can then be regarded as resulting from changes in the charges on these conductors.‡ Let us suppose for simplicity that there is only one conductor, with charge $e$ and potential $\phi$. The work which must be done to increase its charge by an infinitesimal amount $\delta e$ is

$$\delta R = \phi\,\delta e; \tag{10.1}$$

this is the mechanical work done by the given field on a charge $\delta e$ brought from infinity (where the field potential is zero) to the surface of the conductor, i.e. through a potential difference of $\phi$. We shall put $\delta R$ in a form which is expressed in terms of the field in the space filled with dielectric which surrounds the conductor.

If $D_n$ is the component of the electric induction vector in the direction of the normal to the surface of the conductor (out of the dielectric and into the conductor), then the surface charge density on the conductor is $-D_n/4\pi$, so that

$$e = -\frac{1}{4\pi}\oint D_n\,df.$$


† We here neglect the energy of the attachment of the charge to the substance of the conductor; this will be discussed in §23.

‡ The final results which we shall obtain involve only the values of the field inside the dielectric, and therefore are independent of the origin of the field. For this reason there is no need for special discussion of the case where the field is produced, not by charged conductors, but (for instance) by extraneous charges placed in the dielectric itself or by pyroelectric polarization of it (§13).
Since the potential $\phi$ is constant on the surface of the conductor, we can write

$$\delta R = \phi\,\delta e = -\frac{1}{4\pi}\oint\phi\,\delta D_n\,df = -\frac{1}{4\pi}\int\operatorname{div}(\phi\,\delta\mathbf{D})\,dV.$$

The last integral is taken over the whole volume outside the conductor. Since the varied field, like the original field, must satisfy the field equations, we have $\operatorname{div}\delta\mathbf{D} = 0$, and so $\operatorname{div}(\phi\,\delta\mathbf{D}) = \phi\operatorname{div}\delta\mathbf{D} + \delta\mathbf{D}\cdot\operatorname{grad}\phi = -\mathbf{E}\cdot\delta\mathbf{D}$. Thus the following important formula is obtained:

$$\delta R = \int(\mathbf{E}\cdot\delta\mathbf{D}/4\pi)\,dV. \tag{10.2}$$

It should be emphasized that the integration in (10.2) is over the whole field, including the vacuum if the dielectric does not occupy all space outside the conductor.

The work done on a thermally insulated body is just the change in its energy at constant entropy. Hence the expression (10.2) must be included in the thermodynamic relation which gives the infinitesimal change in the total energy of the body; the latter contains also the energy of the electric field. Denoting the total energy by $\mathscr{U}$, we therefore have

$$\delta\mathscr{U} = T\,\delta\mathscr{S} + \frac{1}{4\pi}\int\mathbf{E}\cdot\delta\mathbf{D}\,dV, \tag{10.3}$$

where $T$ is the temperature of the body and $\mathscr{S}$ its entropy.† Accordingly we have for the total free energy‡ $\mathscr{F} = \mathscr{U} - T\mathscr{S}$

$$\delta\mathscr{F} = -\mathscr{S}\,\delta T + \frac{1}{4\pi}\int\mathbf{E}\cdot\delta\mathbf{D}\,dV. \tag{10.4}$$

Similar thermodynamic relations can be obtained for the quantities pertaining to unit volume of the body. Let $U$, $S$ and $\rho$ be the internal energy, entropy and mass of unit volume. It is well known that the ordinary thermodynamic relation (in the absence of a field) for the internal energy of unit volume is $dU = T\,dS + \zeta\,d\rho$, where $\zeta$ is the chemical potential of the substance.§ In the presence of a field in a dielectric, there must be added the integrand in (10.3):

$$dU = T\,dS + \zeta\,d\rho + \mathbf{E}\cdot d\mathbf{D}/4\pi. \tag{10.5}$$

For the free energy per unit volume of the dielectric, $F = U - TS$, we therefore have

$$dF = -S\,dT + \zeta\,d\rho + \mathbf{E}\cdot d\mathbf{D}/4\pi. \tag{10.6}$$

These relations are the basis of the thermodynamics of dielectrics. We see that $U$ and $F$ are the thermodynamic potentials with respect to $S$, $\rho$, $\mathbf{D}$ and $T$, $\rho$, $\mathbf{D}$ respectively. In particular, we can obtain the field by differentiating these potentials with respect to the components of the vector $\mathbf{D}$:

$$\mathbf{E} = 4\pi(\partial U/\partial\mathbf{D})_{S,\rho} = 4\pi(\partial F/\partial\mathbf{D})_{T,\rho}. \tag{10.7}$$


† In (10.3) and (10.4) the volume of the body is assumed constant, but the body in general becomes inhomogeneous in an electric field, and so the volume no longer characterizes the state of the body.

‡ This quantity is meaningful only when the temperature is constant throughout the body.

§ See SP 1, §24. Instead of the mass density we there use the number of particles $N$ per unit volume; $\rho = Nm$, where $m$ is the mass of one particle. The chemical potentials $\zeta$ per unit mass and $\mu$ per particle are related by $\mu = m\zeta$.
The free energy is more convenient in this respect, since it is to be differentiated at constant temperature, whereas the internal energy must be expressed in terms of the entropy, which is less easy.

Together with $U$ and $F$, it is convenient to introduce thermodynamic potentials in which the components of the vector $\mathbf{E}$, instead of $\mathbf{D}$, are the independent variables. Such are

$$\tilde U = U - \mathbf{E}\cdot\mathbf{D}/4\pi, \qquad \tilde F = F - \mathbf{E}\cdot\mathbf{D}/4\pi. \tag{10.8}$$

On differentiating these we have

$$d\tilde U = T\,dS + \zeta\,d\rho - \mathbf{D}\cdot d\mathbf{E}/4\pi, \qquad d\tilde F = -S\,dT + \zeta\,d\rho - \mathbf{D}\cdot d\mathbf{E}/4\pi. \tag{10.9}$$

Hence, in particular,

$$\mathbf{D} = -4\pi(\partial\tilde U/\partial\mathbf{E})_{S,\rho} = -4\pi(\partial\tilde F/\partial\mathbf{E})_{T,\rho}. \tag{10.10}$$

It should be noticed that the relation between the thermodynamic quantities with and 
without the tilde is exactly that which occurs in §5 for the energy of the electrostatic field of 
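conductors in a vacuum. For the integral $\int\mathbf{E}\cdot\mathbf{D}\,dV$ can be transformed in an exactly similar manner to the one at the beginning of §2, with the equation $\operatorname{div}\mathbf{D} = 0$ inside the dielectric and the boundary condition $D_n = 4\pi\sigma$ on the surfaces of conductors:

$$\frac{1}{4\pi}\int\mathbf{E}\cdot\mathbf{D}\,dV = -\frac{1}{4\pi}\int\operatorname{grad}\phi\cdot\mathbf{D}\,dV = \frac{1}{4\pi}\sum_a\phi_a\oint D_n\,df = \sum_a e_a\phi_a. \tag{10.11}$$

Hence we have for the internal energy, for example,

$$\tilde{\mathscr{U}} = \mathscr{U} - \int\frac{\mathbf{E}\cdot\mathbf{D}}{4\pi}\,dV = \mathscr{U} - \sum_a e_a\phi_a, \tag{10.12}$$

in agreement with the definition (5.5).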

It is useful to derive also the formulae for infinitesimal changes in these quantities, expressed in terms of the charges and potentials of the conductors (the sources of the field). For example, the variation in the free energy (for a given temperature) is

$$(\delta\mathscr{F})_T = \delta R = \sum_a\phi_a\,\delta e_a. \tag{10.13}$$

For the variation of $\tilde{\mathscr{F}}$ we have

$$(\delta\tilde{\mathscr{F}})_T = (\delta\mathscr{F})_T - \delta\sum_a\phi_a e_a = -\sum_a e_a\,\delta\phi_a. \tag{10.14}$$

We can say that the quantities without the tilde are the thermodynamic potentials with respect to the charges on the conductors, while those with it are thermodynamic potentials with respect to their potentials.

It is known from thermodynamics that the various thermodynamic potentials have the 
property of being minima in a state of thermodynamic equilibrium, relative to various 
changes in the state of the body. In formulating these conditions of equilibrium in an 
electric field, it is necessary to say whether changes of state with constant charges on the conductors (the field sources) or those with constant potentials are being considered. For example, in equilibrium $\mathscr{F}$ and $\tilde{\mathscr{F}}$ are minima with respect to changes in state occurring at constant temperature and (respectively) constant charges and potentials of the conductors (the same is true for $\mathscr{U}$ and $\tilde{\mathscr{U}}$ at constant entropy).

If any processes (such as chemical reactions) which are not directly related to the electric field can occur in the body, the condition of equilibrium with respect to these processes is that $F$ be a minimum for given density, temperature and induction $\mathbf{D}$, or that $\tilde F$ be a minimum for constant density, temperature and field $\mathbf{E}$.

Hitherto we have made no assumptions concerning the dependence of $\mathbf{D}$ on $\mathbf{E}$, so that all the thermodynamic relations derived above are valid whatever the nature of this dependence. Let us now apply them to an isotropic dielectric, where a linear relation $\mathbf{D} = \varepsilon\mathbf{E}$ holds. In this case integration of (10.5) and (10.6) gives

$$U = U_0(S, \rho) + D^2/8\pi\varepsilon, \qquad F = F_0(T, \rho) + D^2/8\pi\varepsilon, \tag{10.15}$$

where $U_0$ and $F_0$ pertain to the dielectric in the absence of the field. Thus in this case the quantity

$$D^2/8\pi\varepsilon = \varepsilon E^2/8\pi = ED/8\pi \tag{10.16}$$

is the change in the internal energy (for given entropy and density) or in the free energy (for given temperature and density), per unit volume of the dielectric medium, resulting from the presence of the field.
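The integration of (10.6) for a linear dielectric, and the corresponding potential with $\mathbf{E}$ as the independent variable defined in (10.8), can be written out explicitly. The sketch assumes only that $\varepsilon$ depends on $T$ and $\rho$ and not on the field:

```latex
dF\big|_{T,\rho} = \frac{\mathbf{E}\cdot d\mathbf{D}}{4\pi}
                 = \frac{\mathbf{D}\cdot d\mathbf{D}}{4\pi\varepsilon}
                 = d\!\left(\frac{D^2}{8\pi\varepsilon}\right)
\;\Longrightarrow\;
F - F_0(T,\rho) = \frac{D^2}{8\pi\varepsilon} = \frac{\varepsilon E^2}{8\pi},
\qquad
\tilde F = F - \frac{\mathbf{E}\cdot\mathbf{D}}{4\pi}
         = F_0 + \frac{\varepsilon E^2}{8\pi} - \frac{\varepsilon E^2}{4\pi}
         = F_0(T,\rho) - \frac{\varepsilon E^2}{8\pi}.
```

The same steps at constant $S$ and $\rho$ apply to (10.5) and give the internal energy.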

The expressions for the potentials $\tilde U$ and $\tilde F$ are similarly

$$\tilde U = U_0(S, \rho) - \varepsilon E^2/8\pi, \qquad \tilde F = F_0(T, \rho) - \varepsilon E^2/8\pi. \tag{10.17}$$

We see that the differences $U - U_0$ and $\tilde U - U_0$ in this case differ only in sign, as they did for an electric field in a vacuum (§5). In a dielectric medium, however, this simple result holds good only when there is a linear relation between $\mathbf{D}$ and $\mathbf{E}$.

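We shall write out also, for future reference, formulae for the entropy density $S$ and the chemical potential $\zeta$, which follow from (10.15):

$$S = -\left(\frac{\partial F}{\partial T}\right)_{\rho,\mathbf{D}} = S_0(T, \rho) + \frac{E^2}{8\pi}\left(\frac{\partial\varepsilon}{\partial T}\right)_\rho, \tag{10.18}$$

$$\zeta = \left(\frac{\partial F}{\partial\rho}\right)_{T,\mathbf{D}} = \zeta_0(T, \rho) - \frac{E^2}{8\pi}\left(\frac{\partial\varepsilon}{\partial\rho}\right)_T. \tag{10.19}$$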

These quantities, of course, differ from zero only inside the dielectric. 

The total free energy is obtained by integrating (10.15) over all space. By (10.11) we have

$$\mathscr{F} - \mathscr{F}_0 = \int\mathbf{E}\cdot\mathbf{D}\,dV/8\pi = \tfrac12\sum_a e_a\phi_a. \tag{10.20}$$

This last expression is formally identical with the energy of the electrostatic field of conductors in a vacuum. The same result can be obtained directly by starting from the variation (10.13) for an infinitesimal change in the charges on the conductors. In the present case, when $\mathbf{D}$ and $\mathbf{E}$ are linearly related, all the field equations and their boundary conditions are also linear. Hence the potentials of the conductors must (as for the field in a vacuum) be linear functions of their charges, and integration of equation (10.13) gives (10.20).

It should be emphasized that these arguments do not presuppose the dielectric to fill all space outside the conductors. If, however, this is so, we can go further and use the results at the end of §7 to draw the following conclusion. For given charges on the conductors, the presence of the dielectric medium reduces by a factor $\varepsilon$ both the potentials of the conductors and the field energy, as compared with the values for a field in a vacuum. If, on the other hand, the potentials of the conductors are maintained constant, then their charges and the field energy are increased by a factor $\varepsilon$.

PROBLEM

Determine the height $h$ to which a liquid rises in a vertical plane capacitor.

Solution. For given potentials on the capacitor plates, $\tilde{\mathscr{F}}$ must be a minimum. $\tilde{\mathscr{F}}$ includes the energy $\tfrac12\rho gh^2$ of the liquid under gravity. From this condition we easily obtain $h = (\varepsilon - 1)E^2/8\pi\rho g$.
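The minimum condition can be illustrated numerically. The sketch below assumes Gaussian units and keeps only the part of the potential that depends on $h$, per unit horizontal cross-section of the gap: the field term $-(\varepsilon - 1)E^2h/8\pi$ from (10.17) together with the gravitational energy $\tfrac12\rho gh^2$. The function names and the default value of `g` (in cm/s²) are illustrative.

```python
import math


def rise_height(eps, E, rho, g=980.665):
    """Height of rise h = (eps - 1) E^2 / (8 pi rho g), Gaussian units."""
    return (eps - 1.0) * E**2 / (8.0 * math.pi * rho * g)


def potential_per_area(h, eps, E, rho, g=980.665):
    """h-dependent part of the thermodynamic potential at fixed plate
    potentials, per unit horizontal cross-section of the capacitor gap."""
    return -(eps - 1.0) * E**2 * h / (8.0 * math.pi) + 0.5 * rho * g * h**2


if __name__ == "__main__":
    eps, E, rho = 2.0, 10.0, 1.0
    h0 = rise_height(eps, E, rho)
    for h in (0.5 * h0, h0, 1.5 * h0):
        print(h, potential_per_area(h, eps, E, rho))
```

The printed potential is lowest at `h0`, which is the statement that the equilibrium height minimizes the potential at constant plate potentials.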


§11. The total free energy of a dielectric

The total free energy $\mathscr{F}$ (or the total internal energy $\mathscr{U}$), as defined in §10, includes the energy of the external electric field which polarizes the dielectric; this field may be imagined as being produced by a particular assembly of conductors with specified total charges. It is also meaningful to consider the total free energy less the energy of the field which would be present in all space if the body were absent. We denote this field by $\boldsymbol{\mathfrak{E}}$. Then the total free energy in this sense is

$$\int(F - \mathfrak{E}^2/8\pi)\,dV, \tag{11.1}$$

where $F$ is the free energy density. Here we shall denote this quantity by the letter $\mathscr{F}$ which in §10 signified $\int F\,dV$. It should be emphasized that the difference between the two definitions of $\mathscr{F}$ is a quantity independent of the thermodynamic state and properties of the dielectric, and hence it has no effect on the fundamental differential relations of thermodynamics pertaining to this quantity.†

Let us calculate the change in $\mathscr{F}$ resulting from an infinitesimal change in the field which occurs at constant temperature and does not destroy the thermodynamic equilibrium of the medium. Since $\delta F = \mathbf{E}\cdot\delta\mathbf{D}/4\pi$, we have $\delta\mathscr{F} = \int(\mathbf{E}\cdot\delta\mathbf{D} - \boldsymbol{\mathfrak{E}}\cdot\delta\boldsymbol{\mathfrak{E}})\,dV/4\pi$. This expression is identically equal to

$$\int(\mathbf{D} - \boldsymbol{\mathfrak{E}})\cdot\delta\boldsymbol{\mathfrak{E}}\,dV/4\pi + \int\mathbf{E}\cdot(\delta\mathbf{D} - \delta\boldsymbol{\mathfrak{E}})\,dV/4\pi - \int(\mathbf{D} - \mathbf{E})\cdot\delta\boldsymbol{\mathfrak{E}}\,dV/4\pi. \tag{11.2}$$


† It may be noted that there would be no sense in subtracting $E^2/8\pi$ from $F$, because $\mathbf{E}$ is the field as modified by the presence of the dielectric, and so the difference $F - E^2/8\pi$ could not be regarded as the free energy density of the dielectric as such.
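In the first integral we write $\delta\boldsymbol{\mathfrak{E}} = -\operatorname{grad}\delta\phi_0$ (where $\phi_0$ is the potential of the field $\boldsymbol{\mathfrak{E}}$) and integrate by parts:

$$\int\operatorname{grad}\delta\phi_0\cdot(\mathbf{D} - \boldsymbol{\mathfrak{E}})\,dV = \oint\delta\phi_0\,(\mathbf{D} - \boldsymbol{\mathfrak{E}})\cdot d\mathbf{f} - \int\delta\phi_0\operatorname{div}(\mathbf{D} - \boldsymbol{\mathfrak{E}})\,dV.$$

It is easy to see that both the integrals on the right-hand side are zero. For the volume integral this follows at once from the equations $\operatorname{div}\mathbf{D} = 0$ and $\operatorname{div}\boldsymbol{\mathfrak{E}} = 0$ which the induction in the dielectric and the field in the vacuum must respectively satisfy. The surface integral is taken over the surfaces of the conductors which produce the field and over an infinitely distant surface. The latter of these is, as usual, zero, and for each of the conductors $\delta\phi_0$ = constant, so that $\oint\delta\phi_0\,(\mathbf{D} - \boldsymbol{\mathfrak{E}})\cdot d\mathbf{f} = \delta\phi_0\oint(\mathbf{D} - \boldsymbol{\mathfrak{E}})\cdot d\mathbf{f}$. The field $\boldsymbol{\mathfrak{E}}$, by definition, is produced by the same sources as the field $\mathbf{E}$ and induction $\mathbf{D}$ (i.e. by the same conductors with given total charges $e$). Hence the two integrals $\oint D_n\,df$ and $\oint\mathfrak{E}_n\,df$ are both equal to $4\pi e$, and their difference is zero.

Similarly, we can see that the second term in (11.2) is also zero, by putting $\mathbf{E} = -\operatorname{grad}\phi$ and using the same transformation. Finally, we have

$$\delta\mathscr{F} = -\int(\mathbf{D} - \mathbf{E})\cdot\delta\boldsymbol{\mathfrak{E}}\,dV/4\pi = -\int\mathbf{P}\cdot\delta\boldsymbol{\mathfrak{E}}\,dV. \tag{11.3}$$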

It should be noticed that the integral in this expression need be taken only over the volume 
of the dielectric medium, since outside it P = 0. 

However, we must emphasize that the integrand $\mathbf{P}\cdot\delta\boldsymbol{\mathfrak{E}}$ cannot be interpreted as the
variation of the free energy density in the same way as was done with formulae (10.3), (10.4). 
First of all, this density must exist outside the body, which modifies the field in the 
surrounding space also. It is clear, moreover, that the energy density at any point in the 
body can depend only on the field actually present there, and not on the field which would 
be present if the body were removed. 

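If the external field $\boldsymbol{\mathfrak{E}}$ is uniform, then

$$\delta\mathscr{F} = -\delta\boldsymbol{\mathfrak{E}}\cdot\int\mathbf{P}\,dV = -\boldsymbol{\mathscr{P}}\cdot\delta\boldsymbol{\mathfrak{E}}, \tag{11.4}$$

where $\boldsymbol{\mathscr{P}}$ is the total electric dipole moment of the body. Hence the thermodynamic identity for the free energy can be written in this case as

$$d\mathscr{F} = -\mathscr{S}\,dT - \boldsymbol{\mathscr{P}}\cdot d\boldsymbol{\mathfrak{E}}. \tag{11.5}$$

The total electric moment of the body can therefore be obtained by differentiating the total free energy:

$$\boldsymbol{\mathscr{P}} = -(\partial\mathscr{F}/\partial\boldsymbol{\mathfrak{E}})_T. \tag{11.6}$$

The latter formula can also be obtained directly from the general statistical formula

$$\overline{\partial\hat{\mathscr{H}}/\partial\lambda} = (\partial\mathscr{F}/\partial\lambda)_{T,V},$$

where $\hat{\mathscr{H}}$ is the Hamiltonian of the body as the system of its component particles, and $\lambda$ is any parameter characterizing the external conditions in which the body is placed; see SP 1. For a body in a uniform external field $\boldsymbol{\mathfrak{E}}$, the Hamiltonian contains a term $-\hat{\boldsymbol{\mathscr{P}}}\cdot\boldsymbol{\mathfrak{E}}$, where $\hat{\boldsymbol{\mathscr{P}}}$ is the dipole moment operator. Taking $\boldsymbol{\mathfrak{E}}$ as the parameter $\lambda$, we obtain the required formula.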

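If $\mathbf{D}$ and $\mathbf{E}$ are connected by the linear relation $\mathbf{D} = \varepsilon\mathbf{E}$, we can similarly calculate explicitly not only the variation $\delta\mathscr{F}$ but $\mathscr{F}$ itself. We have

$$\mathscr{F} - \mathscr{F}_0 = \int(\mathbf{E}\cdot\mathbf{D} - \mathfrak{E}^2)\,dV/8\pi.$$

This can be identically transformed into

$$\mathscr{F} - \mathscr{F}_0 = \int(\mathbf{E} + \boldsymbol{\mathfrak{E}})\cdot(\mathbf{D} - \boldsymbol{\mathfrak{E}})\,dV/8\pi - \int\boldsymbol{\mathfrak{E}}\cdot(\mathbf{D} - \mathbf{E})\,dV/8\pi.$$

The first term on the right is zero, as we see by putting $\mathbf{E} + \boldsymbol{\mathfrak{E}} = -\operatorname{grad}(\phi + \phi_0)$ and again using the same transformation. Hence we have

$$\mathscr{F} - \mathscr{F}_0(V, T) = -\tfrac12\int\boldsymbol{\mathfrak{E}}\cdot\mathbf{P}\,dV. \tag{11.7}$$

In particular, in a uniform external field

$$\mathscr{F} - \mathscr{F}_0(V, T) = -\tfrac12\boldsymbol{\mathfrak{E}}\cdot\boldsymbol{\mathscr{P}}. \tag{11.8}$$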


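This last equation can also be obtained by direct integration of the relation (11.3) if we notice that, since all the field equations are linear when $\mathbf{D} = \varepsilon\mathbf{E}$, the electric moment $\boldsymbol{\mathscr{P}}$ must be a linear function of $\boldsymbol{\mathfrak{E}}$.

The linear relation between the components of $\boldsymbol{\mathscr{P}}$ and $\boldsymbol{\mathfrak{E}}$ can be written

$$\mathscr{P}_i = V\alpha_{ik}\mathfrak{E}_k, \tag{11.9}$$

as for conductors (§2). For a dielectric, however, the polarizability depends not only on the shape but also on the permittivity. The symmetry of the tensor $\alpha_{ik}$, mentioned in §2, follows at once from the relation (11.6); it is sufficient to notice that the second derivative $\partial^2\mathscr{F}/\partial\mathfrak{E}_i\,\partial\mathfrak{E}_k = -\partial\mathscr{P}_i/\partial\mathfrak{E}_k = -V\alpha_{ik}$ is independent of the order of differentiation.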

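Formula (11.7) becomes still simpler in the important case where $\varepsilon$ is close to 1, i.e. the dielectric susceptibility $\kappa = (\varepsilon - 1)/4\pi$ is small. In this case, in calculating the energy, we can neglect the modification of the field due to the presence of the body, putting $\mathbf{P} = \kappa\mathbf{E} \approx \kappa\boldsymbol{\mathfrak{E}}$. Then

$$\mathscr{F} - \mathscr{F}_0 = -\tfrac12\kappa\int\mathfrak{E}^2\,dV, \tag{11.10}$$

the integral being taken over the volume of the body. In a uniform field, the dipole moment $\boldsymbol{\mathscr{P}} = V\kappa\boldsymbol{\mathfrak{E}}$, and the free energy is

$$\mathscr{F} - \mathscr{F}_0 = -\tfrac12\kappa V\mathfrak{E}^2. \tag{11.11}$$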

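In the general case of an arbitrary relation between $\mathbf{D}$ and $\mathbf{E}$, the simple formulae (11.7) and (11.8) do not hold. Here the formula

$$\mathscr{F} = \int\left(F - \frac{\mathfrak{E}^2}{8\pi}\right)dV = \int\left(F - \frac{\mathbf{E}\cdot\mathbf{D}}{8\pi} - \tfrac12\boldsymbol{\mathfrak{E}}\cdot\mathbf{P}\right)dV \tag{11.12}$$

may be useful in calculating $\mathscr{F}$; its derivation is obvious after the above discussion. The two integrands differ by

$$\frac{\mathbf{E}\cdot\mathbf{D}}{8\pi} - \frac{\mathfrak{E}^2}{8\pi} + \tfrac12\boldsymbol{\mathfrak{E}}\cdot\mathbf{P} = \frac{1}{8\pi}(\mathbf{D} - \boldsymbol{\mathfrak{E}})\cdot(\mathbf{E} + \boldsymbol{\mathfrak{E}});$$

after substitution of $\mathbf{E} = -\operatorname{grad}\phi$, $\boldsymbol{\mathfrak{E}} = -\operatorname{grad}\phi_0$ and integration over all space the result is zero. In (11.12), as in (11.7), the second integrand is zero outside the body, where $\mathbf{P} = 0$ and $F = E^2/8\pi$, so that the integration is taken only over the volume of the body.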
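As a numerical illustration of (11.8) and of the weak-susceptibility formula (11.11), consider a sphere in vacuum, for which (8.10) with $n = \tfrac13$ gives the dipole moment exactly. The sketch compares the exact change in the free energy with the small-$\kappa$ approximation; the function names are illustrative.

```python
import math


def sphere_free_energy_exact(G, eps, V):
    """Change in free energy -(1/2) G * P from (11.8), with the dipole
    moment P of a sphere taken from (8.10) with n = 1/3."""
    P = V * (eps - 1.0) * G / (4.0 * math.pi * (1.0 + (eps - 1.0) / 3.0))
    return -0.5 * G * P


def free_energy_weak(G, eps, V):
    """Approximation (11.11): -(1/2) kappa V G^2, kappa = (eps - 1) / (4 pi)."""
    kappa = (eps - 1.0) / (4.0 * math.pi)
    return -0.5 * kappa * V * G**2


if __name__ == "__main__":
    for eps in (1.003, 1.3, 4.0):
        exact = sphere_free_energy_exact(1.0, eps, 1.0)
        approx = free_energy_weak(1.0, eps, 1.0)
        print(eps, exact, approx, exact / approx)
```

The ratio of the two is $1/[1 + \tfrac13(\varepsilon - 1)]$ for a sphere, so the approximation is good only while $\varepsilon - 1 \ll 1$.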
PROBLEM

Derive the formula which replaces (11.7) when the body is not in a vacuum but in a medium of permittivity $\varepsilon^{(e)}$.

Solution. Using the same transformations as before, we find

$$\mathscr{F} - \mathscr{F}_0(V, T) = -\frac{1}{8\pi}\int\boldsymbol{\mathfrak{E}}\cdot(\mathbf{D} - \varepsilon^{(e)}\mathbf{E})\,dV.$$


§12. Electrostriction of isotropic dielectrics 

For a solid dielectric in an electric field the concept of pressure cannot be defined as for 
an isotropic body in the absence of a field, because the forces acting on a dielectric (which 
we shall determine in §§15, 16) vary over the body, and are anisotropic even if the body 
itself is isotropic. An exact determination of the deformation (electrostriction) of such a 
body involves the solution of a complex problem in the theory of elasticity. 

However, matters are much simpler if we are interested only in the change in the total 
volume of the body. As we saw in §5, the shape of the body may then be regarded as 
unchanged, i.e. the deformation may be regarded as a uniform volume compression or 
expansion. 

We shall neglect the dielectric properties of the external medium (the atmosphere, for instance) in which the body is situated, i.e. we suppose that $\varepsilon = 1$. This medium thus serves merely to exert a uniform pressure on the surface of the body, which we shall denote by $P$. If $\mathscr{F}$ is the total free energy of the body, then we have the thermodynamic relation $P = -(\partial\mathscr{F}/\partial V)_T$, and accordingly the expression for the differential $d\mathscr{F}$ contains a term $-P\,dV$. For example, in a uniform external field, (11.5) becomes

$$d\mathscr{F} = -\mathscr{S}\,dT - P\,dV - \boldsymbol{\mathscr{P}}\cdot d\boldsymbol{\mathfrak{E}}.$$

We introduce the total thermodynamic potential (Gibbs function) of the body in accordance with the usual thermodynamic relation

$$\wp = \mathscr{F} + PV. \tag{12.1}$$

The differential of this quantity in a uniform external field is

$$d\wp = -\mathscr{S}\,dT + V\,dP - \boldsymbol{\mathscr{P}}\cdot d\boldsymbol{\mathfrak{E}}. \tag{12.2}$$

The change in the thermodynamic quantities in an external electric field is usually a relatively small quantity. It is known from the theorem of small increments, SP 1 (15.12), that a small change in the free energy (for given $T$ and $V$) is equal to the small change in the thermodynamic potential (for given $T$ and $P$). Hence, besides (11.8), we can write

$$\wp = \wp_0(P, T) - \tfrac12\boldsymbol{\mathfrak{E}}\cdot\boldsymbol{\mathscr{P}} \tag{12.3}$$

for the thermodynamic potential of a body in a uniform external field. Here $\wp_0$ is the value in the absence of the field and for given values of $P$ and $T$, while $\mathscr{F}_0$ in (11.8) is the free energy in the absence of the field and for given values of $V$ and $T$.

Making explicit the dependence of the dipole moment on $V$ and $\boldsymbol{\mathfrak{E}}$ according to (11.9), we can rewrite (12.3) as

$$\wp = \wp_0(P, T) - \tfrac12 V\alpha_{ik}\mathfrak{E}_i\mathfrak{E}_k, \tag{12.4}$$

here the correction term must be expressed as a function of temperature and pressure by 
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means of the equation of state for the body in the absence of the field. In particular, for a 
substance with small dielectric susceptibility this formula becomes simply 

go = go^(P.T)-^KVl&^- (12-5) 

cf. (11.11). . 

The required change in volume $V - V_0$ in the external field can now be obtained
immediately by differentiating $\varPhi$ with respect to pressure for constant $T$ and $\mathfrak{E}$. For
example, from (12.5) we have

$$V - V_0 = -\tfrac12\mathfrak{E}^2\left[\frac{\partial(\kappa V)}{\partial P}\right]_T. \tag{12.6}$$

This quantity may be either positive or negative (whereas, in electrostriction of conductors, 
the volume is always greater in the presence of the field). 

Similarly, we can calculate the amount of heat $Q$ absorbed in a dielectric when an
external electric field is isothermally applied (the external pressure being constant).†
Differentiation of $\varPhi - \varPhi_0$ with respect to temperature gives the change in the entropy of
the body, and by multiplying this by $T$ we obtain the required quantity of heat. For
example, from (12.5) we obtain

$$Q = \tfrac12 T\mathfrak{E}^2\left[\frac{\partial(\kappa V)}{\partial T}\right]_P. \tag{12.7}$$

Positive values of Q correspond to absorption of heat. 


PROBLEMS 

Problem 1. Determine the change in volume and the electrocaloric effect for a dielectric ellipsoid in a 
uniform electric field parallel to one of its axes. 

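Solution. From formulae (12.3) and (8.10) we have

$$\varPhi = \varPhi_0 - \frac{\mathfrak{E}^2 V}{8\pi}\,\frac{\varepsilon-1}{n\varepsilon+1-n}.$$

The relative change in volume is found to be‡

$$\frac{V-V_0}{V} = \frac{\mathfrak{E}^2}{8\pi}\left[\frac{\varepsilon-1}{n\varepsilon+1-n}\,\frac{1}{K} - \frac{1}{(n\varepsilon+1-n)^2}\left(\frac{\partial\varepsilon}{\partial P}\right)_T\right],$$

and the electrocaloric effect

$$Q = \frac{TV\mathfrak{E}^2}{8\pi}\left[\frac{\alpha(\varepsilon-1)}{n\varepsilon+1-n} + \frac{1}{(n\varepsilon+1-n)^2}\left(\frac{\partial\varepsilon}{\partial T}\right)_P\right],$$

where $1/K = -(1/V)(\partial V/\partial P)_T$ is the compressibility of the body, and $\alpha = (1/V)(\partial V/\partial T)_P$ the thermal expansion
coefficient.

In particular, for a plane disc in a field perpendicular to it, $n = 1$, so that

$$\frac{V-V_0}{V} = \frac{\mathfrak{E}^2}{8\pi}\left[\frac{\varepsilon-1}{\varepsilon K} - \frac{1}{\varepsilon^2}\left(\frac{\partial\varepsilon}{\partial P}\right)_T\right],\qquad Q = \frac{TV\mathfrak{E}^2}{8\pi}\left[\frac{\alpha(\varepsilon-1)}{\varepsilon} + \frac{1}{\varepsilon^2}\left(\frac{\partial\varepsilon}{\partial T}\right)_P\right];$$

for a similar disc (or any cylinder) in a longitudinal field, $n = 0$, and

$$\frac{V-V_0}{V} = \frac{\mathfrak{E}^2}{8\pi}\left[\frac{\varepsilon-1}{K} - \left(\frac{\partial\varepsilon}{\partial P}\right)_T\right],\qquad Q = \frac{TV\mathfrak{E}^2}{8\pi}\left[\alpha(\varepsilon-1) + \left(\frac{\partial\varepsilon}{\partial T}\right)_P\right].$$

† If the body is thermally insulated, the application of the field results in a change of temperature
$\Delta T = -Q/\mathscr{C}_P$, where $\mathscr{C}_P$ is the heat capacity at constant pressure.

‡ In the limit $\varepsilon \to \infty$, we find as the change in volume of a conducting ellipsoid $(V - V_0)/V = \mathfrak{E}^2/8\pi Kn$. For a
sphere, $n = \tfrac13$, and we recover the result in §5, Problem 4.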
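The ellipsoid results of Problem 1 are easy to evaluate numerically. The following is a minimal sketch, not part of the original text, assuming Gaussian units and the formulae $(V-V_0)/V = (\mathfrak{E}^2/8\pi)[(\varepsilon-1)/(n\varepsilon+1-n)K - (\partial\varepsilon/\partial P)_T/(n\varepsilon+1-n)^2]$ and $Q = (TV\mathfrak{E}^2/8\pi)[\alpha(\varepsilon-1)/(n\varepsilon+1-n) + (\partial\varepsilon/\partial T)_P/(n\varepsilon+1-n)^2]$. The function name and argument names are illustrative only.

```python
import math


def ellipsoid_response(field, n, eps, compressibility, deps_dP, alpha, deps_dT, T, V):
    """Relative volume change and absorbed heat of a dielectric ellipsoid.

    Hypothetical helper (names are not from the text). Gaussian units.
    field           -- external field along one principal axis of the ellipsoid
    n               -- depolarizing factor along that axis
    compressibility -- 1/K = -(1/V)(dV/dP)_T
    deps_dP, deps_dT -- (d eps/dP)_T and (d eps/dT)_P
    alpha           -- thermal expansion coefficient (1/V)(dV/dT)_P
    """
    g = n * eps + 1.0 - n                      # common denominator n*eps + 1 - n
    pref = field ** 2 / (8.0 * math.pi)        # E^2 / 8 pi
    dv_rel = pref * ((eps - 1.0) / g * compressibility - deps_dP / g ** 2)
    heat = T * V * pref * (alpha * (eps - 1.0) / g + deps_dT / g ** 2)
    return dv_rel, heat
```

Taking a very large `eps` with `deps_dP = 0` reproduces the conducting-ellipsoid limit $\mathfrak{E}^2/8\pi Kn$ quoted in the footnote, which is a convenient sanity check on the signs.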

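Problem 2. Determine the difference between the heat capacity $\mathscr{C}_\varphi$ of a plane disc in a field perpendicular to
it, with a constant potential difference between its faces, and the heat capacity $\mathscr{C}_D$ at constant induction, the
external pressure being maintained constant in each case.†

Solution. According to the results of Problem 1, the entropy of the disc is

$$\mathscr{S} = \mathscr{S}_0(P,T) + \frac{V\mathfrak{E}^2}{8\pi}\left[\frac{\alpha(\varepsilon-1)}{\varepsilon} + \frac{1}{\varepsilon^2}\left(\frac{\partial\varepsilon}{\partial T}\right)_P\right].$$

The induction inside the disc is the same as the external field: $D = \mathfrak{E}$. Hence, to calculate the heat capacity $\mathscr{C}_D$ we
must differentiate $\mathscr{S}$ for constant $\mathfrak{E}$. The potential difference between the faces of the disc is $\varphi = El = \mathfrak{E}l/\varepsilon$,
where $l$ is its thickness. For a uniform compression or expansion of a body, $l$ is proportional to $V^{1/3}$. Hence, to
calculate the heat capacity $\mathscr{C}_\varphi$ we must differentiate $\mathscr{S}$ for constant $\mathfrak{E}V^{1/3}/\varepsilon$. The required difference is found to be

$$\mathscr{C}_\varphi - \mathscr{C}_D = \frac{TV\mathfrak{E}^2}{4\pi}\left[\frac{\alpha(\varepsilon-1)}{\varepsilon} + \frac{1}{\varepsilon^2}\left(\frac{\partial\varepsilon}{\partial T}\right)_P\right]\left[\frac{1}{\varepsilon}\left(\frac{\partial\varepsilon}{\partial T}\right)_P - \frac{\alpha}{3}\right].$$
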
Problem 3. Determine the electrocaloric effect in a homogeneous dielectric whose total volume is kept
constant.

Solution. Strictly speaking, when an external field is applied the density of the body changes (and ceases to
be uniform), even if the total volume is kept constant. In calculating the change in the total entropy, however, we
can ignore this and assume the density $\rho$ constant at every point.‡
According to (10.18) the total entropy of the body is

$$\mathscr{S} = \mathscr{S}_0(\rho,T) + \frac{1}{8\pi}\left(\frac{\partial\varepsilon}{\partial T}\right)_\rho\int E^2\,dV,$$

where the integration is over the volume of the body. The amount of heat absorbed is

$$Q = \frac{T}{8\pi}\left(\frac{\partial\varepsilon}{\partial T}\right)_\rho\int E^2\,dV.$$

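Problem 4. Determine the difference $\mathscr{C}_\varphi - \mathscr{C}_D$ (see Problem 2) when the total volume of the disc is kept
constant.

Solution. When the volume, and therefore the thickness, of the disc are constant, differentiation for
constant potential difference is equivalent to differentiation for constant field $E$. Using the formula of Problem 3
for the entropy we have

$$\mathscr{C}_\varphi - \mathscr{C}_D = \frac{TVE^2}{4\pi\varepsilon}\left(\frac{\partial\varepsilon}{\partial T}\right)_\rho^2 = \frac{TV\mathfrak{E}^2}{4\pi\varepsilon^3}\left(\frac{\partial\varepsilon}{\partial T}\right)_\rho^2.$$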

Problem 5. A capacitor consists of two conducting surfaces at a distance $h$ apart which is small compared
with their dimensions; the space between them is filled with a substance of permittivity $\varepsilon_1$. A sphere of radius $a \ll h$
and permittivity $\varepsilon_2$ is placed in the capacitor. Determine the change in capacitance.

Solution. Let the sphere be placed in the capacitor in such a way that the potential difference $\varphi$ between the
plates remains unchanged. The free energy for constant potentials of the conductors is $\tilde{\mathscr{F}}$. In the absence of the
sphere, $\tilde{\mathscr{F}} = -\tfrac12 C_0\varphi^2$, where $C_0$ is the original capacitance. Since the sphere is small, we may imagine it to be
brought into a uniform field $\mathfrak{E} = \varphi/h$, and the change in $\tilde{\mathscr{F}}$ is small. The small change in $\tilde{\mathscr{F}}$ at constant potentials
is equal to the small change in $\mathscr{F}$ at constant charges on the sources of the field. Using the formula derived in §11,
Problem, together with the field inside a sphere placed in a uniform external field, we have

$$\tilde{\mathscr{F}} = -\tfrac12 C_0\varphi^2 - \tfrac12 a^3\,\frac{\varepsilon_1(\varepsilon_2-\varepsilon_1)}{2\varepsilon_1+\varepsilon_2}\,\frac{\varphi^2}{h^2},$$

whence the required capacitance is

$$C = C_0 + \frac{\varepsilon_1(\varepsilon_2 - \varepsilon_1)a^3}{(2\varepsilon_1 + \varepsilon_2)h^2}.$$

† $\mathscr{C}_\varphi$ is the heat capacity of a disc between the plates of a plane capacitor in circuit with a constant e.m.f. In an
unconnected capacitor with constant charges on the plates, the heat capacity of the disc is $\mathscr{C}_D$.

‡ The change in density $\delta\rho$ is of the second order with respect to the field ($\propto E^2$), and the consequent change in
$\mathscr{S}$ is of the fourth order: the term in the change of total entropy which is linear in $\delta\rho$ is
$(\partial S_0/\partial\rho)\int\delta\rho\,dV$, and the integral is zero because the total mass of the body is unaltered.
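As a quick numerical illustration of the capacitance result $C = C_0 + \varepsilon_1(\varepsilon_2-\varepsilon_1)a^3/(2\varepsilon_1+\varepsilon_2)h^2$, the sketch below evaluates the correction for given parameters. It is not part of the original text; the function name is hypothetical, Gaussian units are assumed, and the formula is only meaningful for $a \ll h$.

```python
def capacitance_with_sphere(c0, a, h, eps1, eps2):
    """Capacitance of a plane capacitor (gap h, filling eps1) after inserting
    a small sphere of radius a and permittivity eps2.

    Hypothetical helper; valid only for a << h (Gaussian units).
    """
    correction = eps1 * (eps2 - eps1) * a ** 3 / ((2.0 * eps1 + eps2) * h ** 2)
    return c0 + correction
```

The correction vanishes when `eps2 == eps1`, is positive for `eps2 > eps1`, and tends to `eps1 * a**3 / h**2` for a conducting sphere (`eps2` very large).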

§13. Dielectric properties of crystals 

In an anisotropic dielectric medium (a single crystal) the linear relation between the 
electric induction and the electric field is less simple, and does not reduce to a simple 
proportionality. 

The most general form of such a relation is 

$$D_i = D_{0i} + \varepsilon_{ik}E_k, \tag{13.1}$$

where $\mathbf{D}_0$ is a constant vector, and the quantities $\varepsilon_{ik}$ form a tensor of rank two, called the
permittivity tensor (or the dielectric tensor). The inhomogeneous term $\mathbf{D}_0$ in (13.1) does not,
however, appear for all crystals. The majority of the types of crystal symmetry do not admit
this constant vector (see below), and we then have simply

$$D_i = \varepsilon_{ik}E_k. \tag{13.2}$$

The tensor $\varepsilon_{ik}$ is symmetrical:

$$\varepsilon_{ik} = \varepsilon_{ki}. \tag{13.3}$$

In order to prove this, it is sufficient to use the thermodynamic relation (10.10) and to
observe that the second derivative $-4\pi\,\partial^2\tilde F/\partial E_k\,\partial E_i = \partial D_i/\partial E_k = \varepsilon_{ik}$ is independent of the
order of differentiation.

For $\tilde F$ itself we have (when (13.2) holds) the expression

$$\tilde F = F_0 - \varepsilon_{ik}E_iE_k/8\pi. \tag{13.4}$$

The free energy $F$ is

$$F = \tilde F + E_iD_i/4\pi = F_0 + \varepsilon^{-1}_{ik}D_iD_k/8\pi. \tag{13.5}$$

Like every symmetrical tensor of rank two, the tensor $\varepsilon_{ik}$ can be brought to diagonal
form by a suitable choice of the coordinate axes. In general, therefore, the tensor $\varepsilon_{ik}$ is
determined by three independent quantities, namely the three principal values $\varepsilon^{(x)}$, $\varepsilon^{(y)}$, $\varepsilon^{(z)}$.
All these are necessarily greater than unity, just as $\varepsilon > 1$ for an isotropic body (see §14).

The number of different principal values of the tensor $\varepsilon_{ik}$ may be less than three for
certain symmetries of the crystal. 

In crystals of the triclinic, monoclinic and orthorhombic systems, all three principal values
are different; such crystals are said to be biaxial.† In crystals of the triclinic system, the
directions of the principal axes of the tensor $\varepsilon_{ik}$ are not uniquely related to any directions in
the crystal. In those of the monoclinic system, one of the principal axes must coincide with
the twofold axis of symmetry or be perpendicular to the plane of symmetry of the crystal.
In crystals of the orthorhombic system, all three principal axes of the tensor $\varepsilon_{ik}$ are
crystallographically fixed.

Next, in crystals of the tetragonal, rhombohedral and hexagonal systems, two of the
three principal values are equal, so that there are only two independent quantities; such
crystals are said to be uniaxial. One of the principal axes coincides with the fourfold,
threefold or sixfold axis of crystal symmetry, but the directions of the other two principal
axes can be chosen arbitrarily.

† This name refers to the optical properties of the crystals; see §§98, 99.

Finally, in crystals of the cubic system all three principal values of the tensor $\varepsilon_{ik}$ are the
same, and the directions of the principal axes are entirely arbitrary. This means that the
tensor $\varepsilon_{ik}$ is of the form $\varepsilon\delta_{ik}$, i.e. it is determined by a single scalar $\varepsilon$. In other words, as
regards their dielectric properties, crystals of the cubic system are no different from
isotropic bodies.

All these fairly obvious symmetry properties of the tensor $\varepsilon_{ik}$ become particularly clear if
we use a concept from tensor algebra, the tensor ellipsoid, the lengths of whose semiaxes are 
proportional to the principal values of a symmetrical tensor of rank two. The symmetry of 
the ellipsoid corresponds to that of the crystal. For instance, in a uniaxial crystal the tensor 
ellipsoid degenerates into a spheroid completely symmetrical about the longitudinal axis; it 
should be emphasized that, as regards the physical properties of the crystal which are 
determined by a symmetrical tensor of rank two, the presence of a threefold or higher axis 
of symmetry is equivalent to complete isotropy in the plane perpendicular to this axis. In 
cubic crystals, the tensor ellipsoid degenerates into a sphere. 
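The classification into biaxial, uniaxial and isotropic (cubic) behaviour depends only on how many principal values of the symmetric tensor coincide. The following sketch, which is not part of the original text, computes the principal values of a real symmetric 3×3 tensor with the standard closed-form trigonometric solution of the characteristic cubic and counts the coincidences. The function names and the tolerance `tol` are assumptions made for the illustration; the closed form loses some accuracy for nearly degenerate values, hence the fairly loose default tolerance.

```python
import math


def principal_values(e):
    """Principal values (ascending) of a real symmetric 3x3 tensor given as nested lists."""
    p1 = e[0][1] ** 2 + e[0][2] ** 2 + e[1][2] ** 2
    q = (e[0][0] + e[1][1] + e[2][2]) / 3.0            # one third of the trace
    if p1 == 0.0:                                       # already diagonal
        return sorted(e[i][i] for i in range(3))
    p2 = sum((e[i][i] - q) ** 2 for i in range(3)) + 2.0 * p1
    p = math.sqrt(p2 / 6.0)
    # b = (e - q*I) / p has eigenvalues 2*cos(angle)
    b = [[(e[i][j] - (q if i == j else 0.0)) / p for j in range(3)] for i in range(3)]
    det_b = (b[0][0] * (b[1][1] * b[2][2] - b[1][2] * b[2][1])
             - b[0][1] * (b[1][0] * b[2][2] - b[1][2] * b[2][0])
             + b[0][2] * (b[1][0] * b[2][1] - b[1][1] * b[2][0]))
    r = max(-1.0, min(1.0, det_b / 2.0))                # clamp rounding errors
    phi = math.acos(r) / 3.0
    largest = q + 2.0 * p * math.cos(phi)
    smallest = q + 2.0 * p * math.cos(phi + 2.0 * math.pi / 3.0)
    return [smallest, 3.0 * q - largest - smallest, largest]


def classify(e, tol=1e-6):
    """Return 'biaxial', 'uniaxial' or 'isotropic' from coinciding principal values."""
    lo, mid, hi = principal_values(e)
    equal_pairs = int(abs(mid - lo) <= tol) + int(abs(hi - mid) <= tol)
    return ("biaxial", "uniaxial", "isotropic")[equal_pairs]
```

For example, the tensor with rows `[2, 1, 0]`, `[1, 2, 0]`, `[0, 0, 3]` has principal values 1, 3, 3, so its tensor ellipsoid is a spheroid and `classify` reports it as uniaxial.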

Let us now examine the dielectric properties of crystals for which the constant term $\mathbf{D}_0$
appears in (13.1). The presence of this term signifies that the dielectric is spontaneously
polarized even in the absence of an external electric field. Such bodies are said to be
pyroelectric. The magnitude of this spontaneous polarization is, however, in practice
always very small (in comparison with the molecular fields). This is because large values of
$\mathbf{D}_0$ would lead to strong fields within the body, which is energetically very unfavourable
and therefore could not correspond to thermodynamic equilibrium. The smallness of $\mathbf{D}_0$
also ensures the legitimacy of an expansion of $\mathbf{D}$ in powers of $\mathbf{E}$, of which (13.1) represents
the first two terms.

The thermodynamic quantities for a pyroelectric body are found by integrating the
relation $-4\pi\,\partial\tilde F/\partial E_i = D_i = D_{0i} + \varepsilon_{ik}E_k$, whence

$$\tilde F = F_0 - \varepsilon_{ik}E_iE_k/8\pi - E_iD_{0i}/4\pi. \tag{13.6}$$

The free energy is

$$F = \tilde F + E_iD_i/4\pi = F_0 + \varepsilon_{ik}E_iE_k/8\pi = F_0 + \varepsilon^{-1}_{ik}(D_i - D_{0i})(D_k - D_{0k})/8\pi. \tag{13.7}$$

It should be noted that the term in $\tilde F$ linear in $E_i$ does not appear in $F$.†

The total free energy of a pyroelectric can be calculated from formula (11.12) by
substituting (13.7) and (13.1). If there is no external field, $\mathfrak{E} = 0$, and we have simply

$$\mathscr{F} = \int\left[F_0 - \mathbf{E}\cdot\mathbf{D}_0/8\pi\right]dV. \tag{13.8}$$

It is remarkable that the free energy of a pyroelectric in the absence of an external field
depends, like the field $\mathbf{E}$, not only on the volume of the body but also on its shape.

† It should also be noted that in these formulae we neglect the piezoelectric effect, i.e. the effect of internal
stresses on the electric properties of a body; see §17. The formulae given here are therefore, strictly speaking,
applicable only when the fields are uniform throughout the body, and internal stresses do not arise.
As has already been pointed out, the phenomenon of pyroelectricity is not possible
for every crystal symmetry. Since, in any symmetry transformation, all the properties
of the crystal must remain unchanged, it is clear that the only crystals which can be
pyroelectric are those in which there is a direction which is unchanged (and, in particular, not
reversed) in all symmetry transformations, and that this will be the direction of the
constant vector $\mathbf{D}_0$.

This condition is satisfied only by those symmetry groups which consist of a single axis
together with planes of symmetry which pass through the axis. In particular, crystals
having a centre of symmetry certainly cannot be pyroelectric. We may enumerate those out
of the 32 crystal classes in which pyroelectricity occurs:
triclinic system: $C_1$
monoclinic system: $C_s$, $C_2$
orthorhombic system: $C_{2v}$
tetragonal system: $C_4$, $C_{4v}$
rhombohedral system: $C_3$, $C_{3v}$
hexagonal system: $C_6$, $C_{6v}$.

There are, of course, no pyroelectric cubic crystals. In a crystal of class $C_1$ the direction of
the pyroelectric vector $\mathbf{D}_0$ is not related to any direction fixed in the crystal; in one of class
$C_s$, it must lie in the plane of symmetry. In all the remaining classes listed above the
direction of $\mathbf{D}_0$ is that of the axis of symmetry.†

It should be mentioned that, under ordinary conditions, pyroelectric crystals have zero 
total electric dipole moment, although their polarization is not zero. The reason is that 
there is a non-zero field E inside a spontaneously polarized dielectric. Since a body usually 
has a small but non-zero conductivity, the presence of a field gives rise to a current, which 
flows until the free charges formed on the surface of the body annihilate the field inside it. 
The same effect is produced by ions deposited on the surface from the air. Experimentally, 
pyroelectric properties are observed when a body is heated and a change in its spontaneous 
polarization is detected. 


PROBLEMS 


Problem 1. Determine the field of a pyroelectric sphere in a vacuum. 

Solution. The field inside the sphere is uniform, and the field and induction are related by $2\mathbf{E} = -\mathbf{D}$ (as
follows from (8.1) when $\mathfrak{E} = 0$, i.e. when there is no applied external field). Substituting in (13.1), we obtain the
equation $2E_i + \varepsilon_{ik}E_k = -D_{0i}$. We take the coordinate axes to be the principal axes of the tensor $\varepsilon_{ik}$. Then this
equation gives $E_i = -D_{0i}/(2 + \varepsilon^{(i)})$. The polarization of the sphere is $P_i = (D_i - E_i)/4\pi = 3D_{0i}/4\pi(2 + \varepsilon^{(i)})$. The
field outside the sphere is that of an electric dipole with moment $\mathscr{P} = V\mathbf{P}$.

Problem 2. Determine the field of a point charge in a homogeneous anisotropic medium.‡

Solution. The field of a point charge is given by the equation $\operatorname{div}\mathbf{D} = 4\pi e\,\delta(\mathbf{r})$ (the charge being at the
origin). In an anisotropic medium $D_i = \varepsilon_{ik}E_k = -\varepsilon_{ik}\,\partial\varphi/\partial x_k$; taking the coordinate axes $x, y, z$ along the
principal axes of the tensor $\varepsilon_{ik}$, we obtain for the potential the equation

$$\varepsilon^{(x)}\frac{\partial^2\varphi}{\partial x^2} + \varepsilon^{(y)}\frac{\partial^2\varphi}{\partial y^2} + \varepsilon^{(z)}\frac{\partial^2\varphi}{\partial z^2} = -4\pi e\,\delta(\mathbf{r}).$$

By the introduction of new variables

$$x' = x/\sqrt{\varepsilon^{(x)}},\qquad y' = y/\sqrt{\varepsilon^{(y)}},\qquad z' = z/\sqrt{\varepsilon^{(z)}}, \tag{1}$$

this becomes

$$\triangle'\varphi = -\frac{4\pi e}{\sqrt{\varepsilon^{(x)}\varepsilon^{(y)}\varepsilon^{(z)}}}\,\delta(\mathbf{r}'),$$

which formally differs from the equation for the field in a vacuum only in that $e$ is replaced by
$e' = e/\sqrt{\varepsilon^{(x)}\varepsilon^{(y)}\varepsilon^{(z)}}$. Hence

$$\varphi = \frac{e'}{r'} = \frac{e}{\sqrt{\varepsilon^{(x)}\varepsilon^{(y)}\varepsilon^{(z)}}}\left[\frac{x^2}{\varepsilon^{(x)}} + \frac{y^2}{\varepsilon^{(y)}} + \frac{z^2}{\varepsilon^{(z)}}\right]^{-1/2}.$$

In tensor notation, independent of the system of coordinates chosen, we have
$$\varphi = e\big/\sqrt{|\varepsilon|\,\varepsilon^{-1}_{ik}x_ix_k},$$
where $|\varepsilon|$ is the determinant of the tensor $\varepsilon_{ik}$.

† In referring to symmetry conditions, we are regarding the crystal as an infinite medium. For a finite crystal,
the exact value of the total dipole moment may depend (in an ionic crystal) on which crystal planes form its faces
and whether these planes contain ions of only one sign or are electrically neutral. However, in macroscopic
electrodynamics, which implies averaging over physically infinitesimal volumes, it is reasonable to consider that
the position of the faces relative to the crystal lattice is averaged also. In consequence of this averaging, $\mathbf{D}_0$
vanishes in any non-pyroelectric finite crystal, and in a pyroelectric one is independent of the face configuration.

‡ In Problems 2-6 the anisotropic dielectric is assumed to be non-pyroelectric.
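The point-charge potential $\varphi = e/\sqrt{|\varepsilon|\,\varepsilon^{-1}_{ik}x_ix_k}$ is simple to evaluate in the principal axes, where $|\varepsilon| = \varepsilon^{(x)}\varepsilon^{(y)}\varepsilon^{(z)}$ and $\varepsilon^{-1}_{ik}x_ix_k = x^2/\varepsilon^{(x)} + y^2/\varepsilon^{(y)} + z^2/\varepsilon^{(z)}$. The sketch below is an illustration only (the function name is hypothetical, Gaussian units are assumed, and the coordinates must already refer to the principal axes).

```python
import math


def point_charge_potential(charge, eps_principal, r):
    """Potential of a point charge at the origin of a homogeneous anisotropic medium.

    eps_principal -- (eps_x, eps_y, eps_z), principal values of the permittivity tensor
    r             -- (x, y, z) in the principal-axis frame, not the origin
    """
    ex, ey, ez = eps_principal
    x, y, z = r
    det = ex * ey * ez                                  # |eps|
    quad = x * x / ex + y * y / ey + z * z / ez         # eps^{-1}_ik x_i x_k
    return charge / math.sqrt(det * quad)
```

In the isotropic case this reduces to $e/\varepsilon r$. Along the $x$ axis the result is $e/(\sqrt{\varepsilon^{(y)}\varepsilon^{(z)}}\,|x|)$, so the potential along a principal axis is governed by the two transverse principal values.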

Problem 3. Determine the capacitance of a conducting sphere, with radius a, in an anisotropic dielectric 
medium. 


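Solution. By the transformation shown in Problem 2, the determination of the field of a sphere with charge
$e$ in an anisotropic medium reduces to the determination of the field in a vacuum due to a charge $e'$ distributed
over the surface of the ellipsoid $\varepsilon_{ik}x'_ix'_k = \varepsilon^{(x)}x'^2 + \varepsilon^{(y)}y'^2 + \varepsilon^{(z)}z'^2 = a^2$. Using formula (4.14) for the potential due
to an ellipsoid, we find the required capacitance to be given by

$$\frac{1}{C} = \frac{1}{2\sqrt{\varepsilon^{(x)}\varepsilon^{(y)}\varepsilon^{(z)}}}\int_0^\infty\frac{d\xi}{\sqrt{(\xi + a^2/\varepsilon^{(x)})(\xi + a^2/\varepsilon^{(y)})(\xi + a^2/\varepsilon^{(z)})}}.$$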
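The capacitance of Problem 3 is given by an integral, $1/C = \bigl(2\sqrt{\varepsilon^{(x)}\varepsilon^{(y)}\varepsilon^{(z)}}\bigr)^{-1}\int_0^\infty d\xi\,\big/\sqrt{\prod_i(\xi + a^2/\varepsilon^{(i)})}$, that generally has to be evaluated numerically. The sketch below is one possible way to do so and is not taken from the text. It assumes the substitution $\xi = a^2(1/s^2 - 1)$, which maps the range to $0 < s \le 1$ and gives $1/C = \bigl(a\sqrt{\varepsilon^{(x)}\varepsilon^{(y)}\varepsilon^{(z)}}\bigr)^{-1}\int_0^1 ds\,\big/\sqrt{\prod_i(1 - b_is^2)}$ with $b_i = 1 - 1/\varepsilon^{(i)}$, and then applies composite Simpson integration. The function name and the `steps` parameter are hypothetical.

```python
import math


def sphere_capacitance(a, eps_principal, steps=2000):
    """Capacitance of a conducting sphere of radius a in an anisotropic medium.

    eps_principal -- positive principal values (eps_x, eps_y, eps_z)
    steps         -- even number of Simpson sub-intervals on 0 <= s <= 1
    """
    if steps % 2:
        raise ValueError("steps must be even for Simpson's rule")
    b = [1.0 - 1.0 / e for e in eps_principal]

    def integrand(s):
        s2 = s * s
        return 1.0 / math.sqrt((1.0 - b[0] * s2) * (1.0 - b[1] * s2) * (1.0 - b[2] * s2))

    h = 1.0 / steps
    total = integrand(0.0) + integrand(1.0)
    for k in range(1, steps):
        total += (4.0 if k % 2 else 2.0) * integrand(k * h)
    integral = total * h / 3.0

    ex, ey, ez = eps_principal
    inverse_c = integral / (a * math.sqrt(ex * ey * ez))
    return 1.0 / inverse_c
```

For an isotropic medium the transformed integral equals $\sqrt{\varepsilon}$ and the function returns $C = \varepsilon a$, as it should; this is a useful check before trying unequal principal values.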




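Problem 4. Determine the field in a flat anisotropic plate in a uniform external field $\mathfrak{E}$.

Solution. From the condition of continuity of the tangential component of the field it follows that
$\mathbf{E} = \mathfrak{E} + A\mathbf{n}$, where $\mathbf{E}$ is the uniform field inside the plate, $\mathbf{n}$ a unit vector normal to its surface, and $A$ a constant.
The constant is determined from the condition of continuity of the normal component of the induction,
$\mathbf{n}\cdot\mathbf{D} = \mathbf{n}\cdot\mathfrak{E}$, or $n_iD_i = n_i\varepsilon_{ik}E_k = n_i\varepsilon_{ik}\mathfrak{E}_k + A\varepsilon_{ik}n_in_k = \mathfrak{E}_in_i$. Hence

$$A = -\frac{(\varepsilon_{ik} - \delta_{ik})n_i\mathfrak{E}_k}{\varepsilon_{lm}n_ln_m}.$$

In particular, if the external field is along the normal to the plate (the $z$-direction), we have

$$A = \mathfrak{E}(1 - \varepsilon_{zz})/\varepsilon_{zz}.$$

If it is parallel to the plate and in the $x$-direction, then

$$A = -\mathfrak{E}\,\varepsilon_{zx}/\varepsilon_{zz}.$$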

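Problem 5. Determine the torque on an anisotropic dielectric sphere, with radius $a$, in a uniform external
field $\mathfrak{E}$ in a vacuum.

Solution. According to (8.2) we have for the field inside the sphere $E_x = 3\mathfrak{E}_x/(\varepsilon^{(x)} + 2)$, and similarly for $E_y$,
$E_z$. Here the axes of $x, y, z$ are taken to be the principal axes of the tensor $\varepsilon_{ik}$. Hence the components of the dipole
moment of the sphere are

$$\mathscr{P}_x = \frac{4\pi a^3}{3}\,\frac{\varepsilon^{(x)} - 1}{4\pi}\,E_x = a^3\,\frac{\varepsilon^{(x)} - 1}{\varepsilon^{(x)} + 2}\,\mathfrak{E}_x,\ \text{etc.}$$

The components of the torque on the sphere are

$$K_z = (\mathscr{P}\times\mathfrak{E})_z = \frac{3a^3\mathfrak{E}_x\mathfrak{E}_y(\varepsilon^{(x)} - \varepsilon^{(y)})}{(\varepsilon^{(x)} + 2)(\varepsilon^{(y)} + 2)},$$

and similarly for $K_x$, $K_y$.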
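The torque on the anisotropic sphere follows from $\mathbf{K} = \mathscr{P}\times\mathfrak{E}$ with $\mathscr{P}_i = a^3\mathfrak{E}_i(\varepsilon^{(i)} - 1)/(\varepsilon^{(i)} + 2)$ in the principal axes. The sketch below simply evaluates these two expressions; it is an illustration under the assumption that the field components are given in the principal-axis frame, and the function name is hypothetical.

```python
def sphere_dipole_and_torque(a, eps_principal, field):
    """Dipole moment and torque for an anisotropic dielectric sphere in vacuum.

    a             -- radius of the sphere
    eps_principal -- (eps_x, eps_y, eps_z)
    field         -- external field components along the principal axes (Gaussian units)
    """
    # P_i = a^3 (eps_i - 1)/(eps_i + 2) * field_i, no summation over i
    dipole = [a ** 3 * (e - 1.0) / (e + 2.0) * g for e, g in zip(eps_principal, field)]
    # K = P x field
    torque = [
        dipole[1] * field[2] - dipole[2] * field[1],
        dipole[2] * field[0] - dipole[0] * field[2],
        dipole[0] * field[1] - dipole[1] * field[0],
    ]
    return dipole, torque
```

The torque vanishes for an isotropic sphere and whenever the field lies along a principal axis; for a field in the $xy$-plane only $K_z$ is non-zero, and its value agrees with $3a^3\mathfrak{E}_x\mathfrak{E}_y(\varepsilon^{(x)} - \varepsilon^{(y)})/(\varepsilon^{(x)} + 2)(\varepsilon^{(y)} + 2)$.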

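Problem 6. An infinite anisotropic medium contains a spherical cavity with radius $a$. Express the field $\mathbf{E}^{(i)}$ in the
cavity in terms of the uniform field $\mathbf{E}^{(e)}$ far from the cavity.

Solution. The transformation (1) of Problem 2 reduces the equation for the field potential in the medium to
Laplace's equation for the field in a vacuum. The equation for the field potential in the cavity is transformed into
that for the potential in a medium with permittivities $1/\varepsilon^{(x)}$, $1/\varepsilon^{(y)}$, $1/\varepsilon^{(z)}$. Moreover, the sphere is transformed into
an ellipsoid with semiaxes $a/\sqrt{\varepsilon^{(x)}}$, $a/\sqrt{\varepsilon^{(y)}}$, $a/\sqrt{\varepsilon^{(z)}}$. Let $n^{(x)}$, $n^{(y)}$, $n^{(z)}$ be the depolarizing factors of such an
ellipsoid (given by formulae (4.25)). Applying formula (8.7) to the field of this ellipsoid, we obtain the relation

$$\left[1 - n^{(x)} + \frac{n^{(x)}}{\varepsilon^{(x)}}\right]\frac{\partial\varphi^{(i)}}{\partial x'} = \frac{\partial\varphi^{(e)}}{\partial x'},$$

and similarly for the $y'$ and $z'$ directions. Returning to the original coordinates, we have
$\partial\varphi/\partial x' = \sqrt{\varepsilon^{(x)}}\,\partial\varphi/\partial x = -\sqrt{\varepsilon^{(x)}}\,E_x$, so that the field in the cavity is

$$E_x^{(i)} = \frac{\varepsilon^{(x)}}{\varepsilon^{(x)} - (\varepsilon^{(x)} - 1)n^{(x)}}\,E_x^{(e)},$$

and similarly for $E_y^{(i)}$, $E_z^{(i)}$.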


§14. The sign of the dielectric susceptibility 

To elucidate the way in which the thermodynamic quantities for a dielectric in a field 
depend on its permittivity, let us consider the formal problem of the change in the electric 
component of the total free energy of the body when $\varepsilon$ undergoes an infinitesimal change.

For an isotropic (not necessarily homogeneous) body we have by (10.20)
$\mathscr{F} - \mathscr{F}_0 = \int(D^2/8\pi\varepsilon)\,dV$. When $\varepsilon$ changes, so does the induction, and the variation in the free
energy is therefore

$$\delta\mathscr{F} = \int\frac{\mathbf{E}\cdot\delta\mathbf{D}}{4\pi}\,dV - \int\frac{D^2}{8\pi\varepsilon^2}\,\delta\varepsilon\,dV.$$

The first term on the right is the same as (10.2), which gives the work done in an
infinitesimal change in the field sources (i.e. charges on conductors). In the present case,
however, we are considering a change in the field but no change in the sources. This term
therefore vanishes, leaving

$$\delta\mathscr{F} = -\int\frac{\delta\varepsilon}{\varepsilon^2}\,\frac{D^2}{8\pi}\,dV = -\int\delta\varepsilon\,\frac{E^2}{8\pi}\,dV. \tag{14.1}$$

From this formula it follows that any increase in the permittivity of the medium, even if 
in only a part of it (the sources of the field remaining unchanged), reduces the total free 
energy. In particular, we can say that the free energy is always reduced when uncharged 
conductors are brought into a dielectric medium, since these conductors may (in 
electrostatics) be regarded as bodies whose permittivity is infinite. This conclusion 
generalizes the theorem (§2) that the energy of the electrostatic field in a vacuum is 
diminished when an uncharged conductor is placed in it. 

The total free energy is diminished also when any charge is brought up to a dielectric
body from infinity (a process which may be regarded as an increase of $\varepsilon$ in a certain volume
of the field round the charge). In order to conclude from this that any charge is attracted to
a dielectric, we should, strictly speaking, prove also that $\mathscr{F}$ cannot attain a minimum for
any finite distance between the charge and the body. We shall not pause here to prove this
statement, especially as the presence of an attractive force between a charge and a dielectric
may be regarded as a fairly evident consequence of the interaction between the charge and
the dipole moment of the dielectric, which it polarizes.

We can deduce immediately from formula (14.1) the direction of motion of a dielectric
body in an almost uniform electric field, i.e. one which may be regarded as uniform over
the dimensions of the body. In this case $E^2$ is taken outside the integral, and the difference
$\mathscr{F} - \mathscr{F}_0$ is a negative quantity, proportional to $E^2$. In order to take a position in which its
free energy is a minimum, the body will therefore move in the direction of $E$ increasing.
It can be shown independently of (14.1) that the total change in the free energy of a
dielectric when it is placed in an electric field is negative.† This can be done by the use of
thermodynamic perturbation theory, the change in the free energy of the body being
regarded as the result of a perturbation of its quantum energy levels by the external electric
field. According to this theory we have

$$\mathscr{F} - \mathscr{F}_0 = \bar V_{nn} - \tfrac12\sum_n{\sum_m}'\,\frac{|V_{nm}|^2(w_m - w_n)}{E_n^{(0)} - E_m^{(0)}} - \frac{1}{2T}\,\overline{(V_{nn} - \bar V_{nn})^2}; \tag{14.2}$$

see SP 1, (32.6). Here $E_n^{(0)}$ are the unperturbed levels, $V_{nm}$ the matrix elements of the
perturbing energy, and the bar denotes a statistical averaging with respect to the Gibbs
distribution $w_n = \exp\{(\mathscr{F}_0 - E_n^{(0)})/T\}$.

The term $\bar V_{nn}$ in formula (14.2), which is linear in the field, is zero except in pyroelectric
bodies. The quadratic change in the free energy, which is of interest here, is given by the
remaining terms. It is evident that they are negative.

On the other hand, it is clear from the derivation of (14.2) that the total free energy $\mathscr{F}$
must be taken in this formula as described in §11, omitting the energy of the field which
would exist in the absence of the body. The difference $\mathscr{F} - \mathscr{F}_0$ is therefore given by the
thermodynamic formula (11.7). Let us consider a long narrow cylinder placed parallel to a
uniform external field $\mathfrak{E}$. The field within the cylinder is then $\mathfrak{E}$ also, and its polarization
$\mathbf{P} = (\varepsilon - 1)\mathfrak{E}/4\pi$, so that

$$\mathscr{F} - \mathscr{F}_0 = -(\varepsilon - 1)V\mathfrak{E}^2/8\pi.$$

Thus $\mathscr{F} - \mathscr{F}_0$ is negative only if $\varepsilon > 1$. This leads to the conclusion mentioned in §7 and
already made use of, namely that the permittivity of all bodies exceeds unity, and the
dielectric susceptibility $\kappa = (\varepsilon - 1)/4\pi$ is therefore positive.

In the same way we can prove the inequalities $\varepsilon^{(i)} > 1$ for the principal values of the
tensor $\varepsilon_{ik}$ in an anisotropic dielectric medium. To do so, it is sufficient to consider the
energy of a field parallel to each of the three principal axes in turn.


§15. Electric forces in a fluid dielectric 

The problem of calculating the forces (called ponderomotive forces) which act on a
dielectric in an arbitrary non-uniform electric field is fairly complicated and requires
separate consideration for fluids (liquids or gases) and for solids. We shall take first the
simpler case, that of fluid dielectrics. We denote by $\mathbf{f}\,dV$ the force on a volume element $dV$,
and call the vector $\mathbf{f}$ the force density.

It is well known that the forces acting on any finite volume in a body can be reduced to
forces applied to the surface of that volume (see TE, §2). This is a consequence of the law of
conservation of momentum. The force acting on the matter in a volume $dV$ is the change in
its momentum per unit time. This change must be equal to the amount of momentum
entering the volume through its surface per unit time. If we denote the momentum flux
tensor by $-\sigma_{ik}$, then

$$\int f_i\,dV = \oint\sigma_{ik}\,df_k, \tag{15.1}$$

where the integration on the right is over the surface of the volume $V$. The tensor $\sigma_{ik}$ is
called the stress tensor. It is evident that $\sigma_{ik}\,df_k = \sigma_{ik}n_k\,df$ is the $i$th component of the force
on a surface element $df$ ($\mathbf{n}$ being a unit vector along the normal to the surface outwards
from the volume under consideration).

† The change proportional to the square of the field is meant. It may be recalled that, in pyroelectric bodies,
the change in the free energy contains also a term linear in the field, which is of no interest here.

Similarly, the total torque acting on a given volume also reduces to a surface integral, by
virtue of the law of conservation of angular momentum. This reduction is possible because
of the symmetry of the stress tensor ($\sigma_{ik} = \sigma_{ki}$), which thus expresses the conservation law
mentioned.

On transforming the surface integral in (15.1) into a volume integral, we obtain
$\int f_i\,dV = \int(\partial\sigma_{ik}/\partial x_k)\,dV$, whence, since the volume of integration is arbitrary,

$$f_i = \partial\sigma_{ik}/\partial x_k. \tag{15.2}$$

This is a well-known formula giving the body forces in terms of the stress tensor. 

Let us now calculate the stress tensor. Any small region of the surface may be regarded 
as plane, and the properties of the body and the electric field near it as uniform. Hence, to 
simplify the derivation, we can with no loss of generality consider a plane-parallel layer of 
material (with thickness $h$ and uniform composition, density and temperature) in an
electric field which is uniform.‡ This field may be imagined to be due to conducting planes,
such as the plates of a capacitor, applied to the surfaces of the layer. 

Following the general method for determining forces, we subject one of the plates (the
upper one, say) to a virtual translation over an infinitesimal distance $\boldsymbol{\xi}$ whose direction is
arbitrary and need not be that of the normal $\mathbf{n}$. We shall suppose that the potential of the
conductor remains unchanged at every point, and that the homogeneous deformation of
the dielectric layer, resulting from the translation, is isothermal.

A force $-\sigma_{ik}n_k$ is exerted by the layer on unit area of the surface. In the virtual
displacement this force does work $-\sigma_{ik}n_k\xi_i$. The work done in an isothermal deformation
at constant potential is equal to the decrease in $\int\tilde F\,dV$, i.e. in $h\tilde F$ per unit surface area. Thus

$$\sigma_{ik}\xi_in_k = \delta(h\tilde F) = h\,\delta\tilde F + \tilde F\,\delta h. \tag{15.3}$$

The thermodynamic quantities for the fluid depend (for given temperature and field)
only on its density; deformations which do not change the density (i.e. pure shears) do not
affect the thermodynamic state. We can therefore write for an isothermal variation $\delta\tilde F$ in a
fluid

$$\delta\tilde F = \left(\frac{\partial\tilde F}{\partial\mathbf{E}}\right)_{T,\rho}\cdot\delta\mathbf{E} + \left(\frac{\partial\tilde F}{\partial\rho}\right)_{\mathbf{E},T}\delta\rho = -\frac{\mathbf{D}\cdot\delta\mathbf{E}}{4\pi} + \left(\frac{\partial\tilde F}{\partial\rho}\right)_{\mathbf{E},T}\delta\rho. \tag{15.4}$$

† The force component $f_i$ is not to be confused with the surface area component $df_i$.

‡ We thus ignore any terms in the stress tensor depending on the gradients of temperature, field, etc. These
terms, however, are vanishingly small in comparison with terms which do not contain derivatives, in the same way
as any terms containing derivatives which might appear in the relation between $\mathbf{D}$ and $\mathbf{E}$.
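The change in the density of the layer is related to the change in its thickness by
$\delta\rho = -\rho\,\delta h/h$. The variation of the field is calculated as follows. At a given point in space
(with position vector $\mathbf{r}$) there appears matter which was originally at $\mathbf{r} - \mathbf{u}$, where $\mathbf{u}$ is the
particle displacement vector in the layer. Since, under the conditions stated (homogeneous
deformation, and constant potential on the plates), each particle carries its potential with it,
the change in the potential at a given point in space is
$\delta\varphi = \varphi(\mathbf{r} - \mathbf{u}) - \varphi(\mathbf{r}) = -\mathbf{u}\cdot\operatorname{grad}\varphi = \mathbf{u}\cdot\mathbf{E}$, where $\mathbf{E}$ is the uniform field in the undeformed layer. Since the deformation is
homogeneous, however, we have

$$\mathbf{u} = z\boldsymbol{\xi}/h, \tag{15.5}$$

where $z$ is the distance from the lower surface. Hence the variation of the field is

$$\delta\mathbf{E} = -\mathbf{n}(\mathbf{E}\cdot\boldsymbol{\xi})/h. \tag{15.6}$$

Substituting the above expressions in (15.4) and using also the fact that $\delta h = \xi_z = \boldsymbol{\xi}\cdot\mathbf{n}$,
we obtain

$$\sigma_{ik}\xi_in_k = \frac{(\mathbf{n}\cdot\mathbf{D})(\boldsymbol{\xi}\cdot\mathbf{E})}{4\pi} + \boldsymbol{\xi}\cdot\mathbf{n}\left[\tilde F - \rho\left(\frac{\partial\tilde F}{\partial\rho}\right)_{\mathbf{E},T}\right].$$

Hence we have finally the following expression for the stress tensor:

$$\sigma_{ik} = \left[\tilde F - \rho\left(\frac{\partial\tilde F}{\partial\rho}\right)_{\mathbf{E},T}\right]\delta_{ik} + \frac{E_iD_k}{4\pi}. \tag{15.7}$$

In isotropic media, which are those here considered, $\mathbf{E}$ and $\mathbf{D}$ are parallel. Hence
$E_iD_k = E_kD_i$, and the tensor (15.7) is symmetrical, as it should be.†

If the linear relation $\mathbf{D} = \varepsilon\mathbf{E}$ holds, then

$$\tilde F = F_0(\rho,T) - \varepsilon E^2/8\pi; \tag{15.8}$$

see (10.17). $F_0$ is the free energy per unit volume in the absence of the field. According to a
well-known thermodynamic relation, the derivative of the free energy per unit mass with
respect to the specific volume is the pressure:

$$\left[\frac{\partial(F_0/\rho)}{\partial(1/\rho)}\right]_T = F_0 - \rho\left(\frac{\partial F_0}{\partial\rho}\right)_T = -P_0;$$

here $P_0(\rho,T)$ is the pressure which would be found in the medium in the absence of a field
and for given values of $\rho$ and $T$. Hence, substituting (15.8) in (15.7), we have

$$\sigma_{ik} = -P_0(\rho,T)\,\delta_{ik} - \frac{E^2}{8\pi}\left[\varepsilon - \rho\left(\frac{\partial\varepsilon}{\partial\rho}\right)_T\right]\delta_{ik} + \frac{\varepsilon E_iE_k}{4\pi}. \tag{15.9}$$

In a vacuum, this expression becomes the familiar Maxwell stress tensor of the electric
field.‡

† It is not important that in this derivation $\mathbf{E}$ is parallel to $\mathbf{n}$, since $\sigma_{ik}$ can obviously depend only on the
direction of $\mathbf{E}$, not on that of $\mathbf{n}$.

‡ See the first footnote to §5.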
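The stress tensor $\sigma_{ik} = -P_0\delta_{ik} - (E^2/8\pi)[\varepsilon - \rho(\partial\varepsilon/\partial\rho)_T]\delta_{ik} + \varepsilon E_iE_k/4\pi$ can be assembled directly from the field components. The following sketch is an illustration only, assuming Gaussian units; the function name and the choice to pass the combination $\rho(\partial\varepsilon/\partial\rho)_T$ as a single argument are conveniences introduced here, not notation from the text.

```python
import math


def fluid_stress_tensor(E, eps, rho_deps_drho, P0):
    """Stress tensor of a fluid dielectric with D = eps * E (Gaussian units).

    E             -- field components (Ex, Ey, Ez)
    eps           -- permittivity
    rho_deps_drho -- rho * (d eps / d rho)_T
    P0            -- pressure in the absence of the field at the same rho and T
    """
    E2 = sum(c * c for c in E)
    isotropic = -P0 - E2 / (8.0 * math.pi) * (eps - rho_deps_drho)
    return [
        [(isotropic if i == k else 0.0) + eps * E[i] * E[k] / (4.0 * math.pi)
         for k in range(3)]
        for i in range(3)
    ]
```

With `eps = 1`, `rho_deps_drho = 0` and `P0 = 0` the result is the vacuum Maxwell stress tensor: a tension $E^2/8\pi$ along the field and an equal pressure in the transverse directions. The returned tensor is symmetric by construction, in line with the remark that $E_iD_k = E_kD_i$ in isotropic media.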
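The forces exerted on the surface of separation by two adjoining media must be equal
and opposite: $\sigma_{ik}n_k = -\sigma'_{ik}n'_k$, where the quantities with and without the prime refer to
the two media. The normal vectors $\mathbf{n}$ and $\mathbf{n}'$ are in opposite directions, so that

$$\sigma_{ik}n_k = \sigma'_{ik}n_k. \tag{15.10}$$

At the boundary of two isotropic media the condition of equality of the tangential forces is
satisfied identically. For, substituting (15.7) in (15.10) and taking the tangential component,
we obtain $E_tD_n = E'_tD'_n$. This equation is satisfied by virtue of the boundary conditions of
continuity on $E_t$ and $D_n$. The condition of equality of the normal forces is, however, a
non-trivial condition on the pressure difference between the two media.

For example, let us consider a boundary between a liquid and the atmosphere (for which
we can put $\varepsilon = 1$). Denoting by a prime quantities pertaining to the atmosphere, and using
formula (15.9) for $\sigma_{ik}$, we have

$$-P_0(\rho,T) + \frac{E^2}{8\pi}\,\rho\left(\frac{\partial\varepsilon}{\partial\rho}\right)_T + \frac{\varepsilon}{8\pi}\,(E_n^2 - E_t^2) = -P_{\rm atm} + \frac{1}{8\pi}\,(E_n'^2 - E_t'^2).$$

Using the boundary conditions $E_t = E'_t$, $D_n = \varepsilon E_n = D'_n = E'_n$, we can rewrite this
equation as

$$P_0(\rho,T) - P_{\rm atm} = \frac{\rho E^2}{8\pi}\left(\frac{\partial\varepsilon}{\partial\rho}\right)_T - \frac{\varepsilon - 1}{8\pi}\,(\varepsilon E_n^2 + E_t^2). \tag{15.11}$$
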
This relation is to be taken as determining the density p of the liquid near its surface from 
the electric field in it. 

Let us now determine the body forces acting in a dielectric medium. Differentiating 
(15.9) in accordance with (15.2) gives 


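$$f_i = \frac{\partial}{\partial x_i}\left[-P_0 + \frac{E^2}{8\pi}\,\rho\left(\frac{\partial\varepsilon}{\partial\rho}\right)_T\right] - \frac{E^2}{8\pi}\,\frac{\partial\varepsilon}{\partial x_i} + \frac{1}{4\pi}\left[-D_k\frac{\partial E_k}{\partial x_i} + \frac{\partial(E_iD_k)}{\partial x_k}\right].$$

On using the equation $\operatorname{div}\mathbf{D} \equiv \partial D_k/\partial x_k = 0$, the expression in the brackets in the last term
can be reduced to

$$-D_k\frac{\partial E_k}{\partial x_i} + D_k\frac{\partial E_i}{\partial x_k} = -D_k\left(\frac{\partial E_k}{\partial x_i} - \frac{\partial E_i}{\partial x_k}\right),$$

which is zero, since $\operatorname{curl}\mathbf{E} = 0$. Thus we have

$$\mathbf{f} = -\operatorname{grad}P_0(\rho,T) + \frac{1}{8\pi}\operatorname{grad}\left[E^2\rho\left(\frac{\partial\varepsilon}{\partial\rho}\right)_T\right] - \frac{E^2}{8\pi}\operatorname{grad}\varepsilon \tag{15.12}$$

(H. Helmholtz, 1881).

If the dielectric contains extraneous charges with density $\rho_{\rm ex}$, the force $\mathbf{f}$ contains a
further term $\mathbf{E}\operatorname{div}\mathbf{D}/4\pi$, or, since $\operatorname{div}\mathbf{D} = 4\pi\rho_{\rm ex}$,

$$\rho_{\rm ex}\mathbf{E}; \tag{15.13}$$

however, it should not be supposed that this result is obvious.

In a gas, as already mentioned in §7, we can assume the difference $\varepsilon - 1$ to be
proportional to the density. Then $\rho\,\partial\varepsilon/\partial\rho = \varepsilon - 1$, and formula (15.12) takes the simpler form

$$\mathbf{f} = -\operatorname{grad}P_0 + \frac{\varepsilon - 1}{8\pi}\operatorname{grad}E^2. \tag{15.14}$$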
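The reduction of the general force formula to the gas case takes only one application of the product rule. As a short worked check (a sketch assuming only the gas relation $\rho\,\partial\varepsilon/\partial\rho = \varepsilon - 1$ stated above), substituting this relation into $\mathbf{f} = -\operatorname{grad}P_0 + (1/8\pi)\operatorname{grad}[E^2\rho(\partial\varepsilon/\partial\rho)_T] - (E^2/8\pi)\operatorname{grad}\varepsilon$ gives:

```latex
\begin{aligned}
\mathbf{f} &= -\operatorname{grad}P_0
   + \frac{1}{8\pi}\operatorname{grad}\bigl[E^2(\varepsilon-1)\bigr]
   - \frac{E^2}{8\pi}\operatorname{grad}\varepsilon \\
&= -\operatorname{grad}P_0
   + \frac{\varepsilon-1}{8\pi}\operatorname{grad}E^2
   + \frac{E^2}{8\pi}\operatorname{grad}\varepsilon
   - \frac{E^2}{8\pi}\operatorname{grad}\varepsilon \\
&= -\operatorname{grad}P_0 + \frac{\varepsilon-1}{8\pi}\operatorname{grad}E^2 .
\end{aligned}
```

The two terms in $\operatorname{grad}\varepsilon$ cancel identically, so in a gas the electric part of the force depends only on the gradient of $E^2$ and is directed towards stronger fields.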
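Formula (15.12) is valid for media of both uniform and non-uniform composition. In the
latter case $\varepsilon$ is a function not only of $\rho$ and $T$ but also of the concentration of the mixture,
which varies through the medium. In a body of uniform composition, on the other hand, $\varepsilon$
is a function only of $\rho$ and $T$, and $\operatorname{grad}\varepsilon$ can be written as

$$\operatorname{grad}\varepsilon = \left(\frac{\partial\varepsilon}{\partial T}\right)_\rho\operatorname{grad}T + \left(\frac{\partial\varepsilon}{\partial\rho}\right)_T\operatorname{grad}\rho.$$

Then (15.12) becomes

$$\mathbf{f} = -\operatorname{grad}P_0(\rho,T) + \frac{\rho}{8\pi}\operatorname{grad}\left[E^2\left(\frac{\partial\varepsilon}{\partial\rho}\right)_T\right] - \frac{E^2}{8\pi}\left(\frac{\partial\varepsilon}{\partial T}\right)_\rho\operatorname{grad}T. \tag{15.15}$$

If the temperature also is constant throughout the body, the third term on the right is zero,
and in the first term $\operatorname{grad}P_0$ can be replaced by $\rho\operatorname{grad}\zeta_0$, in accordance with the
thermodynamic identity for the chemical potential in the absence of a field,
$\rho\,d\zeta_0 = dP_0 - S_0\,dT$. Thus

$$\mathbf{f} = -\rho\operatorname{grad}\left[\zeta_0 - \frac{E^2}{8\pi}\left(\frac{\partial\varepsilon}{\partial\rho}\right)_T\right] = -\rho\operatorname{grad}\zeta, \tag{15.16}$$

where $\zeta$ is the chemical potential in an electric field (see (10.19)).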

In particular, the condition of mechanical equilibrium f = 0 is, for constant 
temperature, 

$$\zeta = \zeta_0 - \frac{E^2}{8\pi}\left(\frac{\partial\varepsilon}{\partial\rho}\right)_T = \text{constant}, \tag{15.17}$$

in accordance with the thermodynamic condition of equilibrium. This condition can
usually be written still more simply. The change in density of the medium due to the field is
proportional to $E^2$. Hence, if the medium is of uniform density in the absence of the field,
we can put $\rho$ = constant in the last two terms in (15.15) when the field is present; an
allowance for the change in $\rho$ is beyond the accuracy of formulae which assume the linear
relation $\mathbf{D} = \varepsilon\mathbf{E}$. Then, equating to zero $\mathbf{f}$ from (15.15), we obtain the equilibrium
condition at constant temperature in the form

$$P_0(\rho,T) - \frac{\rho E^2}{8\pi}\left(\frac{\partial\varepsilon}{\partial\rho}\right)_T = \text{constant}, \tag{15.18}$$

which differs from (15.17) in that $\zeta_0$ is replaced by $P_0/\rho$.
To close this section, we shall show how the expression (15.12) for the force may be
derived directly from (14.1) if the calculation of the stress tensor is not required.
Let us consider an infinite inhomogeneous dielectric medium subjected to a small
isothermal deformation that is zero at infinity. The variation $\delta\varepsilon$ is made up of two parts:
(1) the change

$$\varepsilon(\mathbf{r} - \mathbf{u}) - \varepsilon(\mathbf{r}) = -\mathbf{u}\cdot\operatorname{grad}\varepsilon$$

due to the fact that a particle is brought by the deformation from $\mathbf{r} - \mathbf{u}$ to $\mathbf{r}$, and (2) the
change $-(\partial\varepsilon/\partial\rho)_T\,\rho\operatorname{div}\mathbf{u}$, due to the change in the density of the substance at the point $\mathbf{r}$;
it is known (see TE, §1) that $\operatorname{div}\mathbf{u}$ is the relative change in the volume element, so that the
change in the density is $\delta\rho = -\rho\operatorname{div}\mathbf{u}$. The variation in the free energy is therefore

$$\delta\mathscr{F} = \delta\mathscr{F}_0 - \int \delta\varepsilon\,(E^2/8\pi)\,dV = -\int P_0 \operatorname{div}\mathbf{u}\,dV + \int (E^2/8\pi)\left[\mathbf{u}\cdot\operatorname{grad}\varepsilon + (\partial\varepsilon/\partial\rho)_T\,\rho\operatorname{div}\mathbf{u}\right]dV; \tag{15.19}$$

the first term is the variation in the free energy when the field is absent. Integrating by parts
the $\operatorname{div}\mathbf{u}$ terms in (15.19) and comparing the result with the expression $\delta\mathscr{F} = -\int \mathbf{u}\cdot\mathbf{f}\,dV$
for the free energy variation in terms of the work done by the forces $\mathbf{f}$, we arrive at (15.12).


§16. Electric forces in solids 

The dielectric properties of a solid body change not only when its density changes (as
with liquids) but also under deformations (pure shears) which do not affect the density. Let
us first consider bodies which are isotropic in the absence of the field. In general, the
deformed body is no longer isotropic; in consequence, its dielectric properties also become
anisotropic, and the scalar permittivity $\varepsilon$ is replaced by the dielectric tensor $\varepsilon_{ik}$.
The state of a slightly deformed body is described by the strain tensor

$$u_{ik} = \frac{1}{2}\left(\frac{\partial u_i}{\partial x_k} + \frac{\partial u_k}{\partial x_i}\right),$$

where $\mathbf{u}(x, y, z)$ is the displacement vector for points in the body. Since these quantities are
small, only the first-order terms in $u_{ik}$ need be retained in the variation of the components
$\varepsilon_{ik}$. Accordingly, we represent the dielectric tensor of the deformed body as

$$\varepsilon_{ik} = \varepsilon_0\delta_{ik} + a_1 u_{ik} + a_2 u_{ll}\delta_{ik}. \tag{16.1}$$

Here $\varepsilon_0$ is the permittivity of the undeformed body, and the other two terms, which contain
the scalar constants $a_1$, $a_2$, form the most general tensor of rank two which can be
constructed linearly from the components $u_{ik}$.

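Let us now see where the derivation given in §15 must be modified. Since, in a solid
body, $\tilde{F}$ depends on all the components of the strain tensor, we must replace (15.4) by
$\delta\tilde{F} = -\mathbf{D}\cdot\delta\mathbf{E}/4\pi + (\partial\tilde{F}/\partial u_{ik})\,\delta u_{ik}$. For the virtual displacement considered, the vector $\mathbf{u}$ is
given by formula (15.5), so that the strain tensor is $\delta u_{ik} = (\xi_i n_k + \xi_k n_i)/2h$. Substituting this
in $\delta\tilde{F}$ and using the symmetry of the tensor $u_{ik}$ and therefore of the derivatives $\partial\tilde{F}/\partial u_{ik}$, we
obtain

$$\delta\tilde{F} = -\mathbf{D}\cdot\delta\mathbf{E}/4\pi + (\xi_i n_k/h)\,\partial\tilde{F}/\partial u_{ik}. \tag{16.2}$$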


It is now evident that we find, instead of (15.7), the following expression for the stress
tensor:†

$$\sigma_{ik} = \tilde{F}\delta_{ik} + (\partial\tilde{F}/\partial u_{ik})_{T,\mathbf{E}} + E_i D_k/4\pi. \tag{16.3}$$

Formula (16.3) is valid whatever the relation between $\mathbf{D}$ and $\mathbf{E}$. For a body which is
neither pyroelectric nor piezoelectric, so that $D_i = \varepsilon_{ik}E_k$, $\tilde{F}$ is given by the corresponding
formula in §13, and the required derivatives are
$\partial\tilde{F}/\partial u_{ik} = \partial F_0/\partial u_{ik} - (a_1 E_i E_k + a_2 E^2\delta_{ik})/8\pi$. We then put
$\varepsilon_{ik} = \varepsilon_0\delta_{ik}$ everywhere in (16.3) and obtain the following formula for the stress tensor:

$$\sigma_{ik} = \sigma^{(0)}_{ik} + (2\varepsilon_0 - a_1)E_i E_k/8\pi - (\varepsilon_0 + a_2)E^2\delta_{ik}/8\pi, \tag{16.4}$$

where $\sigma^{(0)}_{ik}$ is the stress tensor in the absence of an electric field, determined by the moduli of
rigidity and compression according to the ordinary formulae of the theory of elasticity.

† The quantity $\tilde{F}$ in this formula, and in all preceding formulae, is the free energy per unit volume. In the
theory of elasticity, however, a somewhat different definition is usual: the thermodynamic quantities are referred
to the amount of matter contained in unit volume of the undeformed body, which may after deformation occupy
some other volume. It is easy to go from one definition to the other by expressing the relative volume change in
the deformation in terms of the tensor $u_{ik}$; on account of the presence of the derivative with respect to $u_{ik}$ in (16.3),
this must be done with allowance for second-order terms. As a result, the first two terms on the right of (16.3)
take the form $\partial\tilde{F}/\partial u_{ik}$, in accordance with the usual formula of elasticity theory.

Let us now make similar calculations for anisotropic solids.† The necessary modification
of the above argument is as follows. When the layer undergoes a virtual
deformation, its crystallographic axes are rotated, and their orientation relative to the
electric field is therefore changed. On account of the anisotropy of the dielectric properties
of the crystal, this leads to an additional change in $\tilde{F}$ not shown in (16.2). To calculate this
change we can equally well suppose that the crystal axes rotate through some angle $\delta\boldsymbol{\phi}$
relative to the field $\mathbf{E}$, or that the field rotates through an angle $-\delta\boldsymbol{\phi}$ relative to the axes,
and the latter approach is the more convenient.

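Thus the variation of the field (15.6) considered above must be augmented by the change
in $\mathbf{E}$ on rotation through an angle $-\delta\boldsymbol{\phi}$:

$$\delta\mathbf{E} = -\mathbf{n}(\mathbf{E}\cdot\boldsymbol{\xi})/h - \delta\boldsymbol{\phi}\times\mathbf{E}.$$

The angle $\delta\boldsymbol{\phi}$ is related to the displacement vector $\mathbf{u}$ in the deformation by $\delta\boldsymbol{\phi} = \tfrac{1}{2}\operatorname{curl}\mathbf{u}$;
this equation is easily obtained by noticing that, when the body rotates through an angle
$\delta\boldsymbol{\phi}$, its points are displaced by $\mathbf{u} = \delta\boldsymbol{\phi}\times\mathbf{r}$. Substituting $\mathbf{u}$ from (15.5), we find
$\delta\boldsymbol{\phi} = \operatorname{curl}(z\boldsymbol{\xi})/2h = \mathbf{n}\times\boldsymbol{\xi}/2h$, and
$\delta\mathbf{E} = -\mathbf{n}(\mathbf{E}\cdot\boldsymbol{\xi})/h + \mathbf{E}\times(\mathbf{n}\times\boldsymbol{\xi})/2h = -[\mathbf{n}(\mathbf{E}\cdot\boldsymbol{\xi}) + \boldsymbol{\xi}(\mathbf{n}\cdot\mathbf{E})]/2h$.
The first term in (16.2) becomes

$$-\frac{1}{4\pi}\mathbf{D}\cdot\delta\mathbf{E} = \frac{1}{8\pi h}\left[(\mathbf{n}\cdot\mathbf{D})(\boldsymbol{\xi}\cdot\mathbf{E}) + (\boldsymbol{\xi}\cdot\mathbf{D})(\mathbf{n}\cdot\mathbf{E})\right] = \frac{\xi_i n_k}{4\pi h}\cdot\tfrac{1}{2}(E_i D_k + E_k D_i).$$

Hence we see that the product $E_i D_k$ in (16.3) must be replaced by the second factor in the
last expression:

$$\sigma_{ik} = \tilde{F}\delta_{ik} + (\partial\tilde{F}/\partial u_{ik})_{T,\mathbf{E}} + \frac{1}{8\pi}(E_i D_k + E_k D_i). \tag{16.5}$$

This expression is symmetrical in the suffixes $i$ and $k$, as it should be.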
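The vector algebra leading to (16.5) can be spot-checked numerically. The following is a minimal sketch, not part of the original derivation: it draws random vectors for E, D, n and ξ (the function names and the random test data are illustrative assumptions) and compares the variation of E including the rotation term with its symmetrized closed form, and the work term −D·δE/4π with its index form.

```python
import math
import random


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]


def delta_E_with_rotation(E, n, xi, h):
    """delta E = -n (E.xi)/h - dphi x E, with dphi = (n x xi)/(2h)."""
    dphi = [c / (2.0 * h) for c in cross(n, xi)]
    rot = cross(dphi, E)
    shift = dot(E, xi) / h
    return [-n[i] * shift - rot[i] for i in range(3)]


def delta_E_symmetrized(E, n, xi, h):
    """Closed form -[n (E.xi) + xi (n.E)]/(2h)."""
    return [-(n[i] * dot(E, xi) + xi[i] * dot(n, E)) / (2.0 * h)
            for i in range(3)]


def work_term(D, dE):
    """First term of (16.2): -D.dE/(4 pi)."""
    return -dot(D, dE) / (4.0 * math.pi)


def work_term_index_form(E, D, n, xi, h):
    """xi_i n_k (E_i D_k + E_k D_i)/(8 pi h), summed over i and k."""
    total = 0.0
    for i in range(3):
        for k in range(3):
            total += xi[i] * n[k] * (E[i] * D[k] + E[k] * D[i])
    return total / (8.0 * math.pi * h)


def random_vector(rng):
    return [rng.uniform(-1.0, 1.0) for _ in range(3)]
```

Because the identity is purely algebraic, it holds for arbitrary vectors; D need not be parallel to E, which is the anisotropic case of interest here.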

The expression (16.1) for the dielectric tensor, involving two scalar constants, must be
replaced in the case of a deformed crystal by

$$\varepsilon_{ik} = \varepsilon^{(0)}_{ik} + a_{iklm}u_{lm}, \tag{16.6}$$

where $a_{iklm}$ is a constant tensor of rank four, symmetrical with respect to the pairs of
suffixes $i$, $k$ and $l$, $m$ (but not with respect to an interchange of these pairs). The number of
independent non-zero components of this tensor depends on the crystal class.

We shall not pause to write out here the formula for the stress tensor (analogous to
(16.4)) which is obtained by using (16.6).

The formulae which we have obtained give the stresses inside a solid dielectric. They are
not needed, however, if we wish to determine the total force $\mathbf{F}$ or the total torque $\mathbf{K}$ exerted
on the body by the external field. Let us consider a body immersed in a fluid medium and
kept at rest there. The total force on it is equal to the integral $\oint \sigma_{ik}n_k\,df$ taken over the
surface. Since the force $\sigma_{ik}n_k$ is continuous, it does not matter whether this integral is
calculated from the values of $\sigma_{ik}$ given by (16.4) or from formula (15.9), which relates to the

† We shall see in §17 that the phenomenon of electrostriction in crystals may, for some types of symmetry,
differ markedly from that in isotropic bodies. Such crystals are said to be piezoelectric. Here, however, we discuss
only electrostriction in non-piezoelectric crystals.
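medium surrounding the body. If the fluid is in mechanical and thermal equilibrium, the
calculation can be further simplified by using the condition of equilibrium (15.18). From
this condition, part of the stress tensor (15.9) is constant over the surface of the body,
being a uniform compressing or expanding pressure and making no
contribution to the total force $\mathbf{F}$ and torque $\mathbf{K}$ acting on the body. These can therefore be
calculated by writing $\sigma_{ik}$ simply as

$$\sigma_{ik} = (\varepsilon/4\pi)(E_i E_k - \tfrac{1}{2}E^2\delta_{ik}), \tag{16.7}$$

where $\mathbf{E}$ is the field in the fluid and $\varepsilon$ its permittivity; this expression differs only by
a factor $\varepsilon$ from the Maxwell stress tensor of the electric field in a vacuum. Thus

$$\mathbf{F} = (\varepsilon/4\pi)\oint\left[\mathbf{E}(\mathbf{n}\cdot\mathbf{E}) - \tfrac{1}{2}E^2\mathbf{n}\right]df, \tag{16.8}$$

$$\mathbf{K} = (\varepsilon/4\pi)\oint\left[\mathbf{r}\times\mathbf{E}(\mathbf{n}\cdot\mathbf{E}) - \tfrac{1}{2}E^2\,\mathbf{r}\times\mathbf{n}\right]df. \tag{16.9}$$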


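It may also be noted that, since the fluid is in equilibrium, we can take these integrals
over any closed surface which surrounds the body in question (but, of course, does not
enclose any of the charged bodies which are sources of the field).

The calculation of the total force on a dielectric in an electric field can
be approached in another way by expressing this force, not in terms of the actual field, but
in terms of the field $\boldsymbol{\mathfrak{E}}$ which would be produced by the given sources in the absence of the
dielectric; this is the "external field" in which the body is placed. Here it is assumed that the
distribution of charges producing the field is unchanged when the body is brought in. This
condition may not be satisfied in practice, for example if the charges are on
the surface of an extended conductor and the dielectric is brought to a finite distance
from it.

In a virtual translation of the body over an infinitesimal distance $\mathbf{u}$, the total free energy
of the body varies, according to (11.3), by $\delta\mathscr{F} = -\int \mathbf{P}\cdot\delta\boldsymbol{\mathfrak{E}}\,dV$, where
$\delta\boldsymbol{\mathfrak{E}}(\mathbf{r}) = (\mathbf{u}\cdot\operatorname{grad})\boldsymbol{\mathfrak{E}}$ is the change in the field at any given point in the body. Since
$\mathbf{u}$ = constant and $\operatorname{curl}\boldsymbol{\mathfrak{E}} = 0$, we have
$\mathbf{P}\cdot(\mathbf{u}\cdot\operatorname{grad})\boldsymbol{\mathfrak{E}} = \mathbf{P}\cdot\operatorname{grad}(\mathbf{u}\cdot\boldsymbol{\mathfrak{E}}) = \mathbf{u}\cdot(\mathbf{P}\cdot\operatorname{grad})\boldsymbol{\mathfrak{E}}$, so that

$$\delta\mathscr{F} = -\mathbf{u}\cdot\int(\mathbf{P}\cdot\operatorname{grad})\boldsymbol{\mathfrak{E}}\,dV.$$

But $\delta\mathscr{F} = -\mathbf{u}\cdot\mathbf{F}$, and we therefore have for the required force†

$$\mathbf{F} = \int(\mathbf{P}\cdot\operatorname{grad})\boldsymbol{\mathfrak{E}}\,dV. \tag{16.10}$$

Similarly, the total torque on the body can be determined. We shall not go through the
calculation, but merely give the result:

$$\mathbf{K} = \int\mathbf{P}\times\boldsymbol{\mathfrak{E}}\,dV + \int\mathbf{r}\times(\mathbf{P}\cdot\operatorname{grad})\boldsymbol{\mathfrak{E}}\,dV. \tag{16.11}$$

In an almost uniform field, which may be regarded as constant over the dimensions of
the body, formula (16.10) gives to a first approximation

$$\mathbf{F} = \left(\int\mathbf{P}\,dV\cdot\operatorname{grad}\right)\boldsymbol{\mathfrak{E}} = (\boldsymbol{\mathscr{P}}\cdot\operatorname{grad})\boldsymbol{\mathfrak{E}}, \tag{16.12}$$

where $\boldsymbol{\mathscr{P}}$ is the total dipole moment of the polarized dielectric; this result, of course, could
have been obtained by direct differentiation of $\mathscr{F}$ from (11.8). In formula (16.11) we
neglect the second term in the first approximation and reach the natural conclusion that

$$\mathbf{K} = \boldsymbol{\mathscr{P}}\times\boldsymbol{\mathfrak{E}}. \tag{16.13}$$

† It should be emphasized, however, that the integrand in (16.10) cannot be interpreted as the force density.
The reason is that the local forces in the dielectric are exerted also by the internal fields,
which, by Newton's third law, contribute nothing to the total force, although they modify the distribution of forces
over the volume of the body.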


PROBLEMS 

Problem 1. A dielectric sphere with radius $a$ in a uniform external field $\boldsymbol{\mathfrak{E}}$ is cut in half by a plane
perpendicular to the field. Determine the force of attraction between the hemispheres.

Solution. We imagine the hemispheres separated by an infinitely narrow slit and determine the force
from formula (16.8) with $\varepsilon = 1$, integrating over the surface of a hemisphere; $\mathbf{E}$ is the field in the vacuum near the
surface. According to (8.2) the field $\mathbf{E}^{(i)}$ inside the sphere is uniform and equal to $3\boldsymbol{\mathfrak{E}}/(2 + \varepsilon)$, where $\varepsilon$ is the
permittivity of the sphere. The field in the slit is perpendicular to the surface and is $E = D^{(i)} = 3\varepsilon\mathfrak{E}/(2 + \varepsilon)$. On
the outer surface of the sphere we have

$$E_r = D^{(i)}_r = \frac{3\varepsilon}{2 + \varepsilon}\,\mathfrak{E}\cos\theta, \qquad E_\theta = E^{(i)}_\theta = -\frac{3}{2 + \varepsilon}\,\mathfrak{E}\sin\theta,$$

where $\theta$ is the angle between the position vector and the direction of $\boldsymbol{\mathfrak{E}}$. A calculation of the integral gives an
attractive force†

$$F = \frac{9(\varepsilon - 1)^2 a^2\mathfrak{E}^2}{16(\varepsilon + 2)^2}.$$
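The surface integral in Problem 1 can be checked numerically. The sketch below is only an illustration under stated assumptions: it evaluates the z-component of (16.8) with ε = 1 outside, over the curved surface of the hemisphere z > 0 (midpoint rule in θ) plus the flat face in the slit, and compares it with the closed-form result. The function names, the number of integration steps and the use of Gaussian units are choices made here, not taken from the text.

```python
import math


def hemisphere_force_numeric(eps, a=1.0, E0=1.0, steps=20000):
    """z-component of the force on the hemisphere z > 0, from (16.8)
    evaluated in vacuum just outside its surface."""
    A = 3.0 * E0 / (2.0 + eps)          # uniform field inside the sphere
    total = 0.0
    dtheta = (math.pi / 2.0) / steps
    for j in range(steps):
        th = (j + 0.5) * dtheta
        Er = eps * A * math.cos(th)     # normal component (D_r continuous)
        Eth = -A * math.sin(th)         # tangential component (E_theta continuous)
        Ez = Er * math.cos(th) - Eth * math.sin(th)
        E2 = Er * Er + Eth * Eth
        # integrand E_z (n.E) - E^2 n_z / 2 with n the outward radial normal
        integrand = Ez * Er - 0.5 * E2 * math.cos(th)
        total += integrand * 2.0 * math.pi * a * a * math.sin(th) * dtheta
    # flat face in the slit: outward normal is -z, field eps*A along z
    Es = eps * A
    total += (Es * (-Es) - 0.5 * Es * Es * (-1.0)) * math.pi * a * a
    return total / (4.0 * math.pi)


def hemisphere_force_closed(eps, a=1.0, E0=1.0):
    """Magnitude of the attractive force quoted in the solution."""
    return 9.0 * (eps - 1.0) ** 2 * a * a * E0 * E0 / (16.0 * (eps + 2.0) ** 2)
```

The numerical value comes out negative, i.e. the force on the upper hemisphere points towards the lower one, and its magnitude should agree with the closed form to within the quadrature error.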


Problem 2. Determine the change in shape of a dielectric sphere in a uniform external electric field.

Solution. As in §5, Problem 4. In determining the change in shape, we assume the volume of the sphere to
be unchanged.‡ The elastic part of the free energy is given by the same expression as in §5, Problem 4. The electric
part is given by (8.9):

$$-\frac{V}{8\pi}\,\frac{\varepsilon^{(x)} - 1}{1 + n(\varepsilon^{(x)} - 1)}\,\mathfrak{E}^2,$$

and the permittivity in the $x$-direction is, by (16.1), $\varepsilon^{(x)} = \varepsilon_0 + a_1 u_{xx} = \varepsilon_0 + \tfrac{2}{3}a_1(u_{xx} - u_{yy}) = \varepsilon_0 + \tfrac{2}{3}a_1(a - b)/R$.
From the condition that the total free energy be a minimum we find (since the quantity concerned is small)

$$\frac{a - b}{R} = \frac{9\mathfrak{E}^2}{40\pi\mu}\,\frac{(\varepsilon_0 - 1)^2 + 5a_1/2}{(\varepsilon_0 + 2)^2}.$$

For $\varepsilon_0 \to \infty$ this tends to the value for a conducting sphere.

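Problem 3. Determine the body forces in an isotropic solid dielectric, assumed homogeneous, when
extraneous charges are present in it.

Solution. Assuming $\varepsilon_0$, $a_1$, $a_2$ constant and using the equations $\operatorname{curl}\mathbf{E} = 0$, $\operatorname{div}\mathbf{D} = \varepsilon_0\operatorname{div}\mathbf{E} = 4\pi\rho_{\mathrm{ex}}$, we
have from (16.4)

$$f_i = \frac{\partial\sigma_{ik}}{\partial x_k} = \frac{\partial\sigma^{(0)}_{ik}}{\partial x_k} - \frac{a_1 + 2a_2}{16\pi}\,\frac{\partial E^2}{\partial x_i} + \left(1 - \frac{a_1}{2\varepsilon_0}\right)\rho_{\mathrm{ex}}E_i.$$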



§17. Piezoelectrics 

The internal stresses which occur in an isotropic dielectric in an electric field are
proportional to the square of the field. The effect is similar in crystals belonging to some of
the crystal classes. For certain types of symmetry, however, the electrostriction properties
of the crystals are quite different. The internal stresses in these piezoelectric bodies
resulting from an electric field are proportional to the field itself. The converse effect also
occurs: the deformation of a piezoelectric is accompanied by the appearance in it of a field
proportional to the deformation.

† It is by chance that, in the limit $\varepsilon \to \infty$, this expression tends to the result obtained in §5, Problem 3 for a
conducting sphere (indeed, the forces are in opposite directions). The two cases are evidently not physically
equivalent, because there is no field in the slit between two conducting hemispheres at the same potential, whereas
in this problem there is a field in the slit.

‡ The change in volume is determined in §12, Problem 1.

Since in a piezoelectric only the principal (linear) effect is of interest, we can neglect the
terms quadratic in the field in the general formula (16.5). Then
$\sigma_{ik} = \tilde{F}\delta_{ik} + (\partial\tilde{F}/\partial u_{ik})_{T,\mathbf{E}}$. In
this section we shall use the thermodynamic quantities referred to the matter in unit
volume of the undeformed body (see the first footnote to §16). Taking $\tilde{F}$ in this sense, we
have simply

$$\sigma_{ik} = (\partial\tilde{F}/\partial u_{ik})_{T,\mathbf{E}}. \tag{17.1}$$

Accordingly, the thermodynamic relation for the differential $d\tilde{F}$ is

$$d\tilde{F} = -S\,dT + \sigma_{ik}\,du_{ik} - \mathbf{D}\cdot d\mathbf{E}/4\pi. \tag{17.2}$$


The following remark should be made concerning the last term. In the form given here, this 
term (taken from (10.9)) pertains, strictly speaking, to unit volume of the deformed body. 
By ignoring this fact, we commit an error which, in the case of a piezoelectric, is of a higher 
order of smallness than the remaining terms in (17.2). 

The independent variables in (17.2) include the components of the tensor $u_{ik}$. It is
sometimes convenient to use instead the components $\sigma_{ik}$. To do so, we must introduce the
thermodynamic potential, defined as

$$\tilde{\Phi} = \tilde{F} - u_{ik}\sigma_{ik}. \tag{17.3}$$

For the differential of this quantity we have

$$d\tilde{\Phi} = -S\,dT - u_{ik}\,d\sigma_{ik} - \mathbf{D}\cdot d\mathbf{E}/4\pi. \tag{17.4}$$

It must be emphasized that the use of the thermodynamic potential in electrodynamics in
accordance with formulae (17.3) and (17.4) rests on the validity of (17.1) and so is possible
only for piezoelectric bodies.

Having thus defined the necessary thermodynamic quantities, let us now ascertain the
piezoelectric properties of crystals. If $\sigma_{ik}$ and $E_i$ are taken as independent variables, the
induction $\mathbf{D}$ must be regarded as a function of them, and an expansion of this function
must retain the terms linear in them. The linear terms in the expansion of the components
of a vector in powers of the components of a tensor of rank two can be written, in the most
general case, as $4\pi\gamma_{i,kl}\sigma_{kl}$, where the constants $\gamma_{i,kl}$ form a tensor of rank three, and the
factor $4\pi$ is introduced for convenience. Since the tensor $\sigma_{kl}$ is symmetrical, it is clear that
the tensor $\gamma_{i,kl}$ may also be supposed to have the symmetry property

$$\gamma_{i,kl} = \gamma_{i,lk}. \tag{17.5}$$

For clarity we separate the symmetrical suffixes from the remaining one by a comma. We shall
call $\gamma_{i,kl}$ the piezoelectric tensor. If it is known, the piezoelectric properties of the crystal
are completely determined.

Adding the piezoelectric terms to the expression (13.1) for the electric induction in the
crystal, we have

$$D_i = D_{0i} + \varepsilon_{ik}E_k + 4\pi\gamma_{i,kl}\sigma_{kl}. \tag{17.6}$$


Corresponding additional terms appear in the thermodynamic quantities. The thermodynamic
potential of a non-piezoelectric crystal in the absence of a field is
$\tilde{\Phi} = \Phi = \Phi_0 - \tfrac{1}{2}\mu_{iklm}\sigma_{ik}\sigma_{lm}$, where $\Phi_0$ pertains to the undeformed body, and the second term is the
ordinary elastic energy, determined by the elastic constant tensor $\mu_{iklm}$.† For a piezoelectric
we have

$$\tilde{\Phi} = \Phi_0 - \tfrac{1}{2}\mu_{iklm}\sigma_{ik}\sigma_{lm} - \varepsilon_{ik}E_i E_k/8\pi - E_i D_{0i}/4\pi - \gamma_{i,kl}E_i\sigma_{kl}. \tag{17.7}$$

The form of the last three terms is given by the fact that the derivatives of $\tilde{\Phi}$ with respect to
$E_i$ (for given temperature and internal stresses), found from the relation $D_i = -4\pi\,\partial\tilde{\Phi}/\partial E_i$,
must accord with (17.6).

Knowing $\tilde{\Phi}$, we can obtain from (17.4) a formula giving the strain tensor in terms of the
stresses $\sigma_{ik}$ and the field $\mathbf{E}$:

$$u_{ik} = -(\partial\tilde{\Phi}/\partial\sigma_{ik})_{T,\mathbf{E}} = \mu_{iklm}\sigma_{lm} + \gamma_{l,ik}E_l. \tag{17.8}$$
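The statement that the last three terms of (17.7) are fixed by (17.6) can be illustrated numerically. The following is a hedged sketch, not the book's procedure: it builds the potential (17.7) from randomly chosen tensors (the helper names and the random test data are assumptions introduced here), differentiates it with respect to the field components by central differences, and compares −4π ∂Φ̃/∂E_i with the right-hand side of (17.6). Only the induction is checked; the strain relation (17.8) is left out because differentiation with respect to non-diagonal stress components requires the care described in the footnote to Problem 1 below.

```python
import math
import random

R3 = range(3)


def phi_tilde(E, sigma, eps, D0, gamma, mu, phi0=0.0):
    """Thermodynamic potential (17.7) in Gaussian units."""
    elastic = 0.5 * sum(mu[i][k][l][m] * sigma[i][k] * sigma[l][m]
                        for i in R3 for k in R3 for l in R3 for m in R3)
    dielectric = sum(eps[i][k] * E[i] * E[k] for i in R3 for k in R3) / (8.0 * math.pi)
    pyro = sum(E[i] * D0[i] for i in R3) / (4.0 * math.pi)
    piezo = sum(gamma[i][k][l] * E[i] * sigma[k][l]
                for i in R3 for k in R3 for l in R3)
    return phi0 - elastic - dielectric - pyro - piezo


def induction_direct(E, sigma, eps, D0, gamma):
    """Right-hand side of (17.6)."""
    return [D0[i]
            + sum(eps[i][k] * E[k] for k in R3)
            + 4.0 * math.pi * sum(gamma[i][k][l] * sigma[k][l] for k in R3 for l in R3)
            for i in R3]


def induction_from_potential(E, sigma, eps, D0, gamma, mu, step=1e-3):
    """D_i = -4 pi dPhi/dE_i by central differences (exact for a quadratic form
    up to rounding)."""
    D = []
    for i in R3:
        plus, minus = list(E), list(E)
        plus[i] += step
        minus[i] -= step
        slope = (phi_tilde(plus, sigma, eps, D0, gamma, mu)
                 - phi_tilde(minus, sigma, eps, D0, gamma, mu)) / (2.0 * step)
        D.append(-4.0 * math.pi * slope)
    return D


def random_inputs(seed=0):
    """Random illustrative data; eps and sigma are symmetrized."""
    rng = random.Random(seed)
    rnd = lambda: rng.uniform(-1.0, 1.0)
    raw_eps = [[rnd() for _ in R3] for _ in R3]
    raw_sig = [[rnd() for _ in R3] for _ in R3]
    eps = [[0.5 * (raw_eps[i][k] + raw_eps[k][i]) for k in R3] for i in R3]
    sigma = [[0.5 * (raw_sig[i][k] + raw_sig[k][i]) for k in R3] for i in R3]
    E = [rnd() for _ in R3]
    D0 = [rnd() for _ in R3]
    gamma = [[[rnd() for _ in R3] for _ in R3] for _ in R3]
    mu = [[[[rnd() for _ in R3] for _ in R3] for _ in R3] for _ in R3]
    return E, sigma, eps, D0, gamma, mu
```

The elastic term drops out of the derivative with respect to E, so the tensor `mu` may be arbitrary in this check; the symmetry of `eps` is what makes the quadratic term reproduce ε_ik E_k.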

It should be mentioned that to regard the quantities $\mu_{iklm}$ and $\varepsilon_{ik}$ for a piezoelectric as
elastic constants and permittivity is to some extent conventional. With the definitions used 
here, they give respectively the strains as functions of the elastic stresses for a given field, 
and the induction as a function of the field for given stresses. If, however, the deformation 
occurs with a given value of the induction, or we consider the induction as a function of the 
field for given strains, the elastic constants and the permittivity will be represented by other 
quantities, which can be expressed as somewhat complex functions of the components of 
the tensors e and y. 

The field in a piezoelectric body must be determined together with its deformation,
leading to a problem in both electrostatics and elasticity theory. We must seek a
simultaneous solution of the electrostatic equations

$$\operatorname{div}\mathbf{D} = 0, \qquad \operatorname{curl}\mathbf{E} = 0, \tag{17.9}$$

with $\mathbf{D}$ given by (17.6), and the equations of elastic equilibrium

$$\partial\sigma_{ik}/\partial x_k = 0, \tag{17.10}$$

with the appropriate boundary conditions at the surface of the body and use of the relation
(17.8) between $\sigma_{ik}$ and the strains. In general this problem is very complex.

The problem is much simplified for a body of ellipsoidal form with a free surface (i.e. one
subject to no external mechanical forces). In this case (§8), the field inside the body is
uniform; the deformation is therefore homogeneous, and the elastic stresses $\sigma_{ik} = 0$.

Finally, let us consider which types of crystal symmetry allow the existence of
piezoelectricity; in other words, what are the restrictions imposed on the components of
the tensor $\gamma_{i,kl}$ by the symmetry conditions. In general, this tensor (which is symmetrical in
the suffixes $k$ and $l$) has 18 independent non-zero components, but in reality the number of
independent components is usually much smaller.

† The tensor $\mu_{iklm}$ determines the relation between stress and strain: $u_{ik} = \mu_{iklm}\sigma_{lm}$. In elasticity theory the
relation $\sigma_{ik} = \lambda_{iklm}u_{lm}$ is used. It is evident that the symmetry properties of the tensor $\mu_{iklm}$ are
exactly the same as those of $\lambda_{iklm}$. The free energy $F$ contains the elastic energy with the plus sign:
$F_{\mathrm{el}} = \tfrac{1}{2}\lambda_{iklm}u_{ik}u_{lm}$. The thermodynamic potential is obtained from $F$ by subtracting $\sigma_{ik}u_{ik}$, and so
$\Phi_{\mathrm{el}} = -\tfrac{1}{2}\mu_{iklm}\sigma_{ik}\sigma_{lm}$.

In all symmetry transformations of a given crystal, the components of the tensor $\gamma_{i,kl}$
must remain unaltered in value. Hence it follows at once that no piezoelectric body can
have a centre of symmetry or, in particular, be isotropic. For, on reflection in the centre (i.e.
change of sign of all three coordinates), the components of a tensor of rank three change
sign.

Of the 32 crystal classes, only 20 allow piezoelectricity. These comprise the ten
enumerated in §13 as allowing pyroelectricity (all pyroelectrics are also piezoelectrics) and
the ten following classes:

orthorhombic system: $D_2$
tetragonal system: $D_4$, $D_{2d}$, $S_4$
rhombohedral system: $D_3$
hexagonal system: $D_6$, $C_{3h}$, $D_{3h}$
cubic system: $T$, $T_d$.

The non-zero components of the piezoelectric tensor for each class are given in the
Problems below.

Mention may also be made here of a phenomenon akin to piezoelectricity, which results
from the "deformation" of a liquid crystal. We shall consider what are called nematic
crystals (SP 1, §140), liquids in which there is a distinctive direction of preferred
orientation of the molecules. At each point in the medium, this direction is specified by a
unit vector $\mathbf{d}$, the director of the crystal. In an undeformed liquid crystal, $\mathbf{d}$ has the same
direction everywhere, but in a deformed one this direction is a function of the coordinates.
The expansion (17.6) corresponds to an expression for the induction in a liquid crystal in
the form

$$D_i = \varepsilon_{ik}E_k + 4\pi e_1 d_i\operatorname{div}\mathbf{d} + 4\pi e_2(\operatorname{curl}\mathbf{d}\times\mathbf{d})_i, \tag{17.11}$$

where $e_1$ and $e_2$ are scalar coefficients (R. B. Meyer, 1969). The last two terms, which
describe the effect in question, constitute the most general polar vector that can be formed
from $\mathbf{d}$ and its first derivatives with respect to the coordinates. The expression (17.11) is
automatically invariant under a change in the sign of $\mathbf{d}$.

The permittivity tensor of a nematic crystal has the same symmetry as for uniaxial
crystals, the axis of symmetry being represented by the local (at each point) direction of the
director. The tensor may be expressed as

$$\varepsilon_{ik} = \varepsilon_0\delta_{ik} + \varepsilon_a d_i d_k, \tag{17.12}$$

with two independent constants $\varepsilon_0$ and $\varepsilon_a$.


PROBLEMS

Problem 1. Determine the non-zero components of the tensor $\gamma_{i,kl}$ for non-pyroelectric crystal classes which
allow piezoelectricity.

Solution. The class $D_2$ has three mutually perpendicular twofold axes of symmetry, which we take as the
axes $x$, $y$ and $z$. Rotations through 180° about these axes change the sign of two out of the three coordinates.
Since the components $\gamma_{i,kl}$ are transformed as the products $x_i x_k x_l$, the only non-zero components are those with
three different suffixes: $\gamma_{x,yz}$, $\gamma_{y,zx}$, $\gamma_{z,xy}$. (The other non-zero components are equal to these, since $\gamma_{i,kl} = \gamma_{i,lk}$.)
Accordingly, the piezoelectric part of the thermodynamic potential is†

$$\tilde{\Phi}_{\mathrm{pie}} = -2(\gamma_{x,yz}E_x\sigma_{yz} + \gamma_{y,zx}E_y\sigma_{zx} + \gamma_{z,xy}E_z\sigma_{xy}). \tag{1}$$

The class $D_{2d}$ is obtained by adding to the axes of class $D_2$ two planes of symmetry passing through one axis
(the $z$-axis, say) and bisecting the angles between the other two. Reflection in one of these planes gives the
transformation $x \to y$, $y \to x$, $z \to z$. Hence the components $\gamma_{i,kl}$ which differ by interchange of $x$ and $y$ must be
equal, so that only two out of the three coefficients in (1) are now independent: $\gamma_{x,yz} = \gamma_{y,xz}$.

The class $T$ is obtained from the class $D_2$ by adding four diagonal threefold axes of symmetry, rotations
about which effect a cyclic permutation of $x$, $y$, $z$, e.g. $x \to z$, $y \to x$, $z \to y$. Hence all three coefficients in (1) are
equal: $\gamma_{x,yz} = \gamma_{y,zx} = \gamma_{z,xy}$. The same result is obtained for the cubic class $T_d$.

The class $D_4$ has one fourfold axis of symmetry (the $z$-axis, say) and four twofold axes lying in the $xy$-plane.
Here the symmetry elements of the class $D_2$ are supplemented by a rotation through 90° about the $z$-axis, i.e. the
transformation $x \to y$, $y \to -x$, $z \to z$. Consequently, one of the coefficients in (1) must be zero
($\gamma_{z,xy} = -\gamma_{z,yx} = 0$), and the other two are equal, but opposite in sign: $\gamma_{x,yz} = -\gamma_{y,xz}$. The same result is obtained for the
class $D_6$.

The class $S_4$ includes the transformations $x \to y$, $y \to -x$, $z \to -z$ and $x \to -x$, $y \to -y$, $z \to z$. The non-zero
components are $\gamma_{z,xy}$, $\gamma_{x,yz} = \gamma_{y,xz}$, $\gamma_{z,xx} = -\gamma_{z,yy}$, $\gamma_{x,zx} = -\gamma_{y,zy}$. One of these can be made to vanish by a
suitable choice of the $x$ and $y$ axes.
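For the classes treated so far, every symmetry operation merely permutes the coordinate axes and changes some signs, so the counting of independent components can be mechanized. The sketch below is an illustration under that assumption (it does not cover the threefold axes of $D_3$, $D_{3h}$, $C_{3h}$, and the generator names are chosen here). Each generator is a pair `(perm, signs)` meaning that coordinate `i` goes to `signs[i]` times coordinate `perm[i]`; the relations γ(i,k,l) = s_i s_k s_l γ(p_i,p_k,p_l), together with the symmetry in k and l, are collected with a signed union-find structure, and a class of components is marked as vanishing when it is found to equal minus itself.

```python
from itertools import product


def independent_count(generators):
    """Number of independent components of gamma[i,kl] = gamma[i,lk]
    invariant under the given signed axis permutations."""
    comps = list(product(range(3), repeat=3))
    parent = {c: c for c in comps}
    rel = {c: 1 for c in comps}            # sign of c relative to parent[c]
    vanishes = {c: False for c in comps}   # meaningful for roots only

    def find(c):
        s = 1
        while parent[c] != c:
            s *= rel[c]
            c = parent[c]
        return c, s

    def impose(a, b, sign):
        """Impose gamma[a] = sign * gamma[b]."""
        ra, sa = find(a)
        rb, sb = find(b)
        if ra == rb:
            if sa != sign * sb:
                vanishes[ra] = True        # component equals minus itself
            return
        parent[ra] = rb
        rel[ra] = sa * sign * sb
        vanishes[rb] = vanishes[rb] or vanishes[ra]

    for (i, k, l) in comps:
        impose((i, k, l), (i, l, k), 1)
        for perm, signs in generators:
            image = (perm[i], perm[k], perm[l])
            impose((i, k, l), image, signs[i] * signs[k] * signs[l])

    roots = {find(c)[0] for c in comps}
    return sum(1 for r in roots if not vanishes[r])


# Generators (axes 0, 1, 2 = x, y, z); names are illustrative.
C2Z = ((0, 1, 2), (-1, -1, 1))     # x -> -x, y -> -y, z -> z
C2X = ((0, 1, 2), (1, -1, -1))     # x -> x, y -> -y, z -> -z
SWAP_XY = ((1, 0, 2), (1, 1, 1))   # x -> y, y -> x, z -> z
C4Z = ((1, 0, 2), (1, -1, 1))      # x -> y, y -> -x, z -> z
S4Z = ((1, 0, 2), (1, -1, -1))     # x -> y, y -> -x, z -> -z
C3_DIAG = ((2, 0, 1), (1, 1, 1))   # x -> z, y -> x, z -> y
INVERSION = ((0, 1, 2), (-1, -1, -1))

CLASSES = {
    "D2": [C2Z, C2X],
    "D2d": [C2Z, C2X, SWAP_XY],
    "D4": [C2Z, C2X, C4Z],
    "S4": [S4Z],
    "T": [C2Z, C2X, C3_DIAG],
    "Td": [C2Z, C2X, C3_DIAG, SWAP_XY],
}
```

Calling `independent_count(CLASSES["D2"])` and so on reproduces the counts stated in the solution (before any further reduction by a choice of axes), an empty generator list gives the 18 components of the general case, and adding `INVERSION` to any class leaves no component at all.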

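The class $D_3$ has one threefold axis of symmetry (the $z$-axis, say), and three twofold axes lying in the $xy$-plane;
let one of these be the $x$-axis. To find the restrictions imposed by the presence of a threefold axis, we make a formal
transformation by introducing the complex "coordinates" $\xi = x + iy$, $\eta = x - iy$; the coordinate $z$ remains
unchanged. We must also transform the tensor $\gamma_{i,kl}$ to these new coordinates, in which the suffixes take the values
$\xi$, $\eta$, $z$. In a rotation through 120° about the $z$-axis these coordinates undergo the transformation
$\xi \to \xi e^{2\pi i/3}$, $\eta \to \eta e^{-2\pi i/3}$, $z \to z$. The only components of the tensor $\gamma_{i,kl}$ which remain unchanged and so may be different
from zero are $\gamma_{z,\xi\eta}$, $\gamma_{\xi,\eta z}$, $\gamma_{\eta,\xi z}$, $\gamma_{\xi,\xi\xi}$, $\gamma_{\eta,\eta\eta}$ and $\gamma_{z,zz}$. A rotation through 180° about the $x$-axis gives the
transformation $x \to x$, $y \to -y$, $z \to -z$, or $\xi \to \eta$, $\eta \to \xi$, $z \to -z$; $\gamma_{z,\xi\eta}$ and $\gamma_{z,zz}$ change sign and so must be zero,
while the remaining components listed above are mutually transformed in pairs, giving $\gamma_{\xi,\eta z} = -\gamma_{\eta,\xi z}$,
$\gamma_{\xi,\xi\xi} = \gamma_{\eta,\eta\eta}$. In order to write an expression for $\tilde{\Phi}_{\mathrm{pie}}$, we must form the sum $-\gamma_{i,kl}E_i\sigma_{kl}$, in which the suffixes take the
values $\xi$, $\eta$, $z$:

$$\tilde{\Phi}_{\mathrm{pie}} = -2\gamma_{\eta,\xi z}(E_\eta\sigma_{\xi z} - E_\xi\sigma_{\eta z}) - \gamma_{\xi,\xi\xi}(E_\xi\sigma_{\xi\xi} + E_\eta\sigma_{\eta\eta}).$$

Here the components $E_i$ and $\sigma_{kl}$ in the coordinates $\xi$, $\eta$, $z$ must also be expressed in terms of those in the original
coordinates $x$, $y$, $z$. This is easily done by using the fact that the components of a tensor are transformed as the
products of the corresponding coordinates. Hence, for example, from $\xi^2 = x^2 - y^2 + 2ixy$, we have
$\sigma_{\xi\xi} = \sigma_{xx} - \sigma_{yy} + 2i\sigma_{xy}$. The result is

$$\tilde{\Phi}_{\mathrm{pie}} = 2a(E_y\sigma_{zx} - E_x\sigma_{zy}) + b[2E_y\sigma_{xy} - E_x(\sigma_{xx} - \sigma_{yy})], \tag{2}$$

where $a = 2i\gamma_{\eta,\xi z}$ and $b = 2\gamma_{\xi,\xi\xi}$ are real constants. The relations between the components $\gamma_{i,kl}$ in the coordinates
$x$, $y$, $z$ are, as we see from (2),‡

$$\gamma_{x,yz} = -\gamma_{y,xz} = a, \qquad \gamma_{x,xx} = -\gamma_{x,yy} = -\gamma_{y,xy} = b.$$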
The class $D_{3h}$ is obtained from the class $D_3$ by adding a plane of symmetry (the $xy$-plane) perpendicular to the
threefold axis. Reflection in this plane changes the sign of $z$, and so $\gamma_{\xi,\eta z} = 0$, so that only the term with the
coefficient $b$ remains in (2).

The class $C_{3h}$ has a threefold axis and a plane of symmetry perpendicular to it. Reflection in this plane changes
the sign of $z$; hence all components $\gamma_{i,kl}$ whose suffixes contain $z$ an odd number of times must be zero. Taking into
account also the restrictions derived above which are imposed by the threefold axis of symmetry, we find that
only the two components $\gamma_{\xi,\xi\xi}$ and $\gamma_{\eta,\eta\eta}$ are not zero. These quantities must be complex conjugates in order that $\tilde{\Phi}$
should be real. Putting $2\gamma_{\eta,\eta\eta} = a - ib$, $2\gamma_{\xi,\xi\xi} = a + ib$, we find

$$\tilde{\Phi}_{\mathrm{pie}} = a[2E_y\sigma_{xy} - E_x(\sigma_{xx} - \sigma_{yy})] + b[2E_x\sigma_{xy} + E_y(\sigma_{xx} - \sigma_{yy})].$$

Either $a$ or $b$ can be made to vanish by a suitable choice of the $x$ and $y$ axes.

† To avoid misunderstanding it should be recalled that, if we calculate the components of the strain tensor $u_{ik}$
by differentiation of the actual expression for $\tilde{\Phi}$ with respect to $\sigma_{ik}$, the derivatives with respect to
$\sigma_{ik}$ with $i \neq k$ give twice the corresponding components $u_{ik}$, because the expressions $u_{ik} = -\partial\tilde{\Phi}/\partial\sigma_{ik}$
are meaningful only as representing the fact that $d\tilde{\Phi} = -u_{ik}\,d\sigma_{ik}$, and the terms containing the
differentials of non-diagonal components of the symmetrical tensor $\sigma_{ik}$ appear twice in the sum $u_{ik}\,d\sigma_{ik}$.

‡ In non-orthogonal coordinates such as $\xi$, $\eta$, $z$ the covariant and contravariant components of tensors must be
distinguished. This should have been done in returning to the original coordinates $x$, $y$, $z$: if the components $E_i$ and
$\sigma_{kl}$ are transformed contravariantly, then those of the tensor $\gamma_{i,kl}$ transform covariantly. We avoid this necessity,
by obtaining the required relations between the components $\gamma_{i,kl}$ in the coordinates $x$, $y$, $z$ directly from
the form of the scalar combination (2).


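Problem 2. The same as Problem 1, but for the crystal classes which allow pyroelectricity.

Solution. Let the $z$-axis be the twofold, threefold, fourfold or sixfold axis of symmetry, or in the class $C_s$ be
perpendicular to the plane of symmetry. In the classes $C_{nv}$ the $xz$-plane is a plane of symmetry.
We give below for each class all the components $\gamma_{i,kl}$ which are not zero.

Class $C_1$: all $\gamma_{i,kl}$.
$C_s$: all those in which the suffix $z$ appears twice or not at all.
$C_{2v}$: $\gamma_{z,xx}$, $\gamma_{z,yy}$, $\gamma_{z,zz}$, $\gamma_{x,xz}$, $\gamma_{y,yz}$.
$C_2$: the same, together with $\gamma_{x,yz}$, $\gamma_{y,xz}$, $\gamma_{z,xy}$.
$C_{4v}$: $\gamma_{z,zz}$, $\gamma_{x,xz} = \gamma_{y,yz}$, $\gamma_{z,xx} = \gamma_{z,yy}$.
$C_4$: the same, together with $\gamma_{x,yz} = -\gamma_{y,xz}$.
$C_{3v}$: $\gamma_{z,zz}$, $\gamma_{x,xz} = \gamma_{y,yz}$, $\gamma_{z,xx} = \gamma_{z,yy}$, $\gamma_{x,xx} = -\gamma_{x,yy} = -\gamma_{y,xy}$.
$C_3$: the same, together with $\gamma_{x,yz} = -\gamma_{y,xz}$, $\gamma_{y,xx} = -\gamma_{y,yy} = \gamma_{x,xy}$.
$C_{6v}$: $\gamma_{z,zz}$, $\gamma_{x,xz} = \gamma_{y,yz}$, $\gamma_{z,xx} = \gamma_{z,yy}$.
$C_6$: the same, together with $\gamma_{x,yz} = -\gamma_{y,xz}$.

By a suitable choice of directions of the $x$, $y$, $z$ axes three more components can be made zero in the class $C_1$, and
by a choice of the $x$ and $y$ axes one more component can be made zero in the classes $C_s$, $C_2$, $C_3$; in $C_4$ and $C_6$, the
expression $\gamma_{i,kl}E_i\sigma_{kl}$ is invariant under rotation through any angle about the $z$-axis, and therefore no further
reduction in the number of non-zero components $\gamma_{i,kl}$ is possible.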

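Problem 3. Determine Young's modulus (the coefficient of proportionality between the extending stress and
the relative extension) for a parallel-sided slab of a non-pyroelectric piezoelectric in the following cases:
(a) where the slab is stretched by the plates of a short-circuited capacitor, (b) where it is stretched by the plates
of an uncharged capacitor, (c) where it is stretched parallel to its plane with no external field.

Solution. (a) In this case the field in the slab is $\mathbf{E} = 0$, and the only non-zero component of the stress tensor is
the extending stress $\sigma_{zz}$ (the $z$-axis being perpendicular to the slab).† From (17.8) we have $u_{zz} = \mu_{zzzz}\sigma_{zz}$, so that
Young's modulus is given by $1/E = \mu_{zzzz}$.

(b) In this case $E_x = E_y = 0$, $D_z = 0$. From (17.6) and (17.8) we have $D_z = \varepsilon_{zz}E_z + 4\pi\gamma_{z,zz}\sigma_{zz}
= 0$, $u_{zz} = \mu_{zzzz}\sigma_{zz} + \gamma_{z,zz}E_z$. Eliminating $E_z$, we obtain $1/E = \mu_{zzzz} - 4\pi\gamma_{z,zz}^2/\varepsilon_{zz}$.

(c) In this case also, $E_x = E_y = 0$, $D_z = 0$, but the extension is along the $x$-axis, say. Here we have
$D_z = \varepsilon_{zz}E_z + 4\pi\gamma_{z,xx}\sigma_{xx} = 0$, $u_{xx} = \mu_{xxxx}\sigma_{xx} + \gamma_{z,xx}E_z$. Eliminating $E_z$, we obtain
$1/E = \mu_{xxxx} - 4\pi\gamma_{z,xx}^2/\varepsilon_{zz}$.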


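Problem 4. Obtain an equation for the velocity of sound in a piezoelectric medium.

Solution. In this problem it is more convenient to use $u_{ik}$ as the independent variables, instead of $\sigma_{ik}$. We
write $\tilde{F}$ in the form

$$\tilde{F} = F_0 + \tfrac{1}{2}\lambda_{iklm}u_{ik}u_{lm} - \varepsilon_{ik}E_i E_k/8\pi - E_i D_{0i}/4\pi + \beta_{i,kl}E_i u_{kl}, \tag{1}$$

whence

$$\sigma_{ik} = \partial\tilde{F}/\partial u_{ik} = \lambda_{iklm}u_{lm} + \beta_{l,ik}E_l, \tag{2}$$

$$D_i = -4\pi\,\partial\tilde{F}/\partial E_i = D_{0i} + \varepsilon_{ik}E_k - 4\pi\beta_{i,kl}u_{kl}. \tag{3}$$

The equations of motion from the theory of elasticity are

$$\rho\ddot{u}_i = \partial\sigma_{ik}/\partial x_k, \tag{4}$$

where $\rho$ is the density of the medium, and $\mathbf{u}$ is the displacement vector, related to $u_{ik}$ by
$u_{ik} = \tfrac{1}{2}(\partial u_i/\partial x_k + \partial u_k/\partial x_i)$. To these must be added the equation

$$\operatorname{div}\mathbf{D} = 0, \tag{5}$$

and the field can be expressed in terms of the field potential: $E_i = -\partial\phi/\partial x_i$, which takes into account the equation
$\operatorname{curl}\mathbf{E} = 0$.

In a plane sound wave, $\mathbf{u}$ and $\phi$ are proportional to $\exp[i(\mathbf{k}\cdot\mathbf{r} - \omega t)]$, and we find from the above equations that

$$\rho\omega^2 u_i - \lambda_{iklm}k_k k_l u_m + \beta_{l,ik}k_k k_l\phi = 0, \qquad 4\pi\beta_{i,kl}k_i k_l u_k + \varepsilon_{ik}k_i k_k\phi = 0.$$

Eliminating $\phi$, we can write the condition of compatibility of the resulting equations for $u_i$ as

$$\det\left|\rho\omega^2\delta_{im} - \lambda_{iklm}k_k k_l - 4\pi(\beta_{l,ik}k_k k_l)(\beta_{p,mq}k_p k_q)/\varepsilon_{rs}k_r k_s\right| = 0.$$

For any given direction of the wave vector $\mathbf{k}$, this equation determines three phase velocities of sound $\omega/k$, which
are in general different. A characteristic property of a piezoelectric medium is the involved relation between the
velocity and direction of the wave.

† The $z$-axis is not assumed to coincide with any particular crystallographic direction.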

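Problem 5. A piezoelectric crystal of the class $C_{6v}$ has a plane boundary (the $xz$-plane) which passes through
the axis of symmetry (the $z$-axis). Find the speed of surface waves propagated at right angles to the symmetry axis
(in the $x$-direction), in which there are oscillations of the displacement $u_z$ and the electric field potential $\phi$ (J. L.
Bleustein, 1968; Yu. V. Gulyaev, 1969).

Solution. Under the conditions considered, two equations involving only $u_z$ and $\phi$ separate out from (4) and
(5); these quantities depend on the coordinates $x$ and $y$, and on the time $t$, but not on $z$. The non-zero components
of the stress tensor and the induction vector are

$$\sigma_{zx} = \beta E_x + 2\lambda u_{zx}, \qquad \sigma_{zy} = \beta E_y + 2\lambda u_{zy},$$

$$D_x = -8\pi\beta u_{zx} + \varepsilon E_x, \qquad D_y = -8\pi\beta u_{zy} + \varepsilon E_y,$$

with

$$u_{zx} = \tfrac{1}{2}\,\partial u_z/\partial x, \qquad u_{zy} = \tfrac{1}{2}\,\partial u_z/\partial y, \qquad E_x = -\partial\phi/\partial x, \qquad E_y = -\partial\phi/\partial y,$$

and writing for brevity $\beta_{x,xz} = \beta_{y,yz} = \beta$, $\lambda_{xzxz} = \lambda_{yzyz} = \lambda$, $\varepsilon_{xx} = \varepsilon_{yy} = \varepsilon$; the constant pyroelectric induction
$D_z = D_0$ does not appear in the equations or the boundary conditions.

Equation (5) and the $z$-component of equation (4) give, in the region occupied by the piezoelectric medium (the
half-space $y > 0$),

$$4\pi\beta\,\Delta u_z + \varepsilon\,\Delta\phi^{(i)} = 0, \qquad \rho\ddot{u}_z = -\beta\,\Delta\phi^{(i)} + \lambda\,\Delta u_z,$$

where $\Delta = \partial^2/\partial x^2 + \partial^2/\partial y^2$; these may be rewritten as

$$\rho\ddot{u}_z = \bar{\lambda}\,\Delta u_z, \qquad \Delta\psi = 0, \tag{6}$$

where

$$\bar{\lambda} = \lambda + 4\pi\beta^2/\varepsilon, \qquad \psi = \phi^{(i)} + (4\pi\beta/\varepsilon)u_z.$$

In the vacuum (the half-space $y < 0$), the potential satisfies the equation

$$\Delta\phi^{(e)} = 0. \tag{7}$$

These equations are to be solved with the following boundary conditions: at the surface of the medium,

$$\sigma_{zy} = 0, \qquad \phi^{(i)} = \phi^{(e)}, \qquad D_y = -\partial\phi^{(e)}/\partial y \quad \text{for } y = 0, \tag{8}$$

and far from the surface

$$u_z,\ \phi^{(i)} \to 0 \ \text{as}\ y \to \infty; \qquad \phi^{(e)} \to 0 \ \text{as}\ y \to -\infty.$$

We seek the solution in the form

$$u_z = A e^{-\kappa y}e^{i(kx - \omega t)}, \qquad \psi = B e^{-ky}e^{i(kx - \omega t)}, \qquad \phi^{(e)} = C e^{ky}e^{i(kx - \omega t)},$$

with

$$\rho\omega^2 = \bar{\lambda}(k^2 - \kappa^2). \tag{9}$$

Equations (6) and (7) and the conditions at infinity are then satisfied, and the conditions (8) give three linear
homogeneous equations for $A$, $B$ and $C$; the condition for these to have a solution is

$$\kappa = 4\pi\beta^2 k/\bar{\lambda}\varepsilon(1 + \varepsilon) \equiv \Lambda k.$$

Finally, substitution in (9) gives the phase velocity of the waves:

$$\omega/k = \left[(\bar{\lambda}/\rho)(1 - \Lambda^2)\right]^{1/2}.$$

The surface propagation of these waves is restricted to piezoelectric media. As $\beta \to 0$, the penetration depth
$1/\kappa \to \infty$, and a bulk wave is formed.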
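The final formulae of Problem 5 are easily evaluated. The following is a minimal sketch under stated assumptions: the function name and the sample material constants are hypothetical, the quantities are in Gaussian units as in the text, and the expressions used are λ̄ = λ + 4πβ²/ε, Λ = 4πβ²/[λ̄ε(1 + ε)] and ω/k = [(λ̄/ρ)(1 − Λ²)]^{1/2}.

```python
import math


def surface_wave(lam, beta, eps, rho):
    """Return (phase velocity omega/k, Lambda, lam_bar) for the surface wave
    of Problem 5. The damping coefficient is kappa = Lambda * k, so the
    penetration depth is 1/(Lambda * k)."""
    lam_bar = lam + 4.0 * math.pi * beta ** 2 / eps
    Lambda = 4.0 * math.pi * beta ** 2 / (lam_bar * eps * (1.0 + eps))
    speed = math.sqrt((lam_bar / rho) * (1.0 - Lambda ** 2))
    return speed, Lambda, lam_bar


# Hypothetical constants in arbitrary consistent units, for illustration only.
speed, Lambda, lam_bar = surface_wave(lam=1.0, beta=0.2, eps=10.0, rho=1.0)
bulk_speed = math.sqrt(lam_bar / 1.0)   # bulk shear wave with the same lam_bar
```

Since Λ = (λ̄ − λ)/[λ̄(1 + ε)] always lies between 0 and 1, the surface wave is slower than the bulk shear wave with modulus λ̄, and for β = 0 the function returns Λ = 0 and the ordinary velocity (λ/ρ)^{1/2}, in line with the remark that the wave then ceases to be localized at the surface.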


§18. Thermodynamic inequalities 

According to the formulae of §10, the total free energy can be written as the integral 
^ = \F{T:p,D)dV, (18.1) 


taken over all space. We shall suppose that the function D(r) which appears in the 
integrand satisfies only the equation 

divD=0 (18.2) 

inside a dielectric and the condition 

|D-df=47te (18.3) 


on the surface of a conductor which carries a given charge. These equations establish the 
relation between the field and its sources. Otherwise we regard the function D(r) 
as arbitrary, and in particular we do not require it to satisfy the second field equation 
curl E = 0 (where $\mathbf{E} = 4\pi\,\partial F/\partial\mathbf{D}$) or the boundary condition $\phi$ = constant on the surface
of a conductor. We shall show that these equations can then be obtained from the
condition that the integral (18.1) be a minimum with respect to changes in the function D(r)
which satisfy equations (18.2) and (18.3). It should be emphasized that the possibility of this
derivation is not a priori evident, since the field distributions which come into
consideration in determining the minimum of the integral (18.1) do not necessarily
correspond to physically possible states (because they do not satisfy all the field equations),
whereas, in the thermodynamic condition that the free energy be a minimum, only the
various physically possible states are considered.

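The problem of finding the minimum of the integral (18.1) with the subsidiary
conditions (18.2) and (18.3) is solved by Lagrange’s method of multipliers. We multiply the
variation of the condition (18.2) by some as yet undetermined function $-\phi/4\pi$ of the
coordinates, and that of the condition (18.3) by some undetermined constant $\phi_0/4\pi$, and
then equate to zero the sum of variations:

$\int \delta F\,dV - \int \frac{\phi}{4\pi}\,\mathrm{div}\,\delta\mathbf{D}\,dV + \frac{\phi_0}{4\pi}\oint \delta\mathbf{D}\cdot d\mathbf{f} = 0.$

In the first term we write†

$\delta F = (\partial F/\partial\mathbf{D})_{T,\rho}\cdot\delta\mathbf{D} = \mathbf{E}\cdot\delta\mathbf{D}/4\pi,$

and the second can be integrated by parts:

$\int \phi\,\mathrm{div}\,\delta\mathbf{D}\,dV = \oint \phi\,\delta\mathbf{D}\cdot d\mathbf{f} - \int \delta\mathbf{D}\cdot\mathrm{grad}\,\phi\,dV.$

The result is

$\int (\mathbf{E} + \mathrm{grad}\,\phi)\cdot\delta\mathbf{D}\,dV + \oint (\phi_0 - \phi)\,\delta\mathbf{D}\cdot d\mathbf{f} = 0.$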


† The free energy is the minimum for a given temperature. The variation is with respect to two independent
quantities D and $\rho$. Here we are interested only in the result of varying with respect to D. Varying with
respect to $\rho$ gives one of the usual conditions of thermal equilibrium, namely the constancy of the chemical
potential $\zeta$.
Hence we conclude that, throughout the volume, we must have $\mathbf{E} = -\mathrm{grad}\,\phi$ (and so
curl E = 0), and on the surface of a conductor $\phi = \phi_0$ = constant. These are the correct
equations for the field, and the Lagrange multiplier $\phi$ is its potential.

Similarly it can be shown that the equations for the electric induction are obtained from
the condition that the integral $\tilde{\mathscr{F}} = \int \tilde F(T, \rho, \mathbf{E})\,dV$ be a maximum, in which the function
E(r) is varied with the subsidiary conditions $\mathbf{E} = -\mathrm{grad}\,\phi$ and $\phi$ = constant on the
surface of a conductor.† For

$\delta\tilde{\mathscr{F}} = \int (\partial\tilde F/\partial\mathbf{E})\cdot\delta\mathbf{E}\,dV = \int \mathbf{D}\cdot\mathrm{grad}\,\delta\phi\,dV/4\pi$
$\quad = \oint \delta\phi\,\mathbf{D}\cdot d\mathbf{f}/4\pi - \int \delta\phi\,\mathrm{div}\,\mathbf{D}\,dV/4\pi = 0.$

The first integral is zero because $\delta\phi = 0$ on the surface, and from the second we find the
required equation div D = 0, since $\delta\phi$ is arbitrary in the volume.

If the body is not in an external electric field (in particular, if there are no charged
conductors), it may be possible to formulate the condition of thermodynamic equilibrium
as the condition that the total free energy (18.1) have an absolute (unconditional)
minimum. This amounts to the condition that the free energy density F be a minimum as a
function of the independent variable D: $\partial F/\partial\mathbf{D} = \mathbf{E}/4\pi = 0$, i.e. the field must be zero in all
space. If it is possible to find a distribution of the induction such that div D = 0, this state
will correspond to thermodynamic equilibrium.‡

Equating to zero the first variation of the free energy, we find necessary but not sufficient 
conditions for this energy to be a minimum. The determination of the sufficient conditions 
requires a discussion of the second variation. These conditions take the form of certain 
inequalities (called thermodynamic inequalities) and are the conditions which ensure the 
stability of the state of the body (see SP 1, §21). 

When $\mathbf{D} = \varepsilon\mathbf{E}$, the situation is much simplified, and the thermodynamic inequality of
interest here (relating to the dielectric properties of the body) becomes evident. The total
free energy is $\mathscr{F} = \int F_0\,dV + \int (D^2/8\pi\varepsilon)\,dV$. It is clear that this can have a minimum only if $\varepsilon > 0$,
since otherwise the integral could be made to take any large negative value by making D
large enough. Thus in this case nothing new is learnt, since we know already that the
permittivity must in fact be not only positive but greater than unity (see §14).

In the general case of an arbitrary relation between D and E, however, it is necessary to
consider the second variation of the integral (18.1), and to vary simultaneously both D and
$\rho$ (leaving only the temperature constant). In an isotropic body, $F(T, \rho, \mathbf{D})$ depends only on
the magnitude of the vector D, but its three components vary independently. We take the
direction of the vector D before variation as the z-axis. Then the change in the magnitude
of D is given in terms of the changes in its components, as far as the second-order terms, by
$\delta D = \delta D_z + (\delta D_x)^2/2D + (\delta D_y)^2/2D$. The first and second variations of the integral (18.1)


† The point that the thermodynamic potential $\tilde{\mathscr{F}}$ has a maximum, not a minimum like $\mathscr{F}$, with respect to the
variable E or D is a general one, and is accounted for as follows. Let the equilibrium value of a variable x, say
x = 0, be determined by the condition of thermodynamic equilibrium. Then the free energy $\mathscr{F}$ has, for a given T and
V, a minimum at x = 0. We thus have, at x = 0, $X \equiv (\partial\mathscr{F}/\partial x)_{T,V} = 0$, and near that point $X = ax$,
$\mathscr{F} = \mathscr{F}_0 + \tfrac12 ax^2$ with a > 0. With the thermodynamic potential $\tilde{\mathscr{F}} = \mathscr{F} - xX$, this gives
$\tilde{\mathscr{F}} = \mathscr{F}_0 - \tfrac12 ax^2 = \mathscr{F}_0 - X^2/2a$,
so that in equilibrium $\tilde{\mathscr{F}}$ has a maximum with respect to x or X. But both $\mathscr{F}$ and $\tilde{\mathscr{F}}$ have minima with respect to
any other variables y that are independent of x.

‡ Here we are considering bodies in which D need not be zero even if E = 0 (see §19). Otherwise we have
simply the trivial result E = D = 0 in all space.
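are both contained in the expression

$\int\left\{\frac{\partial F}{\partial D}\,\delta D + \frac{\partial F}{\partial\rho}\,\delta\rho + \frac12\frac{\partial^2 F}{\partial D^2}(\delta D)^2 + \frac{\partial^2 F}{\partial D\,\partial\rho}\,\delta D\,\delta\rho + \frac12\frac{\partial^2 F}{\partial\rho^2}(\delta\rho)^2\right\}dV.$

Substituting $\delta D$ and collecting the second-order terms, we find the second variation

$\int\frac{1}{2D}\frac{\partial F}{\partial D}\left[(\delta D_x)^2 + (\delta D_y)^2\right]dV + \int\left\{\frac12\frac{\partial^2 F}{\partial D^2}(\delta D_z)^2 + \frac{\partial^2 F}{\partial D\,\partial\rho}\,\delta D_z\,\delta\rho + \frac12\frac{\partial^2 F}{\partial\rho^2}(\delta\rho)^2\right\}dV.$ (18.4)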


These two terms are independent. The first is positive if $(1/D)\,\partial F/\partial D > 0$. But
$\partial F/\partial\mathbf{D} = \mathbf{E}/4\pi$, so that the derivative $\partial F/\partial D$ is positive or negative according as the vectors
D and E are in the same or opposite directions. Thus these vectors must be in the same
direction.

The conditions for the second term in (18.4) to be positive are

$\partial^2 F/\partial\rho^2 > 0$, (18.5)

$\frac{\partial^2 F}{\partial\rho^2}\frac{\partial^2 F}{\partial D^2} - \left(\frac{\partial^2 F}{\partial\rho\,\partial D}\right)^2 > 0$. (18.6)

Since $\partial F/\partial\rho = \zeta$, $\partial F/\partial D = E/4\pi$, the first of these gives

$(\partial\zeta/\partial\rho)_{D,T} > 0$, (18.7)

and the second can be rewritten as a Jacobian:

$\frac{\partial(\partial F/\partial D,\ \partial F/\partial\rho)}{\partial(D, \rho)} = \frac{1}{4\pi}\frac{\partial(E, \zeta)}{\partial(D, \rho)} > 0.$

Changing from the variables D, $\rho$ to D, $\zeta$, we have

$\frac{\partial(E, \zeta)}{\partial(D, \rho)} = \frac{\partial(E, \zeta)}{\partial(D, \zeta)}\,\frac{\partial(D, \zeta)}{\partial(D, \rho)} = \left(\frac{\partial E}{\partial D}\right)_{\zeta,T}\left(\frac{\partial\zeta}{\partial\rho}\right)_{D,T} > 0;$

by (18.7), this gives

$(\partial E/\partial D)_{\zeta,T} > 0$. (18.8)

Thus we have derived the required thermodynamic inequalities. In the absence of a field,
the inequality (18.7) becomes the usual condition that the isothermal compressibility is
positive: $(\partial P/\partial\rho)_T > 0$.† The inequality (18.8) gives $\varepsilon > 0$, since when $E \to 0$ the induction
$D = \varepsilon E$.

† It should be recalled that, in the absence of a field, $\zeta$ is the thermodynamic potential of unit mass and, by the
ordinary thermodynamic relations, its differential is

$d\zeta = dP/\rho - (S/\rho)\,dT$.

The remaining condition (that the specific heat is positive) is ignored.

Of the two inequalities (18.5), (18.6) the latter is the stronger; it may be violated while the
first is not, whereas the reverse is impossible. The equation

$\frac{\partial^2 F}{\partial\rho^2}\frac{\partial^2 F}{\partial D^2} - \left(\frac{\partial^2 F}{\partial\rho\,\partial D}\right)^2 = \frac{1}{4\pi}\frac{\partial(E, \zeta)}{\partial(D, \rho)} = 0$

corresponds to what is called the critical state (see SP 1, §83). This condition is more
conveniently written in a different form by multiplying it by the non-zero factor
$\partial(D, \rho)/\partial(E, \rho)$:

$\partial(E, \zeta)/\partial(E, \rho) = (\partial\zeta/\partial\rho)_{E,T} = 0$. (18.9)

The critical states occupy a curve in the ET-plane, which is a singularity of the
thermodynamic functions of the body, just as the critical point is a singularity in the
absence of the field.


§19. Ferroelectrics 

The various crystalline modifications of a given substance may include some which are 
pyroelectric and some which are not. If the change from one to the other takes place by 
means of a second-order phase transition, then near the transition point the substance has 
a number of unusual properties which distinguish it from ordinary pyroelectrics; these are 
called ferroelectric substances. 

In an ordinary pyroelectric crystal, a change in the direction of the spontaneous 
polarization involves a considerable reconstruction of the crystal lattice. Even if the final 
result of this reconstruction is energetically favourable, its realization may still be 
impossible because it would require the surmounting of very high energy barriers. 

In a ferroelectric body, however, the situation is quite different because, near a second- 
order phase transition point, the arrangement of the atoms in the crystal lattice of the 
pyroelectric phase is only slightly different from the arrangement in the non-pyroelectric 
lattice (and so the spontaneous polarization also is small). For this reason the change in 
direction of the spontaneous polarization here requires only a slight reconstruction of the 
lattice and can occur quite easily. 

The actual nature of the ferroelectric properties of a body depends on its crystal
symmetry. The direction of the spontaneous polarization of the pyroelectric phase (which
we shall call the ferroelectric axis) is determined by the structure of the non-pyroelectric
phase beyond the transition point. In some cases it is uniquely determined, in the sense that
the ferroelectric axis can lie in only one, crystallographically determinate, direction; the
direction of the spontaneous polarization is then determined apart from sign, since in the
non-pyroelectric phase the two opposite directions parallel to the ferroelectric axis must be
equivalent (otherwise this form of the crystal would also be pyroelectric). In other cases, the
symmetry of the non-pyroelectric phase may be such as to allow spontaneous polarization
in any of several crystallographically equivalent directions.†

† The first type includes sodium potassium tartrate, whose non-pyroelectric phase has orthorhombic
symmetry; the ferroelectric axis appears in it in a completely definite crystallographic direction (one of the
twofold axes), and the lattice becomes monoclinic.

An example of the second type is barium titanate. Its non-pyroelectric modification has a cubic lattice and any
of the three fourfold axes may become the ferroelectric axis. After the spontaneous polarization has appeared at the
transition point, these directions, of course, are no longer equivalent. The ferroelectric axis becomes the only
fourfold axis, and the lattice becomes tetragonal.
The occurrence of polarization is always associated with a reduction in the symmetry of
the crystal. We can therefore refer to the pyroelectric and non-pyroelectric phases as the
unsymmetrical and symmetrical phases respectively, using the terminology of SP 1.

We shall show how the theory of ferroelectricity can be developed in terms of 
the general theory of Landau second-order phase transitions, as was first done by 

We take the dielectric polarization vector P of the substance as the order parameter
whose magnitude determines the difference between the unsymmetrical and symmetrical
phase structures. This means that P will be regarded as an independent
thermodynamic variable whose actual value (as a function of the temperature, the field,
etc.) is then determined from the condition of thermal equilibrium, namely that the
thermodynamic potential be a minimum.

Let us consider first the case where the position of the ferroelectric axis, which we take as
the z-axis, is uniquely determined. The dielectric properties of the crystal in the x and y
directions then exhibit no anomalies, and to investigate the properties in the z-direction we
need consider only those terms in the thermodynamic potential which contain $P_z$. Near the
transition point, the order parameter $P_z$ is small and the thermodynamic potential $\Phi$ can be
expanded in powers of $P_z$. Since the two directions of the z-axis are equivalent, the
expansion cannot depend on the sign of $P_z$ and therefore contains only even powers. As far
as the fourth-order terms, we have

$\Phi = \Phi_0 + AP_z^2 + BP_z^4$. (19.1)

In the symmetrical phase, A > 0, and $P_z = 0$ corresponds to a minimum of the
thermodynamic potential. For spontaneous polarization to occur, A must be negative; it is
therefore zero at the phase transition point. The Landau theory assumes that A can be
expanded in integral powers of $T - T_c$, where $T_c$ is the phase transition temperature; near
this point, we write $A = a(T - T_c)$, a being a constant (independent of the temperature). We
shall take the specific case where a > 0, so that the unsymmetrical phase corresponds to
temperatures $T < T_c$. The condition for the state to be stable at the point $T = T_c$ is that
the coefficient B be positive at that point and therefore throughout a neighbourhood of it.
In what follows, B will denote $B(T_c)$.

If the electric field in the body is not zero, further terms appear in the thermodynamic
potential. To find these, we start from the relation

$4\pi\,\partial\tilde\Phi/\partial\mathbf{E} = -\mathbf{D} = -\mathbf{E} - 4\pi\mathbf{P}$.

Integration with a fixed value of the independent variable P, using the fact that $\Phi$ and $\tilde\Phi$ are
the same when E = 0, gives

$\tilde\Phi(\mathbf{P}, \mathbf{E}) = \Phi(\mathbf{P}, 0) - \mathbf{E}\cdot\mathbf{P} - E^2/8\pi$.

With the electric field in the z-direction, and $\Phi(\mathbf{P}, 0)$ from (19.1), we have

$\tilde\Phi = \Phi_0 + a(T - T_c)P_z^2 + BP_z^4 - E_zP_z - E_z^2/8\pi$. (19.3)

† Actually, many ferroelectric transitions are not second-order, but are first-order transitions close to
second-order. This seems to be due to the fluctuation effect mentioned in SP 1, end of §146.
The presence of the term $-E_zP_z$ has the result that in any field $E_z$, however weak, the
order parameter becomes different from zero at every temperature; the field polarizes
the non-pyroelectric phase and thus reduces its symmetry. The qualitative difference
between the two phases thereby disappears, and accordingly so does the discrete transition
point; the transition is “smoothed out”.†

The thermodynamic potential $\tilde\Phi$ in equilibrium must be a minimum, for any given field
$E_z$. Differentiation of (19.3) at constant $E_z$ gives

$2P_z a(T - T_c) + 4BP_z^3 = E_z$. (19.4)

This is the basic relation between the field and the polarization in a ferroelectric.‡
When $T > T_c$, in the non-pyroelectric phase, $P_z$ vanishes with $E_z$. As $E_z$ increases, the
polarization at first increases linearly, $P_z = \kappa E_z$, with susceptibility

$\kappa = 1/2a(T - T_c)$, $T > T_c$, (19.5)

which increases without limit as $T \to T_c$. The induction $D_z = (1 + 4\pi\kappa)E_z$ also increases
linearly with $E_z$. Near the transition point, $\kappa$ is large, and we have to the same accuracy

$\varepsilon \cong 4\pi\kappa = 2\pi/a(T - T_c)$. (19.6)

In sufficiently strong fields, the polarization increases according to $P_z = (E_z/4B)^{1/3}$.
When $T < T_c$, in the pyroelectric phase, $P_z = 0$ cannot correspond to a stable state. For
$E_z = 0$, we find from (19.4) the spontaneous polarization of this phase,

$P_{z0} = \pm\sqrt{a(T_c - T)/2B}$. (19.7)

The dielectric susceptibility of the phase can be found as the derivative $dP_z/dE_z$ as $E_z \to 0$.
From (19.4),

$[-2(T_c - T)a + 12BP_z^2]\,dP_z/dE_z = 1$, (19.8)

and substitution of (19.7) gives

$\kappa = [dP_z/dE_z]_{E_z \to 0} = 1/4a(T_c - T)$, $T < T_c$. (19.9)

This is half the susceptibility of the non-pyroelectric phase for the same value of $|T_c - T|$.
In sufficiently weak fields, the polarization is $P_z = P_{z0} + \kappa E_z$, the induction is
$D_z = D_{z0} + \varepsilon E_z$, where $D_{z0} = 4\pi P_{z0}$, and the permittivity is

$\varepsilon \cong 4\pi\kappa = \pi/a(T_c - T)$. (19.10)
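
The relations (19.4) to (19.9) can be checked numerically. The sketch below is illustrative only: the function names and the parameter values a = B = 1, T_c = 100 (arbitrary units) are assumptions made for the example and are not taken from the text. It evaluates the equation of state (19.4), the spontaneous polarization (19.7) and the zero-field susceptibility obtained from (19.8).

```python
import math


def field_from_polarization(p, a, b, t, tc):
    """Equation of state (19.4): E_z = 2 a (T - T_c) P_z + 4 B P_z**3."""
    return 2.0 * a * (t - tc) * p + 4.0 * b * p**3


def spontaneous_polarization(a, b, t, tc):
    """Positive root of (19.7); zero in the non-pyroelectric phase (T >= T_c)."""
    return math.sqrt(a * (tc - t) / (2.0 * b)) if t < tc else 0.0


def susceptibility(a, b, t, tc):
    """Zero-field dP_z/dE_z from (19.8), taken at the equilibrium polarization."""
    p0 = spontaneous_polarization(a, b, t, tc)
    return 1.0 / (2.0 * a * (t - tc) + 12.0 * b * p0**2)


# Assumed illustrative values (arbitrary units).
A_COEF, B_COEF, T_C = 1.0, 1.0, 100.0
kappa_above = susceptibility(A_COEF, B_COEF, T_C + 2.0, T_C)  # 1/2a(T - T_c)
kappa_below = susceptibility(A_COEF, B_COEF, T_C - 2.0, T_C)  # 1/4a(T_c - T)
```

With these values `kappa_below` is half of `kappa_above`, which is the statement made above for equal |T_c − T|.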

Figure 14 (p. 80) shows the function $P_z(E_z)$ given by (19.4) for $T < T_c$. First of all, it
should be noted that the dashed part cc' does not correspond to stable states; from (19.8)
written in the form

$(dP_z/dE_z)(\partial^2\tilde\Phi/\partial P_z^2)_{E_z} = 1$

we see that $dP_z/dE_z < 0$ implies that $\partial^2\tilde\Phi/\partial P_z^2 < 0$, i.e., the thermodynamic potential $\tilde\Phi$
has a maximum, not a minimum. The ordinates of c and c' are given by the equation
$dE_z/dP_z = 0$, and we conclude that the possible values of $|P_z|$ in the pyroelectric phase are
restricted by the condition

$P_z^2 > (T_c - T)a/6B$. (19.11)

† See SP 1, §144. The discussion below is largely a repetition of the one given there.

‡ If P is expressed in terms of E from (19.4) and substituted in (19.3), we find the potential $\tilde\Phi(E)$ as a function of E
only. By the condition $\partial\tilde\Phi(P, E)/\partial P = 0$, the equation $D = -4\pi\,\partial\tilde\Phi/\partial E$ is valid both for $\tilde\Phi(E)$ and for $\tilde\Phi(P, E)$
(which is differentiated at constant P).

If we consider states of a ferroelectric with given values of $E_z$, there is still an ambiguity
in the value of $P_z$, in the range of abscissae between c and c', and the question arises of the
physical significance of the two values. We shall assume the ferroelectric to be a
homogeneous flat slab, with the ferroelectric axis normal to it, lying between the plates of a
capacitor, which are maintained at given potentials, i.e. which set up a given uniform
field $E_z$.

For given potentials on the conductors, the condition of stability requires that the
thermodynamic potential $\tilde\Phi$ be a minimum. In particular, for E = 0 there are two states in
which P has opposite signs (the points a and a' in Fig. 14) but $\tilde\Phi$ (= $\Phi$) is the same. These
two states, therefore, are equally stable, i.e. they are two phases which can coexist.

Hence it is clear that the portions ac and a'c' of the curve correspond to states which are
metastable but not absolutely stable. It is easy to see directly that the values of $\tilde\Phi$ on ac and
on a'c' are in fact greater than its values on a'b' and ab for the same value of $E_z$. The
ordinates of a and a' are given by formula (19.7). Thus the range of metastability is

$(T_c - T)a/6B < P_z^2 < (T_c - T)a/2B$. (19.12)
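
The limits of stability can be made concrete with a small numerical sketch. It assumes only the equation of state (19.4) and the conditions (19.11), (19.12); the function names are hypothetical. The first function gives the ordinate of the points c, c', where dE_z/dP_z = 0, together with the magnitude of the field there; the second sorts a homogeneous state with polarization P_z (for T < T_c) into the unstable, metastable or stable range.

```python
import math


def turning_point(a, b, t, tc):
    """Return (P_c, E_c) at the points c, c' where dE_z/dP_z = 0 (needs T < T_c)."""
    tau = tc - t
    if tau <= 0.0:
        raise ValueError("turning points exist only below the transition point")
    p_c = math.sqrt(a * tau / (6.0 * b))
    # Field magnitude from the equation of state (19.4) at P_z = P_c.
    e_c = abs(2.0 * a * (t - tc) * p_c + 4.0 * b * p_c**3)
    return p_c, e_c


def classify_state(p, a, b, t, tc):
    """Classify a homogeneous state by P_z**2, following (19.11) and (19.12)."""
    tau = tc - t
    p_sq = p * p
    if p_sq < a * tau / (6.0 * b):
        return "unstable"      # dashed part cc' of the curve
    if p_sq < a * tau / (2.0 * b):
        return "metastable"    # portions ac and a'c'
    return "stable"            # branches ab and a'b'
```

The field returned by `turning_point` is the largest field, directed against the polarization, at which the metastable state can still exist within this model.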

The existence of these two phases with E = 0 is very important, since it means that a
ferroelectric body can be divided into a number of separate regions or domains in which the
polarization is in opposite directions. On the surfaces separating these domains, the
normal component of D and the tangential component of E must be continuous. The latter
condition is satisfied identically, because E = 0. From the former condition it follows that
the domain boundaries must be parallel to the z-axis.

The actual shapes and sizes of the domains are determined by the condition that the total
thermodynamic potential of the body should be a minimum.†

If we are not interested in the details of the structure, and consider portions of the body
which are large compared with the domains, we can use the polarization P averaged over
such portions. Its component $P_z$ can evidently take values in the range between the
ordinates of a and a' in Fig. 14, i.e.

$-\sqrt{(T_c - T)a/2B} < P_z < \sqrt{(T_c - T)a/2B}$. (19.13)

In other words, if $P_z$ in Fig. 14 is taken as the polarization averaged in this way, the vertical
segment aa' corresponds to the region of domain structure, and the thick curve baa'b' gives
all stable states of the body.

Let us consider ferroelectrics which belong (in the non-pyroelectric phase) to the cubic
system.† The cubic symmetry admits two independent fourth-order invariants formed
from the components of the vector P, which may be taken as $(P_x^2 + P_y^2 + P_z^2)^2$ and
$P_x^2P_y^2 + P_x^2P_z^2 + P_y^2P_z^2$. Hence the expansion of the thermodynamic potential near the
transition point (when E = 0) is of the form

$\Phi = \Phi_0 + a(T - T_c)(P_x^2 + P_y^2 + P_z^2) + B(P_x^2 + P_y^2 + P_z^2)^2$
$\quad + C(P_x^2P_y^2 + P_x^2P_z^2 + P_y^2P_z^2)$, (19.14)

where a, B, C are constants, and the x, y, z axes are along the three fourfold axes of
symmetry.

The sum of the fourth-order terms in (19.14) must be essentially positive. Hence we must
have

$B > 0$, $3B + C > 0$. (19.15)

The spontaneous polarization of a ferroelectric when E = 0 is determined by the
condition that $\Phi$ should be a minimum as a function of P. In particular, since the second-order
term and the first of the fourth-order terms are independent of the direction of P, the
direction of the spontaneous polarization is determined by the condition that the next term
be a minimum for a given absolute value P. Two cases are possible. If C > 0, the minimum
of this term corresponds to P being along any one of the axes x, y, z, i.e. along any of the
three edges of the cube. If, however, C < 0, the minimum value occurs when P is along any
one of the spatial diagonals of the cube, i.e. when $P_x^2 = P_y^2 = P_z^2 = \tfrac13 P^2$. In the former
case the pyroelectric phase of the ferroelectric has tetragonal symmetry, and in the latter
it has rhombohedral symmetry.
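
The dependence of the preferred direction on the sign of C can be illustrated by evaluating the anisotropic fourth-order term of (19.14) for a unit vector along a few symmetric directions. This is only a sketch: it compares three candidate directions (cube edge, face diagonal, spatial diagonal) and does not search over all orientations; the names `anisotropy`, `DIRECTIONS` and `preferred_direction` are introduced here for the example.

```python
def anisotropy(c, n):
    """Value of C (Px^2 Py^2 + Px^2 Pz^2 + Py^2 Pz^2) for unit |P| along n."""
    x, y, z = n
    norm_sq = x * x + y * y + z * z
    x2, y2, z2 = x * x / norm_sq, y * y / norm_sq, z * z / norm_sq
    return c * (x2 * y2 + x2 * z2 + y2 * z2)


# Candidate directions only; not an exhaustive minimization over orientations.
DIRECTIONS = {
    "edge": (1.0, 0.0, 0.0),
    "face diagonal": (1.0, 1.0, 0.0),
    "body diagonal": (1.0, 1.0, 1.0),
}


def preferred_direction(c):
    """Name of the candidate direction that minimizes the anisotropic term."""
    return min(DIRECTIONS, key=lambda name: anisotropy(c, DIRECTIONS[name]))
```

The bracket in the anisotropic term takes the values 0, 1/4 and 1/3 for the three directions, so that the cube edge is selected for C > 0 and the spatial diagonal for C < 0, as stated above.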

Let us consider in more detail, for example, the first case (C > 0), and take as the z-axis
the direction of the spontaneous polarization below the transition point. The magnitude
$P_0$ of this polarization is determined (when E = 0) by the minimum of the expression
$-a(T_c - T)P^2 + BP^4$, whence

$P_0^2 = a(T_c - T)/2B$. (19.16)

To find the polarization as a function of the field E, we must add to (19.14) a term $-\mathbf{P}\cdot\mathbf{E}$
(thus changing to the potential $\tilde\Phi$) and equate to zero the derivative $\partial\tilde\Phi/\partial\mathbf{P}$.

When the field E is weak, $P_x$, $P_y$, and $P_z - P_0$ are small. Omitting from the equations the

P p ‘‘"U Ct, are envisaged here. The cubic classes Zand T, allow also a third-order invariant 

■ninimurmr “Editions, the state with P = 0 certainly could not satisfy the stability condition 
of Ti n ^ second-order phase transition is therefore impossible. The symmetry of the class O (and that 

«ructu^(cr§52T™"‘‘'’‘ incommensurate 
terms of the second and higher orders of smallness, and substituting $P_0$ from (19.16),
we find the longitudinal polarization

$P_z - P_0 = E_z/4a(T_c - T)$ (19.17)

and the transverse polarization

$P_x = BE_x/aC(T_c - T)$ (19.18)

(and similarly for $P_y$). Above the transition point, in the non-pyroelectric phase, the
dielectric susceptibility of a cubic ferroelectric is the same in all directions:

$\mathbf{P} = \mathbf{E}/2a(T - T_c)$. (19.19)
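
The weak-field formulae (19.17) and (19.18) can be verified by substituting them into the equilibrium condition and confirming that what remains is of the second order in the field. The sketch below assumes the potential (19.14) with the added term −P·E, for the case C > 0 and T < T_c; the function names and the numerical values used in checking it are illustrative assumptions.

```python
import math


def gradient(p, e, a, b, c, t, tc):
    """Components of the derivative of (19.14) minus P.E with respect to P."""
    p_sq = sum(pi * pi for pi in p)
    return tuple(
        2.0 * a * (t - tc) * pi          # second-order term
        + 4.0 * b * p_sq * pi            # isotropic fourth-order term
        + 2.0 * c * pi * (p_sq - pi * pi)  # anisotropic fourth-order term
        - ei                             # field term
        for pi, ei in zip(p, e)
    )


def weak_field_polarization(e, a, b, c, t, tc):
    """Polarization from (19.16)-(19.18), with P_0 along z (C > 0, T < T_c)."""
    tau = tc - t
    p0 = math.sqrt(a * tau / (2.0 * b))
    ex, ey, ez = e
    return (
        b * ex / (a * c * tau),      # transverse, (19.18)
        b * ey / (a * c * tau),      # transverse, by symmetry
        p0 + ez / (4.0 * a * tau),   # longitudinal, (19.17)
    )
```

For a field of order 10⁻⁴ (with a = B = C = 1 and T_c − T = 1) the components of `gradient` at the polarization given by `weak_field_polarization` are of order 10⁻⁸, i.e. quadratic in the field, as they should be for a solution correct to first order.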


Finally, let us briefly consider the elastic properties of ferroelectrics. According to its
crystal class, the non-pyroelectric phase of a ferroelectric may or may not be piezoelectric.
Let us begin with the former case, and suppose that the symmetry admits a piezoelectric
(linear) relation between the deformation and the polarization in the direction of the
ferroelectric (z) axis. These include the classes $D_2$, $D_{2d}$ and $S_4$; in each case the polarization
$P_z$ appears in the piezoelectric part of the thermodynamic potential through a term
proportional to $P_z\sigma_{xy}$. In the elastic energy of these crystals, the component $\sigma_{xy}$ appears
in a term proportional to $\sigma_{xy}^2$. Thus the thermodynamic potential near the transition point is

$\tilde\Phi = \Phi_0 + a(T - T_c)P_z^2 + BP_z^4 - \gamma P_z\sigma_{xy} - \mu\sigma_{xy}^2 - E_zP_z - E_z^2/8\pi$, (19.20)

where for brevity we have put $\gamma_{z,xy} = \gamma$, $\mu_{xyxy} = \mu$.† The terms containing the other
components of P and $\sigma_{ik}$ are of no interest, since they lead to no anomaly of the
piezoelectric properties near the transition point.

Equating to zero the derivative $\partial\tilde\Phi/\partial P_z$ with $\sigma_{xy}$ constant, we obtain

$E_z = 2a(T - T_c)P_z + 4BP_z^3 - \gamma\sigma_{xy}$. (19.21)

The components of the strain tensor are found by differentiating the thermodynamic
potential (19.20) with respect to the corresponding components $\sigma_{ik}$ (see (17.4)):‡

$u_{xy} = \tfrac12\gamma P_z + \mu\sigma_{xy}$. (19.22)

In the non-pyroelectric phase when E is small we can neglect the term in $P_z^3$ in (19.21):

$E_z = 2a(T - T_c)P_z - \gamma\sigma_{xy}$.

Substituting $P_z$ in (19.22), we find

$u_{xy} = \dfrac{\gamma E_z}{4a(T - T_c)} + \left[\mu + \dfrac{\gamma^2}{4a(T - T_c)}\right]\sigma_{xy}$.

The coefficient of $\sigma_{xy}$ in this formula represents the elastic modulus for deformations
in which the field $E_z$ is kept constant, while $\mu$ in formula (19.22) is the modulus for constant
polarization $P_z$. Hence we can write

$\mu^{(E)} = \mu^{(P)} + \gamma^2/4a(T - T_c)$, (19.23)

where the superscripts indicate the nature of the deformation. We see that the two
coefficients behave entirely differently; whereas $\mu^{(P)}$ is a finite constant, $\mu^{(E)}$ increases
without limit as the transition point is approached.†

† The non-pyroelectric phase of a ferroelectric is piezoelectric if it belongs to one of eight out of the ten classes

‡ See also the footnote to §17, Problem 1, concerning differentiation with respect to the components of a
symmetrical tensor.

In the pyroelectric phase, formula (19.22) shows that the spontaneous polarization
results in a certain deformation of the body. If there are no internal stresses and the field E
is zero, the strain is proportional to $P_{z0}$, i.e. varies with temperature as $\sqrt{T_c - T}$.

If the symmetry (cubic, for example) of the non-pyroelectric phase of a ferroelectric does
not admit a linear piezoelectric effect, then the first non-vanishing terms in an expansion of
the thermodynamic potential in powers of $\sigma_{ik}$ and P are quadratic in the components of P,
i.e. they are of the form

$-\gamma_{iklm}P_iP_k\sigma_{lm}$, (19.24)

where $\gamma_{iklm}$ is a tensor of rank four, symmetrical with respect to the pairs of suffixes i, k and
l, m. In such cases, the strain in the pyroelectric phase due to the spontaneous polarization
is quadratic in $P_0$, and accordingly varies with temperature as $T_c - T$.

Doubt might be cast on the legitimacy of using the expression (19.24) in the
thermodynamic potential, on the grounds that, as stated in §17, this potential can be used
only when quadratic effects are neglected. However, the ferroelectrics form an exception
because, near the transition point, the field E is small compared with the polarization P or
induction D, because of the unlimited increase in the dielectric susceptibility. The use of the
thermodynamic potential involves the neglect of the quantities of the order of $EDu_{ik}$ (or,
what is the same thing, $ED\sigma_{ik}$), whereas the expression (19.24) is of the order of $P^2\sigma_{ik}$.

§20. Improper ferroelectrics 

The theory of ferroelectrics given in §19 is based on identifying the polarization vector of
the crystal with the order parameter which determines the change in the crystal symmetry
in the phase transition. This is not always permissible, however. It may happen that the
occurrence of spontaneous polarization does not in itself entirely determine the nature of
the change in the crystal structure.

It is known (see SP 1, §145) that the order parameter in a second-order phase transition
is a quantity, or a set of quantities, transformed by some irreducible (not the unit)
representation of the symmetry group of the original (“symmetrical”) phase. The
transformation properties of the order parameter determine the nature of the change (the
decrease) in the symmetry at the phase transition. The specific physical nature of the change
is unimportant; the order parameter may be taken as any of various physical quantities,
provided that they are related to one another by linear expressions and therefore have the
same transformation properties.

The choice of the vector P as the order parameter is equivalent to assuming that this is
transformed by the same representation as the components of a (polar) vector. If the phase
transition occurs with no change in the unit cell of the lattice (or with only a strain), the
irreducible representations concerned are those of the point symmetry groups (the crystal
classes). In the biaxial classes (§13), each component of a vector is transformed by one of the
one-dimensional representations. The same applies to a component of a vector along the
principal axis of symmetry (threefold, fourfold or sixfold) in uniaxial crystals. For these
representations, the order parameter can be the corresponding component of the vector P,
and the theory based on the thermodynamic potential (19.1) is applicable to them. The
components of P in a plane perpendicular to the principal axis of symmetry in a uniaxial
crystal are transformed by a two-dimensional irreducible representation and serve as
the order parameter for that representation. Lastly, in crystals with cubic symmetry, all
three components of the vector are transformed by a single three-dimensional representation;
this corresponds to the theory of ferroelectricity based on the thermodynamic
potential (19.14).

† The modulus $\mu^{(D)} = \mu + \gamma^2/8\pi$, which determines the strain for a constant induction, is also a constant.

There exist, however, ferroelectric transitions in which the order parameter is
transformed by an irreducible representation of the symmetrical phase which does not
correspond to the components of a vector. In such cases, the order parameter is not the
polarization but a physically different quantity; the spontaneous polarization arises as a
secondary effect (it is assumed, of course, that the symmetry of the unsymmetrical phase
allows pyroelectricity). These substances are called improper ferroelectrics; they differ
considerably from ordinary ferroelectrics as regards the nature of the anomalies.†
Here belong all ferroelectric transitions in which the unit cell changes, i.e.
the translational symmetry of the lattice changes (the corresponding irreducible
representations are certainly not realized by vector quantities invariant under translations);
they may also be transitions without change of translational symmetry (the order
parameter is transformed by an irreducible representation of the point group, not
corresponding to the components of a vector).

In an ordinary ferroelectric transition, when the change in symmetry is entirely
determined by the polarization vector, the transition is to a higher sub-group (that allows
pyroelectricity) of the space group of the original (non-pyroelectric) phase. In an improper
ferroelectric transition, the pyroelectric phase belongs to a sub-group of lower symmetry.

The specific thermodynamic properties of improper ferroelectrics may vary considerably,
like the transformational properties of quantities that are transformed by
different irreducible representations of the space groups. Let us now consider (again in
terms of the Landau theory of phase transitions) one formal example to illustrate some
important fundamental points.

We shall take a transition (without change in the unit ce ) ro . . 

crystal of the class C,. to the class C„ which allows spontaneous ^ 1 , 1 ,1 

parameter having two components h, and '"'ftira.ion vSm in tW 

representation E. of the group C„; thecomPO"“« P • f P»esLation , 


representation£,„oi inegioup - x, ^ f 

plane perpendicular to the C, avis being ('’'f 


the order parameter iji- m and the polarization P^, y or err vector f 

must be mixed invariants formed from the same quantities, proda^' 

There are two such invariants in this case: the real and imaginary parts of the product
$(\eta_1 + i\eta_2)^2(P_x + iP_y)$. We thus obtain an expansion in the form

$\tilde\Phi = \Phi_0 + a(T - T_c)\eta^2 + B\eta^4 + \kappa P^2 + C_1\eta^2[P_x(\gamma_1^2 - \gamma_2^2) - 2P_y\gamma_1\gamma_2]$
$\quad + C_2\eta^2[P_y(\gamma_1^2 - \gamma_2^2) + 2P_x\gamma_1\gamma_2] - \mathbf{E}\cdot\mathbf{P} - E^2/8\pi$, (20.1)

where $\eta^2 = \eta_1^2 + \eta_2^2$, $\gamma_i = \eta_i/\eta$; the vectors E and P are in the xy-plane.

The order parameter and the polarization are determined by the condition that $\tilde\Phi$ be a
minimum (for constant E). Here we shall give only some characteristic results that are
evident without actually making the calculation. The order parameter in the unsymmetrical
phase is found to be proportional to $(T_c - T)^{1/2}$, as for any second-order transition
in the Landau theory. The polarization arises as an effect of the second order in $\eta$, and is
therefore proportional to $T_c - T$. The dielectric susceptibility does not tend to infinity as
$T \to T_c$, as in ordinary ferroelectrics, because it is not here determined by a coefficient of $\eta^2$
that tends to zero. It does, however, have a finite discontinuity at the transition point. This
is because, in the symmetrical phase, the order parameter $\eta = 0$ and is not affected by the
field E; in the unsymmetrical phase it does change, and this gives a further contribution to
the susceptibility.

An improper ferroelectric transition is possible only when the order parameter has more
than one component. With a one-component parameter $\eta$, the only possible mixed
invariant linear in P is $\eta P_z$, where $P_z$ is one component of the vector P (since $\eta^2$ is an
invariant for a one-dimensional representation). This would mean, however, that $\eta$ and $P_z$
had the same transformational properties, and therefore that $P_z$ itself could be chosen as
the order parameter.
STEADY CURRENT


§21. The current density and the conductivity 


Let us now consider the steady motion of charges in conductors, i.e. steady electric
currents. We shall denote by j the mean charge flux density or electric current density.† In a
steady current, the spatial distribution of j is independent of time, and satisfies the equation

div j = 0, (21.1)


which states that the mean total charge in any volume of the conductor remains constant. 

The electric field in the conductor in which a steady current flows is constant, and 
therefore satisfies the equation 

curl E = 0, (21.2) 


i.e. it is a potential field.

Equations (21.1) and (21.2) must be supplemented by an equation relating j and E. This
equation depends on the properties of the conductor, but in the great majority of cases it
may be supposed linear (Ohm's law). If the conductor is homogeneous and isotropic, the
linear relation is a simple proportionality:

$\mathbf{j} = \sigma\mathbf{E}.$ (21.3)

The coefficient $\sigma$ depends on the nature and state of the conductor; it is called the electrical
conductivity.

In a homogeneous conductor, $\sigma$ = constant and, substituting (21.3) in (21.1), we have
div E = 0. In this case the electric field potential satisfies Laplace's equation $\triangle\phi = 0$.

At a boundary between two conducting media, the normal component of the current
density must, of course, be continuous. Moreover, by the general condition that the
tangential field component be continuous (which follows from curl E = 0; cf. (6.9)),
the ratio $\mathbf{j}_t/\sigma$ must be continuous. Thus the boundary conditions on the current
density are

$j_{n1} = j_{n2}, \quad \mathbf{j}_{t1}/\sigma_1 = \mathbf{j}_{t2}/\sigma_2,$ (21.4)

or, as conditions on the field,

$\sigma_1 E_{n1} = \sigma_2 E_{n2}, \quad \mathbf{E}_{t1} = \mathbf{E}_{t2}.$ (21.5)
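The field conditions in (21.5), continuity of $\sigma E_n$ and of $E_t$, imply that lines of current are refracted at the interface, the quantity $\tan\theta/\sigma$ being the same on both sides ($\theta$ is the angle between the field and the normal). The following is only a minimal numerical sketch of this; the conductivities and field components are hypothetical values chosen for illustration.

```python
import math

def refract_field(E1_n, E1_t, sigma1, sigma2):
    """Field on side 2 of a boundary from the field on side 1.

    Uses sigma1*E_n1 = sigma2*E_n2 (continuity of the normal current)
    and E_t1 = E_t2 (continuity of the tangential field).
    """
    E2_n = sigma1 * E1_n / sigma2
    E2_t = E1_t
    return E2_n, E2_t

def tan_angle_to_normal(E_n, E_t):
    """Tangent of the angle between the field and the normal."""
    return E_t / E_n

# Hypothetical values (arbitrary units), for illustration only.
sigma1, sigma2 = 5.0, 1.0
E1_n, E1_t = 1.0, 2.0

E2_n, E2_t = refract_field(E1_n, E1_t, sigma1, sigma2)

# Normal component of the current density on each side.
j1_n = sigma1 * E1_n
j2_n = sigma2 * E2_n

# "Refraction law" for the current lines: tan(theta)/sigma is continuous.
ratio1 = tan_angle_to_normal(E1_n, E1_t) / sigma1
ratio2 = tan_angle_to_normal(E2_n, E2_t) / sigma2
```

In this sketch the current lines bend away from the normal on entering the better conductor, in the same way as lines of electric induction at the surface of a dielectric.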


† In this chapter we ignore the magnetic field due to the current, and therefore the reaction of that field on
the current. If this effect is to be taken into account, the definition of the current density must be refined, which we
do in §30.

At a boundary between a conductor and a non-conductor we have simply $j_n = 0$, or
$E_n = 0$.†

An electric field in the presence of a current does mechanical work on the current- 
carrying particles moving in the conductor; the work done per unit time and volume is 
evidently equal to the scalar product j-E. This work is dissipated into heat in the 
conductor. Thus the quantity of heat evolved per unit time and volume in a homogeneous 
conductor is 

$\mathbf{j}\cdot\mathbf{E} = \sigma E^2 = j^2/\sigma.$ (21.6)

This is Joule's law.‡

The evolution of heat results in an increase in the entropy of the body. When an amount
of heat $dQ = \mathbf{j}\cdot\mathbf{E}\,dV$ is evolved, the entropy of the volume element dV increases by dQ/T.
The rate of change of the total entropy of the body is therefore

$d\mathscr{S}/dt = \int (\mathbf{j}\cdot\mathbf{E}/T)\,dV.$ (21.7)

Since the entropy must increase, this derivative must be positive. Putting $\mathbf{j} = \sigma\mathbf{E}$, we see
that the conductivity $\sigma$ must therefore be positive.

In an anisotropic body (a single crystal), the directions of the vectors j and E are in 
general different, and the linear relation between them is 

$j_i = \sigma_{ik}E_k,$ (21.8)

where the quantities $\sigma_{ik}$ form a tensor of rank two, the conductivity tensor, which is
symmetrical (see below).

The following remark should be made here. The symmetry of the crystal would admit
also an inhomogeneous term in the linear relation between j and E, giving $j_i = \sigma_{ik}E_k + j_i^{(0)}$,
with $\mathbf{j}^{(0)}$ a constant vector. The presence of this term would mean that the conductor was
“pyroelectric”, there being a non-zero field in it when j = 0. In reality, however, this is
impossible, because the entropy must increase: the term $\mathbf{j}^{(0)}\cdot\mathbf{E}$ in the integrand in (21.7)
could take either sign, and so $d\mathscr{S}/dt$ could not be invariably positive.

Just as, for an isotropic medium, $d\mathscr{S}/dt > 0$ leads to $\sigma > 0$, so for an anisotropic
medium this condition means that the principal values of the tensor $\sigma_{ik}$ must be positive.

The dependence of the number of independent components of the tensor $\sigma_{ik}$ on the
symmetry of the crystal is the same as for any symmetrical tensor of rank two (see §13): for
biaxial crystals, all three principal values are different, for uniaxial crystals two are equal,
and for cubic crystals all three are equal, i.e. a cubic crystal behaves as an isotropic body as
regards its conductivity.

The symmetry of the conductivity tensor

$\sigma_{ik} = \sigma_{ki}$ (21.9)

is a consequence of the symmetry of the kinetic coefficients. This general principle, due to L.
Onsager, may be conveniently formulated, for use here and in §§26-28, as follows (see
SP 1, §120).

† It should be noticed that the equations curl E = 0, div ($\sigma$E) = 0 and the boundary conditions (21.5) thereon
are formally identical with the equations for the electrostatic field in a dielectric, the only difference being that $\varepsilon$ is
replaced by $\sigma$. This enables us to solve problems of the current distribution in an infinite conductor if the solutions
to the corresponding electrostatic problems are known. When the conductor is bounded by a non-conductor this
analogy does not serve, because in electrostatics there is no medium for which $\varepsilon = 0$.

‡ In Russian “Joule and Lenz's law”.
Let $x_1, x_2, \ldots$ be some quantities which characterize the state of the body at every point.
We define also the quantities

$X_a = -\partial S/\partial x_a,$ (21.10)

where S is the entropy of unit volume of the body, and the derivative is taken at constant
energy of the volume. In a state close to equilibrium, the quantities $x_a$ are close to their
equilibrium values, and the $X_a$ are small. Processes will occur in the body which tend to
bring it into equilibrium. The rates of change of the quantities $x_a$ at each point are usually
functions only of the values of the $x_a$ (or $X_a$) at that point. Expanding these functions in
powers of $X_a$ and taking only the linear terms, we have

$\partial x_a/\partial t = -\sum_b \gamma_{ab}X_b.$ (21.11)

Then we can assert that the coefficients $\gamma_{ab}$ (the kinetic coefficients) are symmetrical with
respect to the suffixes a and b:

$\gamma_{ab} = \gamma_{ba}.$ (21.12)

In order to make practical use of this principle, it is necessary to choose the quantities $x_a$
(or their derivatives $\partial x_a/\partial t$) in some manner, and then to determine the $X_a$. This can usually be
done very simply by means of the formula for the rate of change of the total entropy of the
body:

$d\mathscr{S}/dt = -\int \sum_a X_a\,(\partial x_a/\partial t)\,dV,$ (21.13)

where the integration is extended over the whole volume of the body.

When a current flows in a conductor, $d\mathscr{S}/dt$ is given by (21.7). Comparing this with
(21.13) we see that, if the components of the current density vector j are taken as the
quantities $\partial x_a/\partial t$, then the quantities $X_a$ will be the components of the vector $-\mathbf{E}/T$. A
comparison of formulae (21.8) and (21.11) shows that the kinetic coefficients in this case are
the components of the conductivity tensor, multiplied by T. Thus the symmetry of this
tensor follows immediately from the general relation (21.12).


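Problem 1. A system of electrodes maintained at constant potentials $\phi_a$ is immersed in a conducting
medium. A current $J_a$ flows from each electrode. Determine the total amount of Joule heat evolved in the
medium per unit time.

Solution. The required amount of heat Q is given by the integral

$Q = \int \mathbf{j}\cdot\mathbf{E}\,dV = -\int \mathbf{j}\cdot\mathrm{grad}\,\phi\,dV = -\int \mathrm{div}(\phi\mathbf{j})\,dV.$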


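Problem 2. Determine the potential distribution in a conducting sphere with a current J entering at a point
O and leaving at the point O' diametrically opposite to O.

Solution. Near O and O' (Fig. 15) the potential must be of the forms $\phi = J/2\pi\sigma R_1$ and $\phi = -J/2\pi\sigma R_2$
respectively, $R_1$ and $R_2$ being the distances from O and O'. These functions satisfy Laplace's equation, and the
integrals $-\sigma\int \mathrm{grad}\,\phi\cdot d\mathbf{f}$ over infinitesimal caps about O and O' are equal to $\pm J$. We seek the potential at an
arbitrary point P in the sphere in the form

$\phi = \frac{J}{2\pi\sigma}\left(\frac{1}{R_1} - \frac{1}{R_2}\right) + \psi,$

[Fig. 15]

where $\psi$ is a solution of Laplace's equation having no poles in or on the sphere. It is evident from symmetry that $\psi$,
like $\phi$, is a function of the spherical polar coordinates r and $\theta$ only.

On the surface of the sphere (r = a) we must have $\partial\phi/\partial r = 0$. Differentiating, we find the boundary condition

$\frac{\partial\psi}{\partial r} = \frac{J}{4\pi\sigma a}\left(\frac{1}{R_1} - \frac{1}{R_2}\right)$ for r = a.

If $f(r, \theta)$ is any solution of Laplace's equation which vanishes at r = 0, then the function

$\int_0^r \frac{f(r, \theta)}{r}\,dr$

is also a solution.† Comparing this with the boundary condition, we see that the latter is satisfied by

$\psi = \frac{J}{4\pi\sigma}\int_0^r \left(\frac{1}{R_1} - \frac{1}{R_2}\right)\frac{dr}{r}.$

Substituting $R_{1,2} = \sqrt{a^2 + r^2 \mp 2ar\cos\theta}$ and effecting the integration, we have finally

$\phi = \frac{J}{2\pi\sigma}\left\{\frac{1}{R_1} - \frac{1}{R_2} + \frac{1}{2a}\left(\sinh^{-1}\frac{a + r\cos\theta}{r\sin\theta} - \sinh^{-1}\frac{a - r\cos\theta}{r\sin\theta}\right)\right\},$

with $\phi = 0$ when r = 0.

† This is easily seen either by direct calculation or from the fact that any solution $f(r, \theta)$ of Laplace's equation
depending only on r and $\theta$ can be written $f = \sum_n c_n r^n P_n(\cos\theta)$, where the $c_n$ are constants and the $P_n$ are Legendre
polynomials.

Problem 3. Show that the current distribution in a conductor is such that the energy dissipated is a
minimum.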


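Solution. The minimum concerned is that of the integral $\int \mathbf{j}\cdot\mathbf{E}\,dV = \int (j^2/\sigma)\,dV$, with the subsidiary
condition div j = 0 (conservation of charge). Varying with respect to j the integral $\int [j^2/\sigma - 2\phi\,\mathrm{div}\,\mathbf{j}]\,dV$, where
$2\phi$ is an undetermined Lagrange multiplier, and equating the result to zero, we obtain the equation
$\mathbf{j} = -\sigma\,\mathrm{grad}\,\phi$ or curl $(\mathbf{j}/\sigma) = 0$, which is the same as (21.2) and (21.3).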
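The minimum-dissipation statement has a simple lumped-circuit analogue, sketched below under stated assumptions: a total current J divides between two parallel branches of resistances R1 and R2 (hypothetical values), and the division that minimizes the Joule heat $I_1^2R_1 + I_2^2R_2$ under the constraint $I_1 + I_2 = J$ is compared with the division given by Ohm's law. This is only an illustration of the principle, not the continuum calculation itself.

```python
def dissipation(I1, J, R1, R2):
    """Joule heat per unit time for branch currents I1 and J - I1."""
    I2 = J - I1
    return I1 ** 2 * R1 + I2 ** 2 * R2

def minimise_by_scan(J, R1, R2, steps=3000):
    """Brute-force search over I1 in [0, J] for the least dissipation."""
    best_k = min(range(steps + 1),
                 key=lambda k: dissipation(J * k / steps, J, R1, R2))
    return J * best_k / steps

# Hypothetical total current and branch resistances (arbitrary units).
J, R1, R2 = 3.0, 1.0, 2.0

I1_min = minimise_by_scan(J, R1, R2)

# Division given by Ohm's law: equal potential drop across both branches.
I1_ohm = J * R2 / (R1 + R2)
```

The constraint $I_1 + I_2 = J$ plays the part of div j = 0, and the equality of the potential drops across the two branches corresponds to curl $(\mathbf{j}/\sigma) = 0$.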


§22. The Hall effect 

If a conductor is in an external magnetic field H, the relation between the current density
and the electric field is again given by $j_i = \sigma_{ik}E_k$, but the components of the conductivity
tensor $\sigma_{ik}$ are functions of H and, what is particularly important, they are no longer
symmetrical with respect to the suffixes i and k. The symmetry of this tensor was proved in
§21 from the symmetry of the kinetic coefficients. In a magnetic field, however, this
principle must be formulated somewhat differently: when the suffixes are interchanged, the
direction of the magnetic field must be reversed (see SP 1, §120). Hence we now have for
the components $\sigma_{ik}(\mathbf{H})$ the relations

$\sigma_{ik}(\mathbf{H}) = \sigma_{ki}(-\mathbf{H}).$ (22.1)

The quantities $\sigma_{ik}(\mathbf{H})$ and $\sigma_{ki}(\mathbf{H})$ are not equal.

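Like any tensor of rank two, $\sigma_{ik}$ can be divided into symmetrical and antisymmetrical
parts, which we denote by $s_{ik}$ and $a_{ik}$:

$\sigma_{ik} = s_{ik} + a_{ik}.$ (22.2)

By definition

$s_{ik}(\mathbf{H}) = s_{ki}(\mathbf{H}), \quad a_{ik}(\mathbf{H}) = -a_{ki}(\mathbf{H}),$ (22.3)

and from (22.1) it follows that

$s_{ik}(\mathbf{H}) = s_{ki}(-\mathbf{H}) = s_{ik}(-\mathbf{H}),$

$a_{ik}(\mathbf{H}) = a_{ki}(-\mathbf{H}) = -a_{ik}(-\mathbf{H}).$ (22.4)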


Thus the components of the tensor $s_{ik}$ are even functions of the magnetic field, and those of
$a_{ik}$ are odd functions.

Any antisymmetrical tensor of rank two corresponds to some axial vector, whose
components are

$a_x = a_{yz}, \quad a_y = -a_{xz}, \quad a_z = a_{xy}.$ (22.5)

In terms of this vector, the components of the product $a_{ik}E_k$ can be written as those of the
vector product E × a:

$j_i = \sigma_{ik}E_k = s_{ik}E_k + (\mathbf{E}\times\mathbf{a})_i.$ (22.6)

The Joule heat generated by the passage of the current is given by the product j·E. Since
the vectors E × a and E are perpendicular, their scalar product is zero identically, and so

$\mathbf{j}\cdot\mathbf{E} = s_{ik}E_iE_k,$ (22.7)

i.e. the Joule heat is determined (for a given field E) only by the symmetrical part of the
conductivity tensor.

If the external magnetic field is sufficiently weak, the components of the conductivity
tensor may be expanded in powers of that field. Since the function a(H) is odd, the
expansion of this vector will involve only odd powers. The first terms are linear in the field,
i.e. they are of the form

$a_i = \alpha_{ik}H_k.$ (22.8)

The vectors a and H are both axial, and the constants $\alpha_{ik}$ therefore form an ordinary (polar)
tensor. The expansion of the even functions $s_{ik}(\mathbf{H})$ will involve only even powers. The first
term is the conductivity $\sigma_{ik}^{(0)}$ in the absence of the field, and the next terms are quadratic in
the field:

$s_{ik} = \sigma_{ik}^{(0)} + \beta_{iklm}H_lH_m.$ (22.9)

The tensor $\beta_{iklm}$ is symmetrical with respect to i, k and l, m.

Thus the principal effect of the magnetic field is linear in the field and is given by the term
E × a; it is called the Hall effect. As we see, it gives rise to a current perpendicular to the
electric field, whose magnitude is proportional to the magnetic field. It should be borne in
mind, however, that, for an arbitrary anisotropic medium, the Hall current is not the only
current perpendicular to E; the current $s_{ik}E_k$ also has a component in such a direction.
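The statement that the Hall term E × a contributes nothing to the Joule heat, so that j·E depends only on the symmetrical part of the conductivity tensor, can be checked numerically. The sketch below uses a hypothetical symmetrical tensor `s`, axial vector `a` and field `E`; none of these values comes from the text.

```python
def cross(u, v):
    """Vector product u x v of two 3-vectors."""
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]

def dot(u, v):
    """Scalar product of two 3-vectors."""
    return sum(p * q for p, q in zip(u, v))

def matvec(m, v):
    """Product m_ik v_k of a 3x3 matrix with a 3-vector."""
    return [dot(row, v) for row in m]

# Hypothetical data: symmetrical part s_ik, axial vector a, field E.
s = [[2.0, 0.3, 0.0],
     [0.3, 1.5, 0.1],
     [0.0, 0.1, 1.0]]
a = [0.0, 0.0, 0.4]
E = [1.0, -2.0, 0.5]

j_sym = matvec(s, E)          # s_ik E_k
j_hall = cross(E, a)          # (E x a)_i, the Hall current
j = [p + q for p, q in zip(j_sym, j_hall)]

joule_total = dot(j, E)       # j . E
joule_sym = dot(j_sym, E)     # s_ik E_i E_k
hall_work = dot(j_hall, E)    # vanishes identically
```

Note that `j_sym` itself is generally not parallel to E when `s` is anisotropic, which is the point made at the end of the paragraph above.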

The Hall effect may be differently regarded if we use the inverse formulae which express
E in terms of the current density: $E_i = \sigma^{-1}_{ik}j_k$. The inverse tensor $\sigma^{-1}_{ik}$, like $\sigma_{ik}$ itself, can
be resolved into a symmetrical part $\rho_{ik}$ and an antisymmetrical part which may be
represented by an axial vector b:

$E_i = \rho_{ik}j_k + (\mathbf{j}\times\mathbf{b})_i.$ (22.10)

The tensor $\rho_{ik}$ and the vector b have the same properties as $s_{ik}$ and a. In particular, in weak
magnetic fields the vector b is linear in the field. In formula (22.10) the Hall effect is
represented by the term j × b, i.e. by an electric field perpendicular to the current and
proportional to the magnetic field and to the current j.

The above relations are much simplified if the conductor is isotropic. The vectors a and b
must then be parallel to the magnetic field, by symmetry. The only non-zero components
of the tensor $\rho_{ik}$ are $\rho_{xx} = \rho_{yy}$ and $\rho_{zz}$, the field being in the z-direction. Denoting these two
quantities by $\rho_\perp$ and $\rho_\parallel$, and taking the current to lie in the xz-plane, we have

$E_x = \rho_\perp j_x, \quad E_y = -bj_x, \quad E_z = \rho_\parallel j_z.$ (22.11)

Hence we see that, in an isotropic conductor, the Hall field is the only electric field which is
perpendicular to both the current and the magnetic field.

In weak magnetic fields, the vectors b and H are related (in an isotropic body) simply by

$\mathbf{b} = -R\mathbf{H}.$ (22.12)

The constant R (called the Hall constant) may be either positive or negative. The form of
the terms quadratic in H in the relation between E and j, which enter through the tensor $\rho_{ik}$,
is easily seen from the fact that the only vectors linear in j and quadratic in H which can be
constructed from j and H are (j·H)H and $H^2\mathbf{j}$. Hence the general form of the relation
between E and j in an isotropic body, as far as the terms quadratic in H, is

$\mathbf{E} = \rho^{(0)}\mathbf{j} + R\,\mathbf{H}\times\mathbf{j} + \beta_1H^2\mathbf{j} + \beta_2(\mathbf{j}\cdot\mathbf{H})\mathbf{H}.$ (22.13)


PROBLEM 
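from their form in this particular case. The determinant $|\sigma|$ is

$|\sigma| = \begin{vmatrix} s_{xx} & a_z & -a_y \\ -a_z & s_{yy} & a_x \\ a_y & -a_x & s_{zz} \end{vmatrix} = s_{xx}s_{yy}s_{zz} + s_{xx}a_x^2 + s_{yy}a_y^2 + s_{zz}a_z^2.$

From the minors of this determinant we find the components of the inverse tensor, for example

$\sigma^{-1}_{xx} = (s_{yy}s_{zz} + a_x^2)/|\sigma|, \quad \sigma^{-1}_{xy} = (a_xa_y - s_{zz}a_z)/|\sigma|.$

The general expressions which give these results for the particular system of coordinates chosen are

$\rho_{ik} = \{s^{-1}_{ik}|s| + a_ia_k\}/|\sigma|, \quad b_i = -s_{ik}a_k/|\sigma|.$

This completes the solution.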
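As a hedged numerical cross-check of the inversion, the sketch below builds $\sigma_{ik} = s_{ik} + a_{ik}$ from a hypothetical symmetrical tensor and axial vector, inverts it directly, and compares the result with the closed forms $\rho_{ik} = \{s^{-1}_{ik}|s| + a_ia_k\}/|\sigma|$ and $b_i = -s_{ik}a_k/|\sigma|$. The expression for $b_i$ and the relation $|\sigma| = |s| + s_{ik}a_ia_k$ are taken here as assumptions to be tested, with the conventions $a_{ik}E_k = (\mathbf{E}\times\mathbf{a})_i$ of (22.6) and $E_i = \rho_{ik}j_k + (\mathbf{j}\times\mathbf{b})_i$ of (22.10).

```python
def det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def inverse3(m):
    """Inverse of a 3x3 matrix from its cofactors."""
    d = det3(m)
    inv = [[0.0] * 3 for _ in range(3)]
    for i in range(3):
        for k in range(3):
            rows = [r for r in range(3) if r != k]   # delete row k
            cols = [c for c in range(3) if c != i]   # delete column i
            minor = (m[rows[0]][cols[0]] * m[rows[1]][cols[1]]
                     - m[rows[0]][cols[1]] * m[rows[1]][cols[0]])
            inv[i][k] = (-1) ** (i + k) * minor / d  # transposed cofactor
    return inv

def dual(v):
    """Antisymmetrical tensor t_ik such that t_ik u_k = (u x v)_i."""
    x, y, z = v
    return [[0.0, z, -y], [-z, 0.0, x], [y, -x, 0.0]]

# Hypothetical symmetrical part and axial vector (arbitrary units).
s = [[2.0, 0.3, 0.1], [0.3, 1.5, 0.2], [0.1, 0.2, 1.0]]
a = [0.4, -0.2, 0.5]

A = dual(a)
sigma = [[s[i][k] + A[i][k] for k in range(3)] for i in range(3)]
det_s, det_sigma = det3(s), det3(sigma)
s_inv = inverse3(s)

# Closed-form symmetrical part rho_ik and axial vector b of the inverse.
rho = [[(s_inv[i][k] * det_s + a[i] * a[k]) / det_sigma for k in range(3)]
       for i in range(3)]
b = [-sum(s[i][k] * a[k] for k in range(3)) / det_sigma for i in range(3)]
B = dual(b)

direct = inverse3(sigma)
max_error = max(abs(direct[i][k] - (rho[i][k] + B[i][k]))
                for i in range(3) for k in range(3))
det_check = det_s + sum(s[i][k] * a[i] * a[k]
                        for i in range(3) for k in range(3))
```

A non-diagonal `s` is used deliberately, so that the check does not rely on the special coordinate system in which the formulae are derived.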


§23. The contact potential 

In order to remove a charged particle through the surface of a conductor, work must be
done. The work required for a thermodynamically reversible removal of the particle is
called the work function. This quantity is always positive; this follows immediately from
the fact that a point charge is attracted to any neutral body, and therefore to any neutral
conductor (see §14). The work in question will be denoted by eW, where e is the charge on the
particle; the sign of the work potential W thus defined is the same as that of the charge on the
particle.

The work function depends both on the nature of the conductor (and its thermodynamic
state, i.e. its temperature and density) and on that of the charged particle. For example, the
work function for a given metal is different for the removal of a conduction electron and
for the removal of an ion from the surface. It must also be emphasized that the work
function is characteristic of the surface of the conductor. It therefore depends, for instance,
on the treatment of the surface and the “contamination” of it. If the conductor is a single
crystal, then the work function is different for different faces.

To ascertain the physical nature of the dependence of the work function on the
properties of the surface, let us establish its relation to the electrical structure of the surface
layer. If $\rho(x)$ is the charge density, not averaged over physically infinitesimal segments of the
x-axis (perpendicular to the layer), we can write Poisson's equation in the layer as
$d^2\phi/dx^2 = -4\pi\rho$. Let the conductor occupy the region x < 0. Then a first integration gives

$\frac{d\phi}{dx} = -4\pi\int_{-\infty}^{x}\rho\,dx,$

and a second integration (by parts) gives

$\phi - \phi(-\infty) = -4\pi x\int_{-\infty}^{x}\rho\,dx + 4\pi\int_{-\infty}^{x}x\rho\,dx.$

As $x \to \infty$, the integral

$\int_{-\infty}^{x}\rho\,dx$

tends very rapidly to zero (since the surface of an uncharged conductor is electrically
neutral). Hence

$\phi(+\infty) - \phi(-\infty) = 4\pi\int_{-\infty}^{\infty}x\rho\,dx.$


The integral on the right is the dipole moment of the charges near the surface of the body. 
These charges form a “double layer”, in which charges of opposite sign are separated and 
the dipole moment is non-zero. The structure of the double layer, of course, depends on the 
properties of the surface (its crystallographic direction, contamination, etc.). The difference 
in the work potential for different surfaces of a given conductor is determined by the 
difference in the dipole moments. 

If two different conductors are placed in contact, an exchange of charged particles may 
occur between them. Charges pass from the body with the smaller work function to that 
with the greater until a potential difference between them is set up which prevents further 
movement of charge. This is called a contact potential. 

Fig. 16 shows a cross-section of two conductors in contact (a and b) near their surfaces
AO and OB. Let the potentials of these surfaces be $\phi_a$ and $\phi_b$ respectively. Then the contact
potential is $\phi_{ab} = \phi_b - \phi_a$. The quantitative relation between this potential and the work
functions is given by the condition of thermodynamic equilibrium. Let us consider the
work which must be done on a particle with charge e to remove it from the conductor a
through the surface AO, transfer it to the surface OB, and finally carry it into the conductor
b. In a state of thermodynamic equilibrium, this work must be zero.† The work done on the
particle in the three stages mentioned is $eW_a$, $e(\phi_b - \phi_a)$, and $-eW_b$ respectively. Putting
the sum of these equal to zero, we find the required relation:

$\phi_{ab} = W_b - W_a.$ (23.1)

Thus the contact potential of the neighbouring free surfaces of two conductors in contact is 
equal to the difference in their work functions. 

† Of course, in reality a particle can pass from one conductor to another only through their surface of contact,
and not through the space adjoining them, but the work done is independent of the path.

The existence of the contact potential results in the appearance of an electric field in the
space outside the conductors. It is easy to determine this field near the line of contact of the
surfaces. In a small region near this line (the point O in Fig. 16), the surfaces may be
regarded as plane. The field potential outside the conductors satisfies the equation

$\frac{1}{r}\frac{\partial}{\partial r}\left(r\frac{\partial\phi}{\partial r}\right) + \frac{1}{r^2}\frac{\partial^2\phi}{\partial\theta^2} = 0,$

where r and $\theta$ are polar coordinates with origin at O; on AO and OB the potential takes
given constant values. We are interested in the solution which contains the lowest power of
r; this is the leading term in an expansion of the potential in powers of the small distance r.
The solution concerned is $\phi$ = constant × $\theta$. Measuring the angle $\theta$ from AO and
arbitrarily taking the potential on AO as zero, we have

$\phi = \phi_{ab}\theta/\alpha,$

where $\alpha$ is the angle AOB. Thus the equipotential lines in the plane of the diagram are
straight lines diverging from O. The lines of force are arcs of circles centred at O. The field is

$E = -\frac{1}{r}\frac{\partial\phi}{\partial\theta} = -\frac{\phi_{ab}}{\alpha r};$ (23.3)

it decreases inversely as the distance from O.

As has been said above, “contact” potentials also exist between the various faces of a
single crystal of metal. Hence an electric field of the kind just described must exist near the
edges of the crystal.

If several metallic conductors (at equal temperatures) are connected together, the
potential between the extreme conductors is, as we easily deduce from formula (23.1),
simply the difference of their work potentials, as it is for two conductors in direct contact.
In particular, if the metal at each end is the same, the contact potential between the ends is
zero. This is evident, however, because if there were a potential difference between two like
conductors, a current would flow when they were connected, in contradiction to the second
law of thermodynamics.
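The telescoping of contact potentials along a chain of metals follows directly from (23.1), and can be sketched as below. The work potentials are hypothetical numbers (in volts) chosen for illustration, not data for any particular metals.

```python
def contact_potential(W_a, W_b):
    """Contact potential phi_ab = W_b - W_a between conductors a and b, cf. (23.1)."""
    return W_b - W_a

def chain_potential(work_potentials):
    """Sum of the contact potentials along a chain of conductors in series."""
    pairs = zip(work_potentials, work_potentials[1:])
    return sum(contact_potential(W_a, W_b) for W_a, W_b in pairs)

# Hypothetical work potentials (volts) of metals connected in series.
open_chain = [4.3, 4.7, 5.1]
same_ends = [4.3, 4.7, 5.1, 4.3]

# Depends only on the two extreme conductors.
total_open = chain_potential(open_chain)
direct = contact_potential(open_chain[0], open_chain[-1])

# Vanishes when the metal at each end is the same.
total_same_ends = chain_potential(same_ends)
```

The intermediate work potentials cancel in pairs, which is why only the extreme conductors matter.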


§24. The galvanic cell 

The statement at the end of §23 ceases to be valid if the circuit includes conductors in
which the current is carried by different means (e.g. metals and solutions of electrolytes).
Because the work function of a conductor is different for different charged particles
(electrons and ions), the total contact potential in the circuit is not zero even when the
conductors at each end are similar. This total potential difference is called the electromotive
force or e.m.f. in the circuit; it is just the potential difference between the two like
conductors before the circuit is closed. When the circuit is closed, a current flows in it; this
is the basis of the operation of what are called galvanic cells. The energy which maintains the
current in the circuit is supplied by chemical transformations occurring in the cell.

When we go completely round any closed circuit the field potential must, of course,
return to its original value, i.e. the total change in the potential must be zero. Let us
consider, for example, a contour on the surface of the conductors. When we pass from one
conductor to another, the potential has a discontinuity $\phi_{ab}$. The potential drop across any
conductor is RJ, where J is the total current flowing through it and R is its resistance.
Hence the total change in the potential round the circuit is $\sum\phi_{ab} - \sum JR$. Putting this equal
to zero and using the facts that J is the same at every point in the circuit and $\sum\phi_{ab}$ is the
electromotive force $\mathscr{E}$, we find

$J\sum R = \mathscr{E},$ (24.1)


so that the current in a circuit containing a galvanic cell is equal to the e.m.f. divided by the 
total resistance of all the conductors in the circuit (including, of course, the internal 
resistance of the cell itself). 

Although the e.m.f. of a galvanic cell can be expressed as a sum of contact potentials, it is
very important to note that it is in reality a thermodynamic quantity, determined entirely
by the states of the conductors and independent of the properties of the surfaces separating
them. This is clear, because $\mathscr{E}$ is just the work per unit charge which must be done on a
charged particle when it is carried reversibly along the closed circuit.

To illustrate this, let us consider a galvanic cell consisting of two electrodes of metals A
and B immersed in solutions of electrolytes AX and BX, $X^-$ being any anion. Let $\zeta_A$ and $\zeta_B$
be the chemical potentials of the metals A and B, and $\zeta_{AX}$ and $\zeta_{BX}$ those of the electrolytes in
solution.† If an elementary charge e is carried along the closed circuit, an ion $A^+$ passes into
solution from the electrode A and an ion $B^+$ passes out of solution to the electrode B, the
change in the charges on the electrodes being compensated by the passage of an electron
from A to B through the external circuit. The result is that the electrode A loses one neutral
atom, the electrode B gains one, and in the electrolyte solution one molecule of BX is
replaced by one of AX. Since the work done in a reversible process (at constant
temperature and pressure) is equal to the change in the thermodynamic potential of the
system, we have

$e\mathscr{E}_{AB} = (\zeta_B - \zeta_{BX}) - (\zeta_A - \zeta_{AX}),$ (24.2)

which expresses the e.m.f. of the cell in terms of the properties of the material of the 
electrodes and of the electrolyte solution. 

From (24.2) we can also draw the following conclusion. If the solution contains three 
electrolytes AX, BX, CX and three metallic electrodes A, B, C, then the e.m.f.s between 
each pair of them are related by 

+<^BC = <^AC- (24.3) 

Using the general formulae of thermodynamics, we can relate the e.m.f. of a galvanic cell
to the heat evolved when a current flows, which of course is actually an irreversible
phenomenon. Let Q be the amount of heat generated (both in the cell itself and in the
external circuit) when the unit charge passes along the circuit; Q is just the heat of the
reaction which occurs in the cell when a current flows. By a well-known formula of
thermodynamics (see SP 1, §91), it is related to the work $\mathscr{E}$ by

$Q = -T^2\frac{\partial}{\partial T}\left(\frac{\mathscr{E}}{T}\right).$ (24.4)

The definition of the partial derivative with respect to temperature depends on the
conditions under which the process occurs. For example, if the current flows at constant
pressure (as usually happens), then the differentiation is effected at constant pressure.
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§25. Electrocapillarity 


The presence of charges on the boundary between two conducting media affects the
surface tension there. This phenomenon is called electrocapillarity. In practice, the media
concerned are both liquids; usually one is a liquid metal (mercury) and the other is a
solution of an electrolyte.

Let $\phi_1$, $\phi_2$ be the potentials of the two conductors, and $e_1$, $e_2$ the charges at the surface of
separation. These charges are equal in magnitude and opposite in sign, and thus form a
double layer on the surface.

The differential of the potential $\tilde{\Phi}$ of a system of two conductors at given temperature
and pressure is, taking into account the surface of separation,

$d\tilde{\Phi} = \alpha\,dS - e_1\,d\phi_1 - e_2\,d\phi_2,$ (25.1)

where the term $\alpha\,dS$ is the work done in a reversible change dS in the area S of the surface of
separation; $\alpha$ is the surface-tension coefficient (see SP 1, §154).

The thermodynamic potential $\tilde{\Phi}$ in (25.1) may be replaced by its “surface part” $\tilde{\Phi}_s$, since
the volume part is constant for given temperature and pressure, and is therefore of no
interest here. Putting $e_1 = -e_2 \equiv e$ and the potential difference $\phi_1 - \phi_2 \equiv \phi$, we can write
(25.1) as

$d\tilde{\Phi}_s = \alpha\,dS - e\,d\phi.$ (25.2)

Hence

$(\partial\tilde{\Phi}_s/\partial S)_\phi = \alpha,$ (25.3)

$\alpha$ being expressed as a function of $\phi$. Integrating, we find that $\tilde{\Phi}_s = \alpha S$. Substitution in
(25.2) gives $d(\alpha S) = \alpha\,dS - e\,d\phi$, or $S\,d\alpha = -e\,d\phi$, whence

$\sigma = -(\partial\alpha/\partial\phi)_{P,T},$ (25.4)

where $\sigma = e/S$ is the charge per unit area of the surface. The relation (25.4), derived by G.
Lippmann and J. W. Gibbs, is the fundamental formula in the theory of electrocapillarity.
In a state of equilibrium, the thermodynamic potential $\tilde{\Phi}$ must be a minimum for given
values of the electric potentials on the conductors. Regarding it as a function of the surface
charges e, we can write the necessary conditions for a minimum as

$\partial\tilde{\Phi}_s/\partial e = 0, \quad \partial^2\tilde{\Phi}_s/\partial e^2 > 0,$ (25.5)

where the derivatives are taken at constant area S. To calculate these, we express $\tilde{\Phi}_s$ in
terms of the thermodynamic potential $\Phi_s = \tilde{\Phi}_s + e\phi$.

The vanishing of the first derivative gives

$\frac{\partial\tilde{\Phi}_s}{\partial e} = \frac{\partial\Phi_s}{\partial e} - \phi = 0,$

and then the condition for the second derivative to be positive becomes

$\partial\sigma/\partial\phi > 0.$ (25.7)

This result was to be expected, since the double layer on the surface may be regarded as a
capacitor with capacitance $de/d\phi$.

Differentiating equation (25.4) with respect to $\phi$ and using (25.7), we find that

$\partial^2\alpha/\partial\phi^2 < 0.$ (25.8)

This means that the point where $\partial\alpha/\partial\phi = -\sigma = 0$ is a maximum of $\alpha$ as a function of $\phi$.
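The relation between surface tension and surface charge can be illustrated with a toy model, offered only as a sketch: the double layer is assumed to behave as a capacitor with a constant capacitance C per unit area, so that $\sigma = C(\phi - \phi_0)$ and $\alpha = \alpha_0 - \tfrac{1}{2}C(\phi - \phi_0)^2$. The constants `ALPHA0`, `C` and `PHI0` are hypothetical.

```python
# Hypothetical model constants (arbitrary units): maximum surface tension,
# capacitance per unit area, and potential difference of zero charge.
ALPHA0, C, PHI0 = 0.4, 0.2, 0.5

def surface_tension(phi):
    """Model surface tension: alpha = alpha0 - C (phi - phi0)^2 / 2."""
    return ALPHA0 - 0.5 * C * (phi - PHI0) ** 2

def surface_charge(phi):
    """Model charge per unit area: sigma = C (phi - phi0)."""
    return C * (phi - PHI0)

def derivative(f, x, h=1e-5):
    """Central-difference estimate of df/dx."""
    return (f(x + h) - f(x - h)) / (2 * h)

phi = 0.8

# Lippmann-Gibbs relation: sigma = -d(alpha)/d(phi).
lippmann_lhs = surface_charge(phi)
lippmann_rhs = -derivative(surface_tension, phi)

# Stability: d(sigma)/d(phi) > 0, hence alpha has a maximum where sigma = 0.
capacitance = derivative(surface_charge, phi)
alpha_at_zero_charge = surface_tension(PHI0)
```

In this model the electrocapillary curve $\alpha(\phi)$ is an inverted parabola, with its maximum at the potential difference for which the surface charge vanishes.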


§26. Thermoelectric phenomena 

The condition that there should be no current in a metal is that there is thermodynamic
equilibrium with respect to the conduction electrons. This means not only that the
temperature must be constant throughout the body, but also that the sum $e\phi + \zeta_0$ should
be constant, where $\zeta_0$ is the chemical potential of the conduction electrons in the metal (for
$\phi = 0$).† If the metal is not homogeneous, $\zeta_0$ is not constant throughout the body even if
the temperature is constant. Hence the constancy of the electric potential $\phi$ in this case does
not mean the absence of a current in the metal, although the field $\mathbf{E} = -\mathrm{grad}\,\phi$ is zero.
This makes the ordinary definition of $\phi$ (as the average of the true potential) inconvenient,
if we wish to take inhomogeneous conductors into consideration.

It is natural to redefine the potential as $\phi + \zeta_0/e$, and we shall write this henceforward as
$\phi$ simply.‡ In a homogeneous metal, the change amounts to the adding of an unimportant
constant to the potential. Accordingly, the “field” $\mathbf{E} = -\mathrm{grad}\,\phi$ (which we shall use
henceforward) is the same as the true mean field only in a homogeneous metal, and in
general the two differ by the gradient of some function of the state.§

With this definition, the current and field are both zero in a state of thermodynamic
equilibrium with respect to the conduction electrons, and the relation between them is
$\mathbf{j} = \sigma\mathbf{E}$ (or $j_i = \sigma_{ik}E_k$) even if the metal is not homogeneous.

Let us now consider a non-uniformly heated metal, which cannot be in thermodynamic
equilibrium (with respect to the electrons). Then the field E is not zero even if the current is
zero. In general, when both the current density j and the temperature gradient grad T are
not zero, the relation between these quantities and the field can be written

$\mathbf{E} = \mathbf{j}/\sigma + \alpha\,\mathrm{grad}\,T.$ (26.1)

Here $\sigma$ is the ordinary conductivity, and $\alpha$ is another quantity which is an electrical
characteristic of the metal. Here we suppose for simplicity that the substance is isotropic
(or of cubic symmetry), and therefore write the proportionality coefficients as scalars. The
linear relation between E and grad T is, of course, merely the first term of an expansion, but
is sufficient in view of the smallness of the temperature gradients occurring in practice.
The same formula (26.1), in the form

$\mathbf{j} = \sigma(\mathbf{E} - \alpha\,\mathrm{grad}\,T),$ (26.2)

shows that a current can flow in a non-uniformly heated metal even if the field E is zero.

† Here $\zeta_0$ is the chemical potential defined in the usual manner, viz. per unit particle (electron).

‡ This definition may also be formulated as follows: the new $e\phi$ is the change in the free energy when one
electron is isothermally brought into the metal. In other words, $\phi = \partial F/\partial\rho$, where F is the free energy of the metal
and $\rho$ the charge on the conduction electrons per unit volume.

§ It must be emphasized that eE is then not the force on the charge e. Consequently, this definition of E, which
is suitable in a phenomenological theory, may be inconvenient in the microscopic theory when calculating the
kinetic coefficients (cf. PK, §44).

As well as the electric current density j, we can consider the energy flux density q. First of
all, this quantity contains an amount $\phi\mathbf{j}$ resulting simply from the fact that each charged
particle (electron) carries with it an energy $e\phi$. The difference $\mathbf{q} - \phi\mathbf{j}$, however, does not
depend on the potential, and can be generally written as a linear function of the gradients
$\mathrm{grad}\,\phi = -\mathbf{E}$ and grad T, similarly to formula (26.2) for the current density. We shall for
the present write this as

$\mathbf{q} - \phi\mathbf{j} = \beta\mathbf{E} - \gamma\,\mathrm{grad}\,T.$

The symmetry of the kinetic coefficients gives a relation between the coefficient $\beta$ and the
coefficient $\alpha$ in (26.2). To derive this, we calculate the rate of change of the total entropy of
the conductor. The amount of heat evolved per unit time and volume is −div q. Hence we
can put

$\frac{d\mathscr{S}}{dt} = -\int\frac{\mathrm{div}\,\mathbf{q}}{T}\,dV.$

Using the equation div j = 0, we have

$\frac{1}{T}\mathrm{div}\,\mathbf{q} = \frac{1}{T}\{\mathrm{div}(\mathbf{q} - \phi\mathbf{j}) + \mathrm{div}(\phi\mathbf{j})\} = \frac{1}{T}\mathrm{div}(\mathbf{q} - \phi\mathbf{j}) - \frac{\mathbf{E}\cdot\mathbf{j}}{T}.$

The first term is integrated by parts, giving

$\frac{d\mathscr{S}}{dt} = \int\frac{\mathbf{E}\cdot\mathbf{j}}{T}\,dV - \int\frac{(\mathbf{q} - \phi\mathbf{j})\cdot\mathrm{grad}\,T}{T^2}\,dV.$ (26.3)

This formula shows that, if we take as the quantities $\partial x_a/\partial t$ (see §21) the components of
the vectors j and $\mathbf{q} - \phi\mathbf{j}$, then the corresponding quantities $X_a$ are the components of the
vectors $-\mathbf{E}/T$ and $\mathrm{grad}\,T/T^2$. Accordingly, in the relations

$\mathbf{j} = \sigma T\,\frac{\mathbf{E}}{T} - \sigma\alpha T^2\,\frac{\mathrm{grad}\,T}{T^2},$

$\mathbf{q} - \phi\mathbf{j} = \beta T\,\frac{\mathbf{E}}{T} - \gamma T^2\,\frac{\mathrm{grad}\,T}{T^2},$

the coefficients $\sigma\alpha T^2$ and $\beta T$ must be equal. Thus $\beta = \sigma\alpha T$, and $\mathbf{q} - \phi\mathbf{j} = \sigma\alpha T\mathbf{E}
- \gamma\,\mathrm{grad}\,T$. Finally, expressing E in terms of j and grad T by (26.1), we have the result

$\mathbf{q} = (\phi + \alpha T)\mathbf{j} - \kappa\,\mathrm{grad}\,T,$ (26.4)

where $\kappa = \gamma - T\alpha^2\sigma$ is simply the ordinary thermal conductivity, which gives the heat flux in
the absence of an electric current.

The condition that $d\mathscr{S}/dt$ must be positive places no new restriction on the
thermoelectric coefficients. Substituting (26.1) and (26.4) in (26.3), we obtain

$\frac{d\mathscr{S}}{dt} = \int\frac{j^2}{\sigma T}\,dV + \int\frac{\kappa(\mathrm{grad}\,T)^2}{T^2}\,dV > 0,$ (26.5)

whence we find only that the coefficients of thermal and electrical conductivity must be
positive.

In the above formulae it was tacitly assumed that an inhomogeneity of pressure (or
density) at constant temperature cannot cause a field (or current) to appear in the
conductor, and consequently no term in grad p was included in (26.2) or (26.4). The
existence of such terms would, in fact, contradict the law of the increase of entropy: the
integrand in (26.5) would then contain terms in the products j·grad p and grad T·grad p,
which could be of either sign, and so the integral could not be necessarily positive.
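The reduction of the entropy-production integrand of (26.3) to the manifestly positive form in (26.5) can be checked numerically in one dimension. The sketch below assumes the relations $E = j/\sigma + \alpha\,dT/dx$ from (26.1) and $q - \phi j = \alpha Tj - \kappa\,dT/dx$ from (26.4); the numerical values of the coefficients are hypothetical.

```python
def entropy_production(j, dT_dx, T, sigma, alpha, kappa):
    """Integrand of (26.3): E j / T - (q - phi j) (dT/dx) / T^2, in one dimension."""
    E = j / sigma + alpha * dT_dx                  # field, from (26.1)
    q_minus_phi_j = alpha * T * j - kappa * dT_dx  # from (26.4)
    return E * j / T - q_minus_phi_j * dT_dx / T ** 2

def entropy_production_reduced(j, dT_dx, T, sigma, kappa):
    """Integrand of (26.5): j^2 / (sigma T) + kappa (dT/dx)^2 / T^2."""
    return j ** 2 / (sigma * T) + kappa * dT_dx ** 2 / T ** 2

# Hypothetical coefficients and state (arbitrary units).
T, sigma, alpha, kappa = 300.0, 4.0, 0.01, 3.0

# A few combinations of current density and temperature gradient.
samples = [(2.0, -5.0), (-2.0, -5.0), (0.0, 7.0), (1.5, 0.0)]
full = [entropy_production(j, g, T, sigma, alpha, kappa) for j, g in samples]
reduced = [entropy_production_reduced(j, g, T, sigma, kappa) for j, g in samples]
```

The terms proportional to $\alpha$ cancel between the two contributions, which is why the thermoelectric coefficient is not restricted in sign by the second law.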

The relations (26.1) and (26.4) indicate various thermoelectric effects. Let us consider the
amount of heat −div q evolved per unit time and volume in the conductor. Taking the
divergence of (26.4), we have

$Q = -\mathrm{div}\,\mathbf{q} = \mathrm{div}(\kappa\,\mathrm{grad}\,T) + \mathbf{E}\cdot\mathbf{j} - \mathbf{j}\cdot\mathrm{grad}(\alpha T),$

or, substituting (26.1),

$Q = \mathrm{div}(\kappa\,\mathrm{grad}\,T) + \frac{j^2}{\sigma} - T\mathbf{j}\cdot\mathrm{grad}\,\alpha.$ (26.6)

The first term on the right pertains to ordinary thermal conduction, and the second term, 
proportional to the square of the current, is the Joule heat. The term of interest here is the 
third, which gives the thermoelectric effects. 

Let us assume the conductor to be homogeneous. Then the change in $\alpha$ is due only to the
temperature gradient, and $\operatorname{grad}\alpha = (d\alpha/dT)\operatorname{grad}T$; if, as usually happens, the pressure is
constant through the body, $d\alpha/dT$ must be taken as $(\partial\alpha/\partial T)_P$. Thus the amount of heat
evolved (called the Thomson effect) is

    $\rho\,\mathbf{j}\cdot\operatorname{grad}T$, where $\rho = -T\,d\alpha/dT$.    (26.7)

The coefficient $\rho$ is called the Thomson coefficient. It should be noticed that this effect is
proportional to the first power of the current, and not to the second power like the Joule
heat. It therefore changes sign when the current is reversed. The coefficient $\rho$ may be either
positive or negative. If $\rho > 0$, the Thomson heat is positive (i.e. heat is emitted) when the
current flows in the direction of increasing temperature, and heat is absorbed when it flows
in the opposite direction; if $\rho < 0$ the reverse is true.
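
As a minimal numerical sketch of (26.6) and (26.7) in one dimension, the following Python fragment evaluates the Joule term and the Thomson term for a homogeneous conductor. The function names and all numerical values are illustrative assumptions, not data from the text; the only point is that reversing the current changes the sign of the Thomson heat and leaves the Joule heat unchanged.

```python
def thomson_coefficient(T, dalpha_dT):
    """Thomson coefficient rho = -T * d(alpha)/dT, as in (26.7)."""
    return -T * dalpha_dT


def joule_and_thomson_heat(j, dT_dx, T, sigma, dalpha_dT):
    """Second and third terms of (26.6) for a one-dimensional current j.

    Returns (Joule heat, Thomson heat) per unit time and volume.
    """
    joule = j * j / sigma
    thomson = thomson_coefficient(T, dalpha_dT) * j * dT_dx
    return joule, thomson


# Made-up values in arbitrary consistent units; here rho > 0.
joule_fwd, thomson_fwd = joule_and_thomson_heat(2.0, 5.0, 300.0, 4.0, -1.0e-3)
joule_rev, thomson_rev = joule_and_thomson_heat(-2.0, 5.0, 300.0, 4.0, -1.0e-3)

print("forward current :", joule_fwd, thomson_fwd)
print("reversed current:", joule_rev, thomson_rev)
```

With these values the current flows towards increasing temperature and the Thomson coefficient is positive, so the forward Thomson term is positive (heat emitted), in agreement with the statement above.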

Another thermal effect, called the Peltier effect, occurs when a current passes through a 
junction of two different metals. At the surface of contact, the temperature, the potential 
and the normal components of the current density and energy flux density are all 
continuous. Denoting by the suffixes 1 and 2 the values of quantities for the two metals and 
equating the normal components of q (26.4) on the two sides, we have, since $\phi$, $T$ and $j_x$ are
continuous,

    $[-\kappa\,\partial T/\partial x]_1^2 = -j_x T(\alpha_2-\alpha_1),$

the x-axis being taken along the normal to the surface. If the positive direction of this axis
is from metal 1 to metal 2, then the expression on the left-hand side of this equation is the
amount of heat taken from the surface per unit time and area by thermal conduction.
This heat loss is balanced by the evolution at the junction of an amount of heat given
by the right-hand side of the equation. Thus the amount of heat generated per unit time
and area is

    $j\,\Pi_{12}$, where $\Pi_{12} = -T(\alpha_2-\alpha_1)$.    (26.8)

The quantity $\Pi_{12}$ is called the Peltier coefficient. Like the Thomson effect, the Peltier effect
is proportional to the first power of the current, and changes sign when the direction of the
current is reversed. The Peltier coefficient is additive: $\Pi_{13} = \Pi_{12} + \Pi_{23}$, where the suffixes
1, 2, 3 refer to three different metals.

A comparison of formulae (26.7) and (26.8) shows that the Thomson and Peltier
coefficients are related by

    $\rho_2 - \rho_1 = T\,\frac{d}{dT}\Big(\frac{\Pi_{12}}{T}\Big).$    (26.9)


Next, let us consider an open circuit containing two junctions, the two end conductors
being of the same metal (1 in Fig. 17). We suppose that the junctions b and c are at different
temperatures $T_1$ and $T_2$, while the temperature at each end (a and d) is the same. Then there
is a potential difference, called a thermoelectromotive force, which we denote by $\mathscr{E}_T$, between
the ends.

Fig. 17

To calculate this force, we put j = 0 in (26.1) and integrate the field E = $\alpha$ grad T along the
circuit (taken to be the x-axis):

    $\mathscr{E}_T = \int_a^d \alpha\,\frac{dT}{dx}\,dx = \int_a^d \alpha\,dT.$

The integrations from a to b and from c to d are over temperatures from $T_2$ to $T_1$ in metal 1,
and that from b to c is over temperatures from $T_1$ to $T_2$ in metal 2. Thus

    $\mathscr{E}_T = \int_{T_1}^{T_2} (\alpha_2-\alpha_1)\,dT.$    (26.10)

Comparing this with (26.8), we see that the thermo-e.m.f. is related to the Peltier coefficient
by

    $\mathscr{E}_T = -\int_{T_1}^{T_2} \frac{\Pi_{12}}{T}\,dT.$    (26.11)

Formulae (26.9) and (26.11) are called Thomson’s relations (W. Thomson, 1854).
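
The two Thomson relations can be checked numerically from the definitions $\rho = -T\,d\alpha/dT$ (26.7) and $\Pi_{12} = -T(\alpha_2-\alpha_1)$ (26.8). The sketch below does this in Python for two hypothetical thermoelectric coefficients; the functions `alpha_1`, `alpha_2` and their numerical constants are assumptions chosen only for illustration, and the derivative and integral are simple finite-difference and midpoint-rule approximations.

```python
def alpha_1(T):
    """Hypothetical thermoelectric coefficient of metal 1 (illustrative only)."""
    return 2.0e-3 * T


def alpha_2(T):
    """Hypothetical thermoelectric coefficient of metal 2 (illustrative only)."""
    return 5.0e-3 * T + 1.0e-5 * T * T


def derivative(f, T, h=1.0e-3):
    """Central finite-difference approximation to df/dT."""
    return (f(T + h) - f(T - h)) / (2.0 * h)


def integrate(f, a, b, n=2000):
    """Midpoint-rule approximation to the integral of f from a to b."""
    h = (b - a) / n
    return h * sum(f(a + (k + 0.5) * h) for k in range(n))


def thomson(alpha, T):
    """Thomson coefficient rho = -T d(alpha)/dT, cf. (26.7)."""
    return -T * derivative(alpha, T)


def peltier_12(T):
    """Peltier coefficient Pi_12 = -T (alpha_2 - alpha_1), cf. (26.8)."""
    return -T * (alpha_2(T) - alpha_1(T))


# First relation: rho_2 - rho_1 compared with T d(Pi_12 / T)/dT.
T = 300.0
lhs = thomson(alpha_2, T) - thomson(alpha_1, T)
rhs = T * derivative(lambda t: peltier_12(t) / t, T)

# Second relation: thermo-e.m.f. from (26.10) compared with (26.11).
T1, T2 = 280.0, 320.0
emf_direct = integrate(lambda t: alpha_2(t) - alpha_1(t), T1, T2)
emf_peltier = -integrate(lambda t: peltier_12(t) / t, T1, T2)

print(lhs, rhs)
print(emf_direct, emf_peltier)
```

Both pairs of printed numbers should agree to within the accuracy of the numerical differentiation and integration.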

To conclude this section, we shall give the formulae for the current and heat flux in an
anisotropic conductor. These are derived from the symmetry of the kinetic coefficients in
the same way as formulae (26.1) and (26.4), and the results are

    $E_i = \sigma^{-1}_{ik}\,j_k + \alpha_{ik}\,\partial T/\partial x_k,$
    $q_i - \phi j_i = T\alpha_{ki}\,j_k - \kappa_{ik}\,\partial T/\partial x_k.$    (26.12)

Here $\sigma^{-1}_{ik}$ is the tensor inverse to the conductivity tensor $\sigma_{ik}$, and the tensors $\sigma_{ik}$ and
$\kappa_{ik}$ are symmetrical. The thermoelectric tensor $\alpha_{ik}$, however, is in general not symmetrical.


§27. Thermogalvanomagnetic phenomena


There is a still greater variety of phenomena which occur when a current flows in the 
simultaneous presence of an electric field, a magnetic field, and a temperature gradient. 

The discussion is entirely similar to that given in §26 for thermoelectric effects. It will be 
given here in tensor form, so as to be applicable to both isotropic and anisotropic 
conductors. We write the electric current density j and the heat flux q as 


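    $j_i = a_{ik}E_k + b_{ik}\,\partial T/\partial x_k,$
    $q_i - \phi j_i = c_{ik}E_k + d_{ik}\,\partial T/\partial x_k,$    (27.1)

where all the coefficients are functions of the magnetic field. The symmetry of the kinetic
coefficients gives

    $a_{ik}(\mathbf{H}) = a_{ki}(-\mathbf{H}), \quad d_{ik}(\mathbf{H}) = d_{ki}(-\mathbf{H}), \quad T\,b_{ik}(\mathbf{H}) = -c_{ki}(-\mathbf{H}).$    (27.2)

Expressing E and $\mathbf{q}-\phi\mathbf{j}$ in terms of j and grad T from (27.1), we have

    $E_i = \sigma^{-1}_{ik}\,j_k + \alpha_{ik}\,\partial T/\partial x_k,$
    $q_i - \phi j_i = \beta_{ik}\,j_k - \kappa_{ik}\,\partial T/\partial x_k,$    (27.3)

where the tensors $\sigma^{-1}$, $\alpha$, $\beta$, $\kappa$ are certain functions of the tensors a, b, c, d, and have the
following symmetry properties resulting from (27.2):

    $\sigma^{-1}_{ik}(\mathbf{H}) = \sigma^{-1}_{ki}(-\mathbf{H}), \quad \kappa_{ik}(\mathbf{H}) = \kappa_{ki}(-\mathbf{H}), \quad \beta_{ik}(\mathbf{H}) = T\alpha_{ki}(-\mathbf{H}).$    (27.4)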


These are the required relations in their most general form. They generalize those found in 
§26 for the case where there is no magnetic field and in §22 for the case where there is no 
temperature gradient. It must be emphasized that in an anisotropic conductor the tensors
$\alpha_{ik}$ and $\beta_{ik}$ are in general not symmetrical even when there is no magnetic field.

The tensors $\sigma^{-1}$, $\kappa$, and $\beta + T\alpha$ can be resolved into symmetrical and antisymmetrical
parts (cf. §22). In a weak magnetic field, the symmetrical parts may be regarded as
constants independent of H, while the antisymmetrical parts are linear in H. For an isotropic
conductor we have, to this accuracy,

    $\mathbf{E} = \mathbf{j}/\sigma + \alpha\operatorname{grad}T + R\,\mathbf{H}\times\mathbf{j} + N\,\mathbf{H}\times\operatorname{grad}T,$    (27.5)

    $\mathbf{q}-\phi\mathbf{j} = \alpha T\,\mathbf{j} - \kappa\operatorname{grad}T + NT\,\mathbf{H}\times\mathbf{j} + L\,\mathbf{H}\times\operatorname{grad}T.$    (27.6)

Here $\sigma$ and $\kappa$ are the ordinary coefficients of electrical and thermal conductivity, $\alpha$ is the
thermoelectric coefficient which appears in (26.1), R is the Hall coefficient, and N and L are
new coefficients. The term $N\,\mathbf{H}\times\operatorname{grad}T$ may be regarded as representing the effect of the
magnetic field on the thermo-e.m.f. (called the Nernst effect), and the term $L\,\mathbf{H}\times\operatorname{grad}T$ as
representing the effect of this field on the thermal conduction (called the Leduc-Righi
effect).

At a boundary between media, the normal components of the vectors j and q are
continuous, and therefore so is that of the vector $-\kappa\operatorname{grad}T + \alpha T\,\mathbf{j} + NT\,\mathbf{H}\times\mathbf{j} + L\,\mathbf{H}\times\operatorname{grad}T$.
The term $NT\,\mathbf{H}\times\mathbf{j}$ gives the influence of the magnetic field on the Peltier
effect (called the Ettingshausen effect).

The amount of heat evolved in the conductor per unit time and volume is Q = - div q. 
Here we must substitute q from (27.6) and replace $-\operatorname{grad}\phi = \mathbf{E}$ in accordance with (27.5).
If the conductor is homogeneous, then the quantities $\alpha$, N, L, etc. are functions of
temperature alone, and so their gradients are proportional to grad T. In the calculation we
neglect all quantities of the second order in H, and to the same approximation we can take
$\operatorname{curl}(\mathbf{j}/\sigma) \cong \operatorname{curl}\mathbf{E} = 0$. We also note that the external field H (arising from sources
outside the conductor under consideration) is such that curl H = 0.† Finally, div j = 0, as
for any steady current. The result is

    $Q = \frac{j^{2}}{\sigma} + \operatorname{div}(\kappa\operatorname{grad}T) - T\,\mathbf{j}\cdot\operatorname{grad}\alpha + \frac{1}{T}\frac{d(NT^{2})}{dT}\,(\mathbf{j}\times\mathbf{H})\cdot\operatorname{grad}T.$

The third term here gives the Thomson effect (26.7), and the last term gives the change in 
the Thomson effect resulting from the presence of the magnetic field. 


§28. Diffusion phenomena 

The presence of diffusion causes certain phenomena in electrolyte solutions which do 
not occur in solid conductors. We shall assume, for simplicity, that the temperature is the 
same everywhere in the solution, and so consider only pure diffusion phenomena, 
uncomplicated by thermoelectric effects. 

Instead of the pressure P and the concentration c, it is more convenient to take as 
independent variables the pressure and the chemical potential $\zeta$. We here define $\zeta$ as the
derivative of the thermodynamic potential of unit mass of the solution with respect to its
concentration c (at constant P and T); by the concentration we mean the ratio of the mass
of electrolyte in a volume element to the total mass of fluid in the same volume.‡ It may be
recalled that the constancy of the chemical potential is (like that of the pressure and the 
temperature) one of the conditions of thermodynamic equilibrium. 

The definition of the electric field potential given in §26 has to be somewhat modified in
this case, since the current is now carried by the ions of the dissolved electrolyte, and not by
the conduction electrons. A suitable definition is (cf. the third footnote to §2 )
$\phi = (\partial\Phi/\partial\rho)_c$, where $\Phi$ is the thermodynamic potential and $\rho$ the sum of the ion charges in
unit volume of the solution (after differentiating we put $\rho = 0$, of course, because the
solution is electrically neutral). The derivative is taken at constant mass concentration, i.e.
at a given sum of the masses of ions of both signs in unit volume.†


† This neglects the weak effect on the evolution of heat resulting from the magnetic fields of the currents

‡ The chemical potentials are usually defined as $\zeta_1 = \partial\Phi/\partial n_1$, $\zeta_2 = \partial\Phi/\partial n_2$, where $\Phi$ is the
thermodynamic potential of any mass of the solution, and $n_1$, $n_2$ the numbers of particles of solute and
solvent in that solution. If $\Phi$ is the thermodynamic potential per unit mass, then $n_1 m_1 + n_2 m_2 = 1$
(where $m_1$, $m_2$ are the masses of the two kinds of particle), and the concentration $c = n_1 m_1$.
Hence we have

    $\zeta = \frac{\partial\Phi}{\partial c} = \frac{\partial\Phi}{\partial n_1}\frac{\partial n_1}{\partial c} + \frac{\partial\Phi}{\partial n_2}\frac{\partial n_2}{\partial c} = \frac{\zeta_1}{m_1} - \frac{\zeta_2}{m_2}$

for the chemical potential as here defined.

When a gradient of the chemical potential is present, a term proportional to it is added 
to the expression for the current density:

    $\mathbf{j} = \sigma(\mathbf{E} - \beta\operatorname{grad}\zeta),$    (28.1)

in analogy with the added term in (26.2). We shall see below that, for a given gradient of the
chemical potential (and of the temperature), j must be independent of the pressure
gradient, and so no term in grad P appears in (28.1).‡
As well as the electric current, we have to consider the transport of the mass of the 
electrolyte which takes place at the same time. It must be borne in mind that the passage of 
a current through the solution may be accompanied by a macroscopic motion of the fluid. 
The mass flux density of the electrolyte resulting from this motion is $\rho c\mathbf{v}$, where v is the
velocity and $\rho$ the density of the solution. The electrolyte is also transported by molecular
diffusion. We denote the diffusion flux density by i, so that the total flux density is $\rho c\mathbf{v} + \mathbf{i}$.
The irreversible processes of diffusion cause a further increase in entropy; the rate of
change of the total entropy is§

    $\frac{\partial\mathscr{S}}{\partial t} = \int \frac{\mathbf{E}\cdot\mathbf{j}}{T}\,dV - \int \frac{\mathbf{i}\cdot\operatorname{grad}\zeta}{T}\,dV.$    (28.2)

Like the electric current density, the diffusion flux may be written as a linear
combination of E and grad $\zeta$, or of j and grad $\zeta$. Using the symmetry of the kinetic
coefficients, we can relate one of the coefficients in this combination to the coefficient $\beta$ in
(28.1), in exactly the same way as we did for j and $\mathbf{q}-\phi\mathbf{j}$ in §26. The result is

    $\mathbf{i} = -\frac{\rho D}{(\partial\zeta/\partial c)_{P,T}}\operatorname{grad}\zeta + \beta\mathbf{j}.$    (28.3)

The coefficient of grad $\zeta$ is here expressed in terms of the ordinary diffusion coefficient D,
$\rho$ being the density of matter. For j = 0 and constant pressure and temperature we have
$\mathbf{i} = -\rho D\operatorname{grad}c$.

The inadmissibility in (28.1) and (28.3) of terms proportional to the pressure gradient 
follows, as in §26, from the law of the increase of entropy: such terms would make the 
derivative of the total entropy (28.2) a quantity of variable sign. 

Formulae (28.1) and (28.3) give all the diffusion phenomena in electrolytes, but we shall 
not pause here to examine them more closely. 

† In a strong electrolyte, the solute is completely dissociated, and so the mass concentration may be written as
$\rho c = m_+ n_+ + m_- n_-$, where $m_+$ and $m_-$ are the cation and anion masses, $n_+$ and $n_-$ their number densities. With
the above definition of the potential, $\phi = 0$ corresponds to $\zeta_+/m_+ = \zeta_-/m_-$ for the cation and anion chemical
potentials, which are also related by $\zeta_+ + \zeta_- = \zeta_1$.

‡ It should be emphasized, however, that, for a given concentration gradient, j does depend on the pressure
gradient:

    $\operatorname{grad}\zeta = (\partial\zeta/\partial c)_{P,T}\operatorname{grad}c + (\partial\zeta/\partial P)_{c,T}\operatorname{grad}P.$

§ The derivation of the second term is given in FM, §57.


PROBLEM

Two parallel plates of a metal A are immersed in a solution of an electrolyte AX. Find the current density as a
function of the potential difference applied between the plates.

Solution. When the current passes, metal is dissolved from one plate and deposited on the other. The
solvent (water) remains at rest, and a mass flux of metal of density† $\rho v = jm/e$ occurs in the solution, where j is the
electric current density, and m and e are the mass and charge of an ion A⁺. This flux is also given by $i + \rho v c$, where
i is as shown in (28.3); assuming the pressure constant throughout the liquid,‡ we have


where x is the coordinate in the direction of a line joining the electrodes. Since j = constant in the solution, this
gives


where $c_1$, $c_2$ are the concentrations at the surfaces of the plates, and l is the distance between them.

The potential difference $\mathscr{E}$ between the plates is most simply found from the total amount of energy Q
dissipated per unit time and unit area of the plates, which must equal $j\mathscr{E}$. By (28.1), (28.2) we have




and therefore, using (1), 




(3) 


Formulae (2) and (3) implicitly solve the problem. 

If the current j is small, the concentration difference $c_2 - c_1$ is also small. Replacing the integrals by $c_2 - c_1$
times the integrands, we find the effective specific resistance of the solution:


£ 

// 


pDdc\ 




The first term in (3) gives the potential drop ($\int (j/\sigma)\,dx$) due to the passage of the current. The second term is the
e.m.f. due to the concentration gradient in the solution (in a certain sense analogous to the thermo-e.m.f.). This
latter expression is independent of the conditions of the particular one-dimensional problem considered, and is
the general expression for the e.m.f. of a “concentration cell”.


† It may be recalled that the hydrodynamic velocity v in a solution is defined so that $\rho\mathbf{v}$ is the momentum of
unit volume of the liquid; see FM, §57. Hence the fact that in this case only the dissolved metal is moving (relative
to the electrodes) does not affect the calculation of $\rho\mathbf{v}$.

‡ The change in pressure due to the motion of the liquid gives only terms of a higher order of smallness.


STATIC MAGNETIC FIELD


§29. Static magnetic field 

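A static magnetic field in matter satisfies two of Maxwell’s equations, obtained by
averaging the microscopic equations

    $\operatorname{div}\mathbf{h} = 0, \quad \operatorname{curl}\mathbf{h} = \frac{1}{c}\frac{\partial\mathbf{e}}{\partial t} + \frac{4\pi}{c}\rho\mathbf{v}.$    (29.1)

The mean magnetic field is usually called the magnetic induction and denoted by B:

    $\overline{\mathbf{h}} = \mathbf{B}.$    (29.2)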

Hence the result of averaging the first equation (29.1) is 

divB = 0. (29.3) 

In the second equation, the time derivative gives zero on averaging, since the mean field is 
supposed constant, and so we have 

    $\operatorname{curl}\mathbf{B} = (4\pi/c)\,\overline{\rho\mathbf{v}}.$    (29.4)

The mean value of the microscopic current density is in general not zero in either 
conductors or dielectrics. The only difference between these two classes is that in 
dielectrics we always have 

    $\int \overline{\rho\mathbf{v}}\cdot d\mathbf{f} = 0,$    (29.5)

where the integral is taken over the area of any cross-section of the body; in conductors, 
this integral need not be zero. Let us suppose to begin with that there is no net current in 
the body if it is a conductor, i.e. that (29.5) holds. 

The vanishing of the integral in (29.5) for every cross-section of the body means that the
vector $\overline{\rho\mathbf{v}}$ can be written as the curl of another vector, usually denoted by cM:

    $\overline{\rho\mathbf{v}} = c\operatorname{curl}\mathbf{M},$    (29.6)

where M is zero outside the body; compare the similar discussion in §6. For, integrating
over a surface bounded by a curve which encloses the body and nowhere enters it, we have
$\int \overline{\rho\mathbf{v}}\cdot d\mathbf{f} = c\int \operatorname{curl}\mathbf{M}\cdot d\mathbf{f} = c\oint \mathbf{M}\cdot d\mathbf{l} = 0$. The vector M is called the magnetization of the
body. Substituting it in (29.4), we find

    $\operatorname{curl}\mathbf{H} = 0,$    (29.7)

where the vector H and the magnetic induction B are related by

    $\mathbf{B} = \mathbf{H} + 4\pi\mathbf{M},$    (29.8)

which is analogous to the relation between the electric field E and induction D. Although
H is, by analogy with E, usually called the magnetic field, it must be remembered that the 
true mean field is really B and not H. 

To see the physical significance of the quantity M, let us consider the total magnetic 
moment due to all the charged particles moving in the body. By the definition of the 
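magnetic moment (see Fields, §44), this is†

    $\frac{1}{2c}\int \mathbf{r}\times\overline{\rho\mathbf{v}}\,dV = \tfrac{1}{2}\int \mathbf{r}\times\operatorname{curl}\mathbf{M}\,dV.$

Since $\overline{\rho\mathbf{v}} = 0$ outside the body, the integral can be taken over any volume which includes
the body. We transform the integral as follows:

    $\int \mathbf{r}\times\operatorname{curl}\mathbf{M}\,dV = -\oint \mathbf{r}\times(\mathbf{M}\times d\mathbf{f}) - \int (\mathbf{M}\times\operatorname{grad})\times\mathbf{r}\,dV.$

The integral over the surface outside the body is zero. In the second term we have
$(\mathbf{M}\times\operatorname{grad})\times\mathbf{r} = -\mathbf{M}\operatorname{div}\mathbf{r} + \mathbf{M} = -2\mathbf{M}$. Thus we obtain

    $\frac{1}{2c}\int \mathbf{r}\times\overline{\rho\mathbf{v}}\,dV = \int \mathbf{M}\,dV.$    (29.9)


We see that the magnetization vector is the magnetic moment per unit volume.‡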
The equations (29.3) and (29.7) must be supplemented by a relation between H and B in 
order to complete the system of equations. For example, in non-ferromagnetic bodies in
fairly weak magnetic fields, B and H are linearly related. In isotropic bodies, this linear
relation becomes a simple proportionality:

    $\mathbf{B} = \mu\mathbf{H}.$    (29.10)

The coefficient $\mu$ is called the magnetic permeability. We also have $\mathbf{M} = \chi\mathbf{H}$, where the
coefficient

    $\chi = (\mu - 1)/4\pi$    (29.11)

is called the magnetic susceptibility.

Unlike the permittivity $\varepsilon$, which always exceeds unity, the magnetic permeability may be
either greater or less than unity. (It is, however, always positive, as we shall prove in §31.
The reason for the differing behaviour of $\mu$ and $\varepsilon$ is discussed in §32.) The magnetic
susceptibility $\chi$ may correspondingly be either positive or negative.

Another, quantitative, difference is that the magnetic susceptibility of the great majority 
of bodies is very small in comparison with the dielectric susceptibility. This difference 
arises because the magnetization of a (non-ferromagnetic) body is a relativistic effect, of
order $v^{2}/c^{2}$, where v is the velocity of the electrons in the atoms.§

In anisotropic bodies (crystals), the simple proportionality (29.10) is replaced by the
linear relations

    $B_i = \mu_{ik}H_k.$    (29.12)

The magnetic permeability tensor $\mu_{ik}$ is symmetrical; this follows from the thermodynamic
relations to be derived in §31, in exactly the same way as for $\varepsilon_{ik}$ (§13).


† For clarity, it should be emphasized that r here is a variable coordinate of integration, not the position
vector of a microscopic particle; it therefore does not come under the averaging sign.

‡ The quantity M is completely determined only when this relation is established. The conditions (29.6) inside the
body, and M = 0 outside it, do not uniquely define M: the gradient of any scalar could be added to M inside the
body without affecting (29.6) (cf. the similar remark in the first footnote to …).

§ The ratio v/c appears with H in the Hamiltonian of the interaction of the body with the magnetic field, and
… the atoms or molecules.

From the equations div B = 0, curl H = 0 it follows (cf §6) that at a boundary between 
two different media we must have 

    $B_{1n} = B_{2n}, \quad \mathbf{H}_{1t} = \mathbf{H}_{2t}.$    (29.13)

This system of equations and boundary conditions is formally identical with those for the 
electrostatic field in a dielectric in the absence of free charges, differing only in that E and 
D are replaced by H and B respectively. Since curl H = 0, we can put $\mathbf{H} = -\operatorname{grad}\psi$; the
equations for the potential $\psi$ are the same as those for the electrostatic potential. Thus the
solutions of the various problems of electrostatics discussed in Chapter II can be
immediately applied to problems with a static magnetic field. In particular, the formulae
derived in §8 for a dielectric ellipsoid in a uniform electric field hold also, with appropriate
substitutions, for a magnetic ellipsoid in a uniform magnetic field. For example, the
magnetic field $\mathbf{H}^{(i)}$ and induction $\mathbf{B}^{(i)}$ inside the ellipsoid are related to the external
field $\mathfrak{H}$ by

    $H_i^{(i)} + n_{ik}\,(B_k^{(i)} - H_k^{(i)}) = \mathfrak{H}_i,$    (29.14)

where $n_{ik}$ is the demagnetizing factor tensor. This relation is valid, whatever the relation
between B and H.
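
For the linear isotropic case $\mathbf{B} = \mu\mathbf{H}$ and a field along a principal axis of the ellipsoid, (29.14) reduces to a scalar equation that can be solved at once. The Python sketch below does this; the function name and the numerical values are illustrative assumptions, and the value n = 1/3 used for a sphere is the standard depolarizing factor familiar from the electrostatic problem.

```python
def field_inside_ellipsoid(h_ext, mu, n):
    """Solve H + n (B - H) = h_ext with B = mu * H along a principal axis.

    h_ext : external field component along that axis
    mu    : magnetic permeability (linear isotropic material assumed)
    n     : demagnetizing factor for that axis
    Returns (H, B) inside the ellipsoid.
    """
    H = h_ext / (1.0 + n * (mu - 1.0))
    B = mu * H
    return H, B


# Sphere (n = 1/3) with an illustrative permeability mu = 4.
H_in, B_in = field_inside_ellipsoid(h_ext=6.0, mu=4.0, n=1.0 / 3.0)
print(H_in, B_in)

# The pair (H_in, B_in) should satisfy the scalar form of (29.14).
residual = H_in + (1.0 / 3.0) * (B_in - H_in) - 6.0
print(residual)
```

For a sphere this gives $H^{(i)} = 3\mathfrak{H}/(\mu+2)$, the magnetic analogue of the field inside a dielectric sphere.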

The tangential component of the magnetic induction, unlike its normal component, is 
discontinuous at a surface separating two media. The magnitude of the discontinuity can 
be related to the current density on the surface. To do this, we integrate both sides of 
equation (29.4) over a small interval $\Delta l$ crossing the surface along the normal. We then let
$\Delta l$ tend to zero; the integral $\int\overline{\rho\mathbf{v}}\,dl$ may tend to some finite limit. The quantity

    $\mathbf{g} = \int \overline{\rho\mathbf{v}}\,dl$    (29.15)

may be called the surface current density; it gives the charge passing per unit time across
unit length of a line in the surface. We take the direction of g at a given point on the surface
as the y-axis, and the direction of the normal from medium 1 to medium 2 as the x-axis.
Then the integration of equation (29.4) gives

    $\frac{4\pi}{c}\,g = \int (\operatorname{curl}\mathbf{B})_y\,dl = \int \Big(\frac{\partial B_x}{\partial z} - \frac{\partial B_z}{\partial x}\Big)dx.$

Since $B_x$ is continuous, the derivative $\partial B_x/\partial z$ is finite, and so its integral tends to zero with
$\Delta l$. The integral of $\partial B_z/\partial x$ gives the difference in the values of $B_z$ on the two sides of the
surface. Thus $B_{2z} - B_{1z} = -4\pi g/c$. This can be written in vector form:

    $4\pi\mathbf{g}/c = \mathbf{n}\times(\mathbf{B}_2 - \mathbf{B}_1) = 4\pi\,\mathbf{n}\times(\mathbf{M}_2 - \mathbf{M}_1),$    (29.16)

where n is a unit vector along the normal into region 2; the last member of (29.16) is 
obtained by using the continuity of the tangential component of H. 

§30. The magnetic field of a steady current 

If a conductor carries a non-zero total current, the mean current density in it can be 
written as $c\operatorname{curl}\mathbf{M} + \mathbf{j}$. The first term, resulting from the magnetization of the
medium, makes no contribution to the total current, so that the net charge transfer through
a cross-section of the body is given by the integral $\int\mathbf{j}\cdot d\mathbf{f}$ of the second term. The quantity j
is called the conduction current density.† The statements made in §21 apply to this current;
in particular, the energy dissipated per unit time and volume is $\mathbf{E}\cdot\mathbf{j}$.

The distribution of the current j over the volume of the conductor is given by the 
equations of §21, which do not involve the magnetic field due to j itself, if we neglect the 
effect of this field on the conductivity of the body. Hence the magnetic field of the currents 
must be determined for a given current distribution. The equations satisfied by this field 
differ from those in §29 by the presence of a term $4\pi\mathbf{j}/c$ on the right-hand side of (29.7):

    $\operatorname{div}\mathbf{B} = 0,$    (30.1)

    $\operatorname{curl}\mathbf{H} = 4\pi\mathbf{j}/c.$    (30.2)

The conduction current density j, which is proportional to the electric field, does not
become infinite, and in particular is finite on a surface separating two media. Hence the
term on the right of (30.2) does not affect the boundary condition that the tangential
component of H be continuous.

To solve equations (30.1), (30.2), it is convenient to use the vector potential A, defined by 
B = curl A, (30.3) 

so that equation (30.1) is satisfied identically. Equation (30.3) does not uniquely define the 
vector potential, to which the gradient of any scalar may be added without affecting (30.3). 
For this reason we can impose on A a further condition, which we take to be 

div A = 0. (30.4) 

The equation for A is obtained by substituting (30.3) in (30.2). If the linear relation
$\mathbf{B} = \mu\mathbf{H}$ holds we have

    $\operatorname{curl}\Big(\frac{1}{\mu}\operatorname{curl}\mathbf{A}\Big) = 4\pi\mathbf{j}/c.$    (30.5)

In this form the equation is valid for any medium, homogeneous or not.
In a homogeneous medium, $\mu$ = constant, and since

    $\operatorname{curl}\operatorname{curl}\mathbf{A} = \operatorname{grad}\operatorname{div}\mathbf{A} - \Delta\mathbf{A} = -\Delta\mathbf{A},$

we find from (30.5)

    $\Delta\mathbf{A} = -4\pi\mu\mathbf{j}/c.$    (30.6)

If we have two or more adjoining media with different magnetic permeability $\mu$, the
general equation (30.5) has the form (30.6) in each homogeneous medium, while at the
interfaces the tangential component of the vector $(1/\mu)\operatorname{curl}\mathbf{A}$ must be continuous.
Moreover, the tangential component of A itself must be continuous, since a discontinuity
would mean that the induction B was infinite at the boundary.

† The quantity $c\operatorname{curl}\mathbf{M}$ is called the molecular current density. This name, however, is not in
complete accordance with the actual physical picture of motion of charges in a conductor. For example, in a metal
the conduction electrons, as well as the atoms, contribute to the magnetization.

The field equations are simpler in the two-dimensional problem of finding the magnetic
field in a medium infinite and homogeneous in one direction (which we take as the z-
direction), the currents which produce the field being everywhere in that direction, with the
current density $j_z = j$ depending only on x and y. We make the plausible assumption (to be
confirmed by the result) that the vector potential of such a field is also in the z-direction:
$A_z = A(x, y)$. The condition (30.4) is then satisfied identically; the magnetic field is
everywhere parallel to the xy-plane. We denote by k a unit vector in the z-direction; then

    $\operatorname{curl}\mathbf{A} = \operatorname{curl}(A\mathbf{k}) = \operatorname{grad}A\times\mathbf{k},$

    $\operatorname{curl}\Big(\frac{1}{\mu}\operatorname{grad}A\times\mathbf{k}\Big) = -\mathbf{k}\operatorname{div}\Big(\frac{\operatorname{grad}A}{\mu}\Big).$

Hence equation (30.5) becomes

    $\operatorname{div}\Big(\frac{\operatorname{grad}A}{\mu}\Big) = -\frac{4\pi}{c}\,j(x, y),$    (30.7)


i.e. we in fact obtain one equation for the one scalar quantity A (x, y). For a piecewise 
homogeneous medium, (30.7) becomes 

    $\Delta A = -4\pi\mu\,j(x, y)/c,$    (30.8)


with the boundary condition that A and $(1/\mu)\,\partial A/\partial n$ be continuous at an interface.†

The magnetic field is easily found if the current distribution is symmetrical about the z-
axis: $j_z = j(r)$ (where r is the distance from that axis). In this case the lines of magnetic force
are evidently the circles r = constant. The magnitude of the field is found at once from the
formula

    $\oint \mathbf{H}\cdot d\mathbf{l} = \frac{4\pi}{c}\int \mathbf{j}\cdot d\mathbf{f},$    (30.9)

which is the integral form of (30.2). Thus 

H(r) = 2J(r)/cr, (30.10) 


where J (r) is the total current within the radius r. 
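
As an illustration of (30.10), the Python sketch below evaluates H(r) for a straight wire of circular cross-section carrying a uniformly distributed current. The uniform distribution, the function names and the numerical values are assumptions made only for this example; Gaussian units are used, with c in cm/s.

```python
def enclosed_current(r, total_current, wire_radius):
    """Current J(r) inside radius r for a uniform current density (assumed)."""
    if r >= wire_radius:
        return total_current
    return total_current * (r / wire_radius) ** 2


def field_magnitude(r, total_current, wire_radius, c=2.998e10):
    """H(r) = 2 J(r) / (c r), formula (30.10); r must be positive."""
    return 2.0 * enclosed_current(r, total_current, wire_radius) / (c * r)


a = 0.1  # wire radius (illustrative)
J = 5.0  # total current (illustrative)

H_half = field_magnitude(0.5 * a, J, a)   # inside the wire
H_edge = field_magnitude(a, J, a)         # at the surface
H_far = field_magnitude(2.0 * a, J, a)    # outside the wire
print(H_half, H_edge, H_far)
```

Inside the wire the field grows linearly with r, and outside it falls off as 1/r, so in this example the values at r = a/2 and r = 2a are both half the value at the surface.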

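The reduction of the vector equation (30.5) to a single scalar equation is possible also if
the current distribution is axially symmetrical and has in cylindrical polar coordinates r, $\phi$,
z the form $j_r = j_z = 0$, $j_\phi = j(r, z)$. We seek the vector potential in the form $A_r = A_z = 0$,
$A_\phi = A(r, z)$. The components of the magnetic induction $\mathbf{B} = \operatorname{curl}\mathbf{A}$ are $B_r = -\partial A/\partial z$,
$B_z = (1/r)\,\partial(rA)/\partial r$, $B_\phi = 0$, and the $\phi$-component of equation (30.2) gives

    $\frac{\partial}{\partial z}\Big(\frac{1}{\mu}\frac{\partial A}{\partial z}\Big) + \frac{\partial}{\partial r}\Big(\frac{1}{\mu r}\frac{\partial (rA)}{\partial r}\Big) = -\frac{4\pi}{c}\,j(r, z).$    (30.11)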


† It should be noticed that the two-dimensional problem with a static magnetic field is equivalent to the
two-dimensional electrostatic problem of determining the electric field due to extraneous charges with density
$\rho_{ex}(x, y)$ in a dielectric medium. The equation to be solved in the latter problem is $\operatorname{div}(\varepsilon\operatorname{grad}\phi) = -4\pi\rho_{ex}$, where
$\phi$ is the field potential; this differs from (30.7) only in that A, j/c and $\mu$ are replaced by $\phi$, $\rho_{ex}$ and $1/\varepsilon$ respectively.
The boundary conditions on A and $\phi$ are the same. A difference occurs, however, on passing to E and B from $\phi$
and A respectively. The vectors $\mathbf{E} = -\operatorname{grad}\phi$ and $\mathbf{B} = \operatorname{curl}\mathbf{A}$ are the same in magnitude but in perpendicular
directions at any given point.

The equations for the magnetic field of the currents can be solved in a general form in the
important case where the magnetic properties of the medium may be neglected,
i.e. where we can put $\mu = 1$. The vector potential then satisfies in all space the equation
$\Delta\mathbf{A} = -4\pi\mathbf{j}/c$ with no conditions at the interfaces between different media (including the
surface of the conductor on which the current flows). The solution of this equation which
vanishes at infinity is

    $\mathbf{A} = \frac{1}{c}\int \frac{\mathbf{j}}{R}\,dV,$    (30.12)

where R is the distance from the volume element dV to the point at which A is to be
calculated (see Fields, §43). In taking the curl of this equation, we must remember that the
integrand j/R is to be differentiated with respect to the coordinates of this point, of which j
is independent, so that

    $\operatorname{curl}(\mathbf{j}/R) = \operatorname{grad}(1/R)\times\mathbf{j} = -\mathbf{R}\times\mathbf{j}/R^{3},$

where the radius vector R is from dV to the point under consideration. Thus

    $\mathbf{H} = \frac{1}{c}\int \frac{\mathbf{j}\times\mathbf{R}}{R^{3}}\,dV.$    (30.13)

If the conductor on which the current flows is sufficiently thin (a thin wire), and if we are 
interested only in the field in the surrounding space, the thickness of the wire may be 
neglected. In what follows we shall often discuss such linear currents. The integration over 
the volume of the conductor is then replaced by an integration along its length: the 
formulae for linear currents are obtained from those for volume currents by making the 
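substitution $\mathbf{j}\,dV \to J\,d\mathbf{l}$, where J is the total current in the conductor. For example, from
formulae (30.12) and (30.13) we have

    $\mathbf{A} = \frac{J}{c}\oint \frac{d\mathbf{l}}{R}, \quad \mathbf{H} = \frac{J}{c}\oint \frac{d\mathbf{l}\times\mathbf{R}}{R^{3}}.$    (30.14)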


The latter formula is Biot and Savart’s law. 
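
A minimal numerical sketch of Biot and Savart's law for a circular loop is given below in Python. The loop is assumed to lie in the plane z = 0 with its centre at the origin, the contour integral is approximated by a sum over short straight elements, and units with c = 1 are used by default; the function names and the numerical values are illustrative assumptions. On the axis the result can be compared with the closed form $H_z = 2\pi J a^{2}/c(a^{2}+z^{2})^{3/2}$, which follows directly from the second formula (30.14).

```python
import math


def cross(u, v):
    """Vector product of two 3-component tuples."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def loop_field(point, radius, current, c=1.0, n=2000):
    """H = (J/c) * contour integral of dl x R / R^3 for a circular loop.

    The loop lies in the plane z = 0, centred at the origin (assumption).
    R is the vector from the current element to the field point.
    """
    hx = hy = hz = 0.0
    dphi = 2.0 * math.pi / n
    for k in range(n):
        phi = (k + 0.5) * dphi
        src = (radius * math.cos(phi), radius * math.sin(phi), 0.0)
        dl = (-radius * math.sin(phi) * dphi, radius * math.cos(phi) * dphi, 0.0)
        R = tuple(p - s for p, s in zip(point, src))
        R3 = (R[0] ** 2 + R[1] ** 2 + R[2] ** 2) ** 1.5
        cx, cy, cz = cross(dl, R)
        hx += cx / R3
        hy += cy / R3
        hz += cz / R3
    factor = current / c
    return (factor * hx, factor * hy, factor * hz)


a, J, z = 1.0, 3.0, 0.7  # illustrative values
H_numeric = loop_field((0.0, 0.0, z), a, J)
H_axis_exact = 2.0 * math.pi * J * a ** 2 / (a ** 2 + z ** 2) ** 1.5
print(H_numeric, H_axis_exact)
```

The same function can be evaluated off the axis, where no elementary closed form exists and the result involves elliptic integrals.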

This simple formula for the magnetic field of a linear current does not depend on the 
assumption that $\mu = 1$. Since we neglect the thickness of the conductor, no boundary
conditions at its surface need be applied, and the magnetic properties of the conducting 
material are of no importance (it may even be ferromagnetic). The solution of equation 
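(30.6) for the field in the medium surrounding the conductor is therefore

    $\mathbf{A} = \frac{\mu J}{c}\oint \frac{d\mathbf{l}}{R}, \quad \mathbf{B} = \frac{\mu J}{c}\oint \frac{d\mathbf{l}\times\mathbf{R}}{R^{3}},$    (30.15)

whatever the magnetic susceptibility of that medium. Thus the presence of the medium
simply changes the magnetic induction by a factor $\mu$. The field $\mathbf{H} = \mathbf{B}/\mu$ is unchanged.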

The problem of determining the magnetic field of linear currents can also be solved as a 
problem of potential theory. Since we neglect the volume of conductors, we are in fact 
determining the field in a region containing no currents except along certain line 
singularities. In the absence of currents, a static magnetic field has a scalar potential, which 
in a homogeneous medium satisfies Laplace’s equation. There is, however, an important 
difference between the magnetic field potential and the electrostatic potential: the latter is 
always a one-valued function, because curl E = 0 in all space (including charged regions) 
and so the change in the potential in going round any closed contour (i.e. the circulation of 
E round that contour) is zero. The circulation of the magnetic field round a contour 
enclosing a linear current is not zero, but $4\pi J/c$. Hence the potential changes by this
amount on each passage round a contour enclosing a linear current, i.e. it is a many-valued
function.

If the currents lie in a finite region of space (and $\mu = 1$ everywhere), the vector potential
of the magnetic field at a great distance from the conductors is

    $\mathbf{A} = \mathscr{M}\times\mathbf{R}/R^{3},$    (30.16)

where

    $\mathscr{M} = \int \mathbf{r}\times\mathbf{j}\,dV/2c$    (30.17)

is the total magnetic moment of the system.†
For a linear current, this becomes

    $\mathscr{M} = J\oint \mathbf{r}\times d\mathbf{l}/2c,$

and can be transformed into an integral over a surface bounded by the line of the current. 
The product $d\mathbf{f} = \tfrac{1}{2}\mathbf{r}\times d\mathbf{l}$ is equal in magnitude to the area of the triangular surface
element formed by the vectors r and dl. The vector $\int d\mathbf{f}$ is independent of the particular
surface (bounded by the current) over which it is taken. Thus the magnetic moment of a
closed linear current is

    $\mathscr{M} = J\int d\mathbf{f}/c.$    (30.18)

In particular, for a plane closed linear current the magnetic moment is simply JS/c, where S 
is the area of the plane enclosed by the current. 
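
The equivalence of the contour form $J\oint\mathbf{r}\times d\mathbf{l}/2c$ and the result JS/c for a plane circuit can be checked directly for a polygonal loop. In the Python sketch below the loop is assumed to lie in the xy-plane, so that only the z-component of the moment is non-zero; the function name, the rectangular test loop and the choice c = 1 are illustrative assumptions. Along a straight edge from $\mathbf{r}_0$ to $\mathbf{r}_1$ the integral of $\mathbf{r}\times d\mathbf{l}$ equals $\mathbf{r}_0\times(\mathbf{r}_1-\mathbf{r}_0)$ exactly, which is what the loop over edges accumulates.

```python
def polygon_moment_z(vertices, current, c=1.0):
    """z-component of (J / 2c) * contour integral of r x dl for a plane polygon.

    vertices : list of (x, y) corners, traversed in the direction of the current
    """
    total = 0.0
    n = len(vertices)
    for k in range(n):
        x0, y0 = vertices[k]
        x1, y1 = vertices[(k + 1) % n]
        # (r0 x (r1 - r0))_z: exact contribution of one straight edge
        total += x0 * (y1 - y0) - y0 * (x1 - x0)
    return current * total / (2.0 * c)


# Rectangle of sides 2 and 3 (area S = 6), traversed anticlockwise.
rectangle = [(0.0, 0.0), (2.0, 0.0), (2.0, 3.0), (0.0, 3.0)]
J = 4.0
moment = polygon_moment_z(rectangle, J)
moment_reversed = polygon_moment_z(rectangle[::-1], J)
print(moment, moment_reversed)
```

The first value equals JS/c for the rectangle, and reversing the direction of the current reverses the sign of the moment.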

To conclude this section, we may briefly discuss the energy flux in a conductor. The 
energy dissipated as Joule heat in the conductor is derived from the energy of the 
electromagnetic field. In a steady state, the equation of continuity which expresses the law
of conservation of energy is

    $\operatorname{div}\mathbf{S} = -\mathbf{j}\cdot\mathbf{E},$    (30.19)

where S is the energy flux density, given in a conductor by 

$$\mathbf{S} = c\,\mathbf{E}\times\mathbf{H}/4\pi, \tag{30.20}$$

which is formally the same as the expression for the Poynting vector for the field in a 
vacuum. This is easily verified directly by calculating div S from the equations curl E = 0 
and (30.2), when we obtain (30.19). 

Formula (30.20) also follows independently from the obvious condition that the normal 
component of S must be continuous at the surface of a conductor, if we use the continuity 
of $\mathbf{E}_t$ and $\mathbf{H}_t$, and the validity of (30.20) in the vacuum outside the body.


PROBLEMS‡


Problem 1. Determine the scalar potential of the magnetic field of a closed linear current. 

Solution. We transform the curvilinear integral into one over a surface bounded by the curve, obtaining 


j C 1 

B = curl A =-(df-grad) grad — 


† See Fields, §44. In the derivation there given, we use explicitly the idea of a current as the result of the motion
of individual charged particles. Such a derivation is, of course, quite general, but formula (30.16) can also be
obtained by macroscopic arguments (see Problem 4).
‡ In Problems 1-4, $\mu = 1$.
(where we have used the fact that $\Delta(1/R) = 0$). Since $\mathbf{B} = -\operatorname{grad}\phi$ we have for the scalar potential
J C I J fdf-R 

,(,=-Jdf-grad-=--J-^-. 

The integral is, geometrically, the solid angle $\Omega$ subtended by the closed contour at the point considered. The
above-mentioned many-valuedness of the potential is seen from the fact that, as this point describes a closed path
embracing the wire, the angle $\Omega$ changes suddenly from $2\pi$ to $-2\pi$.

Problem 2. Find the magnetic field of a linear current flowing in a circle with radius a. 

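Solution. We take the origin of cylindrical polar coordinates $r, \phi, z$ at the centre of the circle, with the angle $\phi$
measured from the plane which passes through the z-axis and the point at which the field is calculated. The
vector potential has only one component, $A_\phi = A(r, z)$, and by formula (30.14) we have

$$A_\phi = \frac{J}{c}\oint\frac{\cos\phi\,dl}{R} = \frac{2J}{c}\int_0^{\pi}\frac{a\cos\phi\,d\phi}{\sqrt{a^2+r^2+z^2-2ar\cos\phi}}.$$

Putting $\theta = \tfrac12(\phi-\pi)$, we find

$$A_\phi = \frac{4J}{ck}\sqrt{\frac{a}{r}}\left[\left(1-\tfrac12k^2\right)K - E\right],$$

where $k^2 = 4ar/[(a+r)^2+z^2]$, and K and E are complete elliptic integrals of the first and second kinds:

$$K = \int_0^{\pi/2}\frac{d\theta}{\sqrt{1-k^2\sin^2\theta}}, \qquad E = \int_0^{\pi/2}\sqrt{1-k^2\sin^2\theta}\,d\theta.$$

The components of the induction are

$$B_r = -\frac{\partial A_\phi}{\partial z} = \frac{J}{c}\,\frac{2z}{r\sqrt{(a+r)^2+z^2}}\left[-K + \frac{a^2+r^2+z^2}{(a-r)^2+z^2}\,E\right],$$

$$B_z = \frac{1}{r}\frac{\partial}{\partial r}(rA_\phi) = \frac{J}{c}\,\frac{2}{\sqrt{(a+r)^2+z^2}}\left[K + \frac{a^2-r^2-z^2}{(a-r)^2+z^2}\,E\right].$$

Here we have used the easily verified formulae

$$\frac{\partial K}{\partial k} = \frac{E}{k(1-k^2)} - \frac{K}{k}, \qquad \frac{\partial E}{\partial k} = \frac{E-K}{k}.$$

On the axis ($r = 0$) we have $B_r = 0$, $B_z = 2\pi a^2J/c(a^2+z^2)^{3/2}$, as can also be found by a straightforward
calculation.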
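
The reduction of the vector potential of a circular current to complete elliptic integrals can be checked numerically. The sketch below is only an illustration: it evaluates the azimuthal component of the vector potential once by direct quadrature of the contour integral $(J/c)\oint\cos\phi\,dl/R$ and once from the elliptic-integral expression with $k^2 = 4ar/[(a+r)^2+z^2]$, and it also compares the near-axis field with the known on-axis value. All function names, the midpoint quadrature and the choice `J = c = 1` are assumptions made for the example.

```python
import math


def midpoint(f, lo, hi, n=4000):
    """Composite midpoint rule for the integral of f over [lo, hi]."""
    h = (hi - lo) / n
    return h * sum(f(lo + (i + 0.5) * h) for i in range(n))


def ellip_K(k):
    """Complete elliptic integral of the first kind, by quadrature."""
    return midpoint(lambda t: 1.0 / math.sqrt(1.0 - (k * math.sin(t)) ** 2),
                    0.0, math.pi / 2)


def ellip_E(k):
    """Complete elliptic integral of the second kind, by quadrature."""
    return midpoint(lambda t: math.sqrt(1.0 - (k * math.sin(t)) ** 2),
                    0.0, math.pi / 2)


def A_phi_direct(a, r, z, J=1.0, c=1.0):
    """A_phi = (2Ja/c) * int_0^pi cos(phi) dphi / sqrt(a^2+r^2+z^2-2ar cos(phi))."""
    s = a * a + r * r + z * z
    return (2.0 * J * a / c) * midpoint(
        lambda p: math.cos(p) / math.sqrt(s - 2.0 * a * r * math.cos(p)),
        0.0, math.pi)


def A_phi_elliptic(a, r, z, J=1.0, c=1.0):
    """A_phi = (4J/ck) sqrt(a/r) [(1 - k^2/2) K - E]."""
    k = math.sqrt(4.0 * a * r / ((a + r) ** 2 + z * z))
    return (4.0 * J / (c * k)) * math.sqrt(a / r) * (
        (1.0 - 0.5 * k * k) * ellip_K(k) - ellip_E(k))


def B_z_axis(a, z, J=1.0, c=1.0):
    """Field on the axis of the circle: 2 pi a^2 J / c (a^2 + z^2)^(3/2)."""
    return 2.0 * math.pi * a * a * J / (c * (a * a + z * z) ** 1.5)


direct = A_phi_direct(1.0, 0.5, 0.3)
closed = A_phi_elliptic(1.0, 0.5, 0.3)

# Near the axis A_phi is approximately B_z * r / 2, so 2 A_phi / r tends to B_z.
r_small = 1.0e-3
bz_from_A = 2.0 * A_phi_direct(1.0, r_small, 0.7) / r_small
```

The two evaluations of the vector potential should agree to within the quadrature error, and `bz_from_A` should approach `B_z_axis(1.0, 0.7)` as `r_small` is reduced.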


Problem 3. Determine the magnetic field in a cylindrical hole in a cylindrical conductor of infinite length 
carrying a current uniformly distributed over its cross-section (Fig. 18). 

Solution. If there were no hole, the field in the cylinder would be given by $H'_x = -2\pi jy/c$, $H'_y = 2\pi jx/c$.
The dimensions and axes are as shown in Fig. 18. If a current with density $-j$ were to flow in the inner cylinder, it
would produce a field $H''_x = 2\pi jy'/c$, $H''_y = -2\pi jx'/c$. The required field in the hole is obtained by superposing
these two fields. Since $x - x' = OO' = h$, and $y = y'$, we have $H_x = 0$, $H_y = 2\pi jh/c = 2hJ/(b^2-a^2)c$, i.e. a
uniform field in the y-direction.
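
The superposition argument of Problem 3 can be illustrated by a short numerical sketch. It assumes, as the relation between the two sets of coordinates suggests, that the axis of the hole is displaced by h along the x-axis from the axis of the conductor. The function name `field_in_hole` and the default `c = 1.0` are hypothetical choices for the example.

```python
import math


def field_in_hole(x, y, j, h, c=1.0):
    """Field at a point (x, y) inside the hole, by superposition.

    The solid cylinder (axis through the origin) carries current density +j;
    the hole (axis through (h, 0), an assumption of this sketch) is represented
    by an additional current density -j.
    """
    # field of the solid cylinder without a hole
    hx1 = -2.0 * math.pi * j * y / c
    hy1 = 2.0 * math.pi * j * x / c
    # field of the inner cylinder with density -j, in coordinates x', y'
    xp, yp = x - h, y
    hx2 = 2.0 * math.pi * j * yp / c
    hy2 = -2.0 * math.pi * j * xp / c
    return hx1 + hx2, hy1 + hy2


j, h = 0.25, 0.4
samples = [field_in_hole(x, y, j, h) for (x, y) in [(0.4, 0.0), (0.45, 0.05), (0.35, -0.08)]]
expected_hy = 2.0 * math.pi * j * h
```

Every sampled point gives the same field, directed along y and equal to $2\pi jh/c$, which is the uniformity stated in the solution.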

Problem 4. Derive from (30.12) the formula (30.16) for the vector potential of the field far from the 
currents. 

Solution. We write $\mathbf{R} = \mathbf{R}_0 - \mathbf{r}$, where $\mathbf{R}_0$ and r are the radius vectors from the origin (situated somewhere
among the currents) to the point considered and to the volume element dV respectively. Expanding the integrand
in powers of r and using the fact that $\int\mathbf{j}\,dV = 0$, we have $A_i \cong (R_k/cR^3)\int x_kj_i\,dV$. The suffix 0 to R is omitted.
Integrating by parts the identity $\int x_ix_k\operatorname{div}\mathbf{j}\,dV = 0$ gives $\int(j_ix_k + j_kx_i)\,dV = 0$. Hence we can write

$$A_i = (R_k/2cR^3)\int(x_kj_i - x_ij_k)\,dV,$$

which agrees with (30.16). 

Problem 5. Determine the magnetic field produced by a linear current in a magnetically anisotropic 
medium (A. S. Viglin, 1954). 

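Solution. In the anisotropic medium surrounding the conductor we have

$$\operatorname{div}\mathbf{B} = \mu_{ik}\,\partial H_k/\partial x_i = 0, \tag{1}$$

where $\mu_{ik}$ is the magnetic permeability tensor of the medium. Instead of introducing the vector potential by
$\mathbf{B} = \operatorname{curl}\mathbf{A}$, we use another vector C defined by

$$H_i = e_{ikl}\,\mu_{km}\,\partial C_l/\partial x_m, \tag{2}$$

where $e_{ikl}$ is the antisymmetrical unit tensor. Then equation (1) is again satisfied identically. We can also impose on
the vector C thus defined the condition

$$\operatorname{div}\mathbf{C} = \partial C_i/\partial x_i = 0. \tag{3}$$

Substituting (2) in curl H = $4\pi\mathbf{j}/c$, we obtain $e_{ikl}\,\partial H_l/\partial x_k = -\mu_{km}\,\partial^2C_i/\partial x_k\partial x_m = 4\pi j_i/c$ (using the condition (3)
and the fact that $e_{ikl}e_{lmn} = \delta_{im}\delta_{kn} - \delta_{in}\delta_{km}$). The equation thus obtained for C is the same in form as that for the
electric field potential resulting from charges in an anisotropic medium (§13, Problem 2). The solution is

$$\mathbf{C} = \frac{1}{c}\int\frac{\mathbf{j}\,dV}{\sqrt{|\mu|\,\mu^{-1}_{ik}R_iR_k}},$$

where $|\mu|$ is the determinant of the tensor $\mu_{ik}$, and R the radius vector from the point considered to dV. For a linear
current we have

$$\mathbf{C} = \frac{J}{c\sqrt{|\mu|}}\oint\frac{d\mathbf{l}}{\sqrt{\mu^{-1}_{ik}R_iR_k}}.$$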


§31. Thermodynamic relations in a magnetic field

The thermodynamic relations for a magnetic substance in a magnetic field are, as we shall
see, very similar to the corresponding relations for a dielectric in an electric field. Their
derivation, however, is quite different from that given in §10. This difference is ultimately
due to the fact that a magnetic field, unlike an electric field, does no work on charges 
moving in it (since the force acting on a charge is perpendicular to its velocity). Hence, to 
calculate the change in the energy of the medium when a magnetic field is applied, we must 
examine the electric fields induced by the change in the magnetic field and determine the 
work done by these fields on the currents which produce the magnetic field. 

Thus the equation which relates electric and variable magnetic fields must be used. This 
equation is 


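$$\operatorname{curl}\mathbf{E} = -\frac{1}{c}\frac{\partial\mathbf{B}}{\partial t}; \tag{31.1}$$

it follows immediately on averaging the microscopic equation (1.3).

During a time $\delta t$, the field E does work $\delta t\int\mathbf{j}\cdot\mathbf{E}\,dV$ on the currents j. This quantity with
the opposite sign is the work $\delta R$ “done on the field” by the external e.m.f. which maintains
the currents. Substituting

$$\mathbf{j} = c\,\operatorname{curl}\mathbf{H}/4\pi,$$

we have

$$\delta R = -\delta t\,\frac{c}{4\pi}\int\mathbf{E}\cdot\operatorname{curl}\mathbf{H}\,dV = \delta t\,\frac{c}{4\pi}\int\operatorname{div}(\mathbf{E}\times\mathbf{H})\,dV - \delta t\,\frac{c}{4\pi}\int\mathbf{H}\cdot\operatorname{curl}\mathbf{E}\,dV.$$

The first integral, on being transformed to an integral over an infinitely distant surface, is
seen to be zero. In the second integral we substitute curl E from (31.1) and put $\delta\mathbf{B} = \delta t\,\partial\mathbf{B}/\partial t$
for the change in the magnetic induction, obtaining finally

$$\delta R = \int\mathbf{H}\cdot\delta\mathbf{B}\,dV/4\pi. \tag{31.2}$$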

This formula appears entirely analogous to the expression (10.2) for the work done in an 
infinitesimal change in the electric field. It must be pointed out, however, that the physical 
analogy between the two formulae is actually not complete, since H, unlike E, is not the 
mean value of the microscopic field. 

Having derived formula (31.2), we can write down all the thermodynamic relations for a 
magnetic substance in a magnetic field by analogy with those given in §10 for a dielectric in 
an electric field, simply replacing E and D by H and B respectively. We shall give some of 
these formulae here for purposes of reference. The differentials of the total free energy and 
the total internal energy are 

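$$\delta\mathscr{F} = -\mathscr{S}\,\delta T + \int\mathbf{H}\cdot\delta\mathbf{B}\,dV/4\pi, \qquad \delta\mathscr{U} = T\,\delta\mathscr{S} + \int\mathbf{H}\cdot\delta\mathbf{B}\,dV/4\pi, \tag{31.3}$$

and those of the corresponding quantities per unit volume are

$$dF = -S\,dT + \zeta\,d\rho + \mathbf{H}\cdot d\mathbf{B}/4\pi, \qquad dU = T\,dS + \zeta\,d\rho + \mathbf{H}\cdot d\mathbf{B}/4\pi. \tag{31.4}$$

We need also the thermodynamic potentials

$$\tilde U = U - \mathbf{H}\cdot\mathbf{B}/4\pi, \qquad \tilde F = F - \mathbf{H}\cdot\mathbf{B}/4\pi, \tag{31.5}$$

for which

$$d\tilde F = -S\,dT + \zeta\,d\rho - \mathbf{B}\cdot d\mathbf{H}/4\pi, \qquad d\tilde U = T\,dS + \zeta\,d\rho - \mathbf{B}\cdot d\mathbf{H}/4\pi. \tag{31.6}$$

If the linear relation $\mathbf{B} = \mu\mathbf{H}$ holds, we can write the expressions for all these quantities in
the form

$$\begin{aligned} U &= U_0(S,\rho) + B^2/8\pi\mu, & F &= F_0(T,\rho) + B^2/8\pi\mu, \\ \tilde U &= U_0(S,\rho) - \mu H^2/8\pi, & \tilde F &= F_0(T,\rho) - \mu H^2/8\pi. \end{aligned} \tag{31.7}$$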

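The work $\delta R$ (or, what is the same thing, the change $\delta\mathscr{F}$ at constant temperature) can be
written in a different form, in terms of the current density and the vector potential of the
magnetic field. For this purpose we put $\delta\mathbf{B} = \operatorname{curl}\delta\mathbf{A}$ and

$$(\delta\mathscr{F})_T = \frac{1}{4\pi}\int\mathbf{H}\cdot\operatorname{curl}\delta\mathbf{A}\,dV = -\frac{1}{4\pi}\int\operatorname{div}(\mathbf{H}\times\delta\mathbf{A})\,dV + \frac{1}{4\pi}\int\delta\mathbf{A}\cdot\operatorname{curl}\mathbf{H}\,dV.$$

The first integral is again zero, and the second gives

$$(\delta\mathscr{F})_T = \int\mathbf{j}\cdot\delta\mathbf{A}\,dV/c. \tag{31.8}$$

A similar transformation gives

$$(\delta\tilde{\mathscr{F}})_T = -\int\mathbf{A}\cdot\delta\mathbf{j}\,dV/c. \tag{31.9}$$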

It is useful to note that in macroscopic electrodynamics the currents (sources of the
magnetic field) are mathematically the analogues of the potentials, not of the charges
(the sources of the electric field). This is seen by comparing formulae (31.8) and (31.9) with
the corresponding results for an electric field:

$$(\delta\mathscr{F})_T = \int\phi\,\delta\rho\,dV, \qquad (\delta\tilde{\mathscr{F}})_T = -\int\rho\,\delta\phi\,dV \tag{31.10}$$

(see (10.13), (10.14)). We observe that the charges and potentials appear in these formulae
in the opposite order to the currents and potentials in formulae (31.8), (31.9).†
On account of the complete formal correspondence between the thermodynamic 
relations (expressed in terms of field and induction) for electric and magnetic fields, the 
thermodynamic inequalities derived in §18 can also be applied to magnetic fields. In 
particular, we have seen that it follows from these inequalities that $\varepsilon > 0$. In the electric case
this result was of no interest, because it was weaker than the inequality $\varepsilon > 1$ which follows
on other grounds. In the magnetic case, however, the corresponding inequality
$\mu > 0$ is very important, as it is the only restriction on the values which can be taken by the
magnetic permeability.


† The significance of this difference is further discussed in the last footnote to §33.
§32. The total free energy of a magnetic substance


In §11 expressions have been derived for the total free energy $\mathscr{F}$ of a dielectric in an
electric field. One of the thermodynamic properties of this quantity is that the change in it
gives the work done by the electric field on the body when the charges producing the field
remain constant. In a magnetic field a similar part is played by the free energy $\tilde{\mathscr{F}}$, since for
given currents producing the field the change in $\tilde{\mathscr{F}}$ is the work done on the body.

The following derivation is entirely analogous to that given in §11. The “total” quantity $\tilde{\mathscr{F}}$
is defined as

$$\tilde{\mathscr{F}} = \int\left(\tilde F + \frac{\mathfrak{H}^2}{8\pi}\right)dV, \tag{32.1}$$

where $\boldsymbol{\mathfrak{H}}$ is the magnetic field which would be produced by the given currents in the absence
of the magnetizable medium. The plus sign appears in the parenthesis (instead of the minus
sign as in (11.1)) because the value of $\tilde F$ for a magnetic field in a vacuum is $-\mathfrak{H}^2/8\pi$
(see (31.7)). The integration in (32.1) is taken over all space, including the volume occupied
by the conductors in which flow the currents producing the field.†

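Let us calculate the change in $\tilde{\mathscr{F}}$ (for a given temperature and no departure from
thermodynamic equilibrium in the medium) corresponding to an infinitesimal change in
the field. Since $\delta\tilde F = -\mathbf{B}\cdot\delta\mathbf{H}/4\pi$, we have

$$\begin{aligned} \delta\tilde{\mathscr{F}} &= -\int(\mathbf{B}\cdot\delta\mathbf{H} - \boldsymbol{\mathfrak{H}}\cdot\delta\boldsymbol{\mathfrak{H}})\,dV/4\pi \\ &= -\int(\mathbf{H}-\boldsymbol{\mathfrak{H}})\cdot\delta\boldsymbol{\mathfrak{H}}\,dV/4\pi - \int\mathbf{B}\cdot(\delta\mathbf{H}-\delta\boldsymbol{\mathfrak{H}})\,dV/4\pi - \int(\mathbf{B}-\mathbf{H})\cdot\delta\boldsymbol{\mathfrak{H}}\,dV/4\pi. \end{aligned} \tag{32.2}$$

Introducing the vector potential $\boldsymbol{\mathfrak{A}}$ of the field $\boldsymbol{\mathfrak{H}}$, we can write in the first term

$$(\mathbf{H}-\boldsymbol{\mathfrak{H}})\cdot\delta\boldsymbol{\mathfrak{H}} = (\mathbf{H}-\boldsymbol{\mathfrak{H}})\cdot\operatorname{curl}\delta\boldsymbol{\mathfrak{A}} = \operatorname{div}[\delta\boldsymbol{\mathfrak{A}}\times(\mathbf{H}-\boldsymbol{\mathfrak{H}})] + \delta\boldsymbol{\mathfrak{A}}\cdot\operatorname{curl}(\mathbf{H}-\boldsymbol{\mathfrak{H}}).$$

By definition, the fields H and $\boldsymbol{\mathfrak{H}}$ are produced by the same currents j, the distribution of
which over the volume of the conductors is (see §30) independent of the field which they
produce, i.e. is independent of the presence or absence of magnetic substances in the
surrounding medium. Hence curl H and curl $\boldsymbol{\mathfrak{H}}$ are both equal to $4\pi\mathbf{j}/c$, and so
$\operatorname{curl}(\mathbf{H}-\boldsymbol{\mathfrak{H}}) = 0$. The integral of $\operatorname{div}[\delta\boldsymbol{\mathfrak{A}}\times(\mathbf{H}-\boldsymbol{\mathfrak{H}})]$ is transformed into an integral over
an infinitely distant surface, and so vanishes.

Similarly, we see that the second term on the right of (32.2) is zero; thus

$$\delta\tilde{\mathscr{F}} = -\int(\mathbf{B}-\mathbf{H})\cdot\delta\boldsymbol{\mathfrak{H}}\,dV/4\pi = -\int\mathbf{M}\cdot\delta\boldsymbol{\mathfrak{H}}\,dV. \tag{32.3}$$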


The expression which we have obtained for $\delta\tilde{\mathscr{F}}$ is exactly similar to (11.3) for the
electrostatic problem. In particular, in a uniform magnetic field $\boldsymbol{\mathfrak{H}}$ we have for $d\tilde{\mathscr{F}}$ an
expression analogous to (11.5):

$$d\tilde{\mathscr{F}} = -\mathscr{S}\,dT - \boldsymbol{\mathscr{M}}\cdot d\boldsymbol{\mathfrak{H}}, \tag{32.4}$$

where $\boldsymbol{\mathscr{M}}$ is the total magnetic moment of the body.


† In §11 we took the integration in (11.1) over all space except the volume occupied by the charged
conductors producing the field. This was possible because there is no electric field in a conductor. There is a
magnetic field, however, inside the conductors which carry the currents, and it cannot be excluded in calculating
the total free energy.

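Without repeating the subsequent calculations, we shall write down the following
formulae by analogy with those in §11. If the linear relation $\mathbf{B} = \mu\mathbf{H}$ holds, we have

$$\tilde{\mathscr{F}} - \mathscr{F}_0(V,T) = -\tfrac12\int\mathbf{M}\cdot\boldsymbol{\mathfrak{H}}\,dV. \tag{32.5}$$

In particular, if the external field is homogeneous, then

$$\tilde{\mathscr{F}} - \mathscr{F}_0(V,T) = -\tfrac12\,\boldsymbol{\mathscr{M}}\cdot\boldsymbol{\mathfrak{H}}. \tag{32.6}$$

In the general case of an arbitrary relation between B and H, $\tilde{\mathscr{F}}$ can be calculated from the
formula

$$\tilde{\mathscr{F}} = \int\left(\tilde F + \frac{\mathbf{H}\cdot\mathbf{B}}{8\pi} - \tfrac12\,\mathbf{M}\cdot\boldsymbol{\mathfrak{H}}\right)dV, \tag{32.7}$$

which is analogous to (11.12) for dielectrics.

In §11 we gave also the simpler formulae obtained when the dielectric susceptibility is
small. The analogous case for the magnetic problem is especially important because, as
mentioned above, the magnetic susceptibility of the majority of bodies is indeed small. In
this case

$$\tilde{\mathscr{F}} - \mathscr{F}_0 = -\tfrac12\chi\int\mathfrak{H}^2\,dV. \tag{32.8}$$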

We can also derive results for the magnetic field analogous to those obtained in §14.
These concern the change in the thermodynamic quantities resulting from an infinitesimal
change in the magnetic permeability $\mu$, the field sources being assumed unchanged. It is
clear from the foregoing that we must consider the change in $\tilde{\mathscr{F}}$, and not that in $\mathscr{F}$ as in
§14. We shall not repeat the derivation, which is similar to that of (14.1), but merely give the
result:

$$\delta\tilde{\mathscr{F}} = -\int\delta\mu\,H^2\,dV/8\pi. \tag{32.9}$$

In §14 we used the formula (11.7), an analogue of (32.5), to deduce that the dielectric
susceptibility of any substance is positive. In the magnetic case we cannot draw this
conclusion, and the magnetic susceptibility may be of either sign. The reason for this
marked difference is that the Hamiltonian of a system of charges moving in a magnetic field
contains not only terms linear in the field (as in the electric case) but also quadratic terms.
Hence, in determining the change in the free energy of the body in the magnetic field by
means of perturbation theory as in (14.2), we have a contribution in the first approximation
as well as the second. In such a case no general conclusion can be drawn concerning the sign
of the variation. The susceptibility is positive for paramagnetic bodies and negative for diamagnetic ones.

In §14 we also drew conclusions concerning the direction of motion of bodies in an
electric field. Similar conclusions follow from (32.9), but, since $\mu$ may be either greater or
less than 1, there is no universal result. For example, in an almost uniform field
paramagnetic bodies ($\mu > 1$) move in the direction of H increasing, and diamagnetic bodies
($\mu < 1$) in the opposite direction.
§33. The energy of a system of currents


Let us consider a system of conductors with currents flowing in them and assume that
neither the conductors nor the medium surrounding them are ferromagnetic, so that
$\mathbf{B} = \mu\mathbf{H}$ everywhere. According to §31, the total free energy of the system is given in terms
of the magnetic field of the currents by

$$\mathscr{F} = \int\mathbf{H}\cdot\mathbf{B}\,dV/8\pi. \tag{33.1}$$

Here we omit the quantity $\mathscr{F}_0$, which is a constant (at a given temperature) and is not
related to the currents. The integration in (33.1) is taken over all space, both inside and
outside the conductors.

The same energy can also be expressed in terms of the currents by means of the integral

$$\mathscr{F} = \int\mathbf{A}\cdot\mathbf{j}\,dV/2c; \tag{33.2}$$

cf. the derivation of (31.8) from (31.2). Here the integration extends only over the
conductors, because j = 0 outside them.

Since the field equations are linear, the magnetic field can be written as the sum of the
fields resulting from each current alone with no current in the other conductors: $\mathbf{H} = \sum_a\mathbf{H}_a$.
Then the total free energy (33.1) is

$$\mathscr{F} = \sum_a\mathscr{F}_{aa} + \sum_{a>b}\mathscr{F}_{ab}, \tag{33.3}$$

where

$$\mathscr{F}_{aa} = \int\mathbf{H}_a\cdot\mathbf{B}_a\,dV/8\pi, \qquad \mathscr{F}_{ab} = \int\mathbf{H}_a\cdot\mathbf{B}_b\,dV/4\pi. \tag{33.4}$$

We have put $\mathscr{F}_{ab} = \mathscr{F}_{ba}$, since $\mathbf{H}_a\cdot\mathbf{B}_b = \mu\mathbf{H}_a\cdot\mathbf{H}_b = \mathbf{H}_b\cdot\mathbf{B}_a$, where $\mu$ is the magnetic
permeability at any point. The quantity $\mathscr{F}_{aa}$ may be called the free self-energy of the current
in the ath conductor, and $\mathscr{F}_{ab}$ the interaction energy of the ath and bth conductors. It
should be borne in mind, however, that these names are strictly correct only if the magnetic
properties of both the conductors and the medium are neglected. Otherwise the field, and
therefore the energy, of each current depend on the position and magnetic permeability of
the other conductors.

The quantities (33.4) can also be expressed in terms of the currents in each conductor,
in accordance with formula (33.2):

$$\mathscr{F}_{aa} = \int\mathbf{j}_a\cdot\mathbf{A}_a\,dV_a/2c, \qquad \mathscr{F}_{ab} = \int\mathbf{j}_a\cdot\mathbf{A}_b\,dV_a/c = \int\mathbf{j}_b\cdot\mathbf{A}_a\,dV_b/c. \tag{33.5}$$

The integral in $\mathscr{F}_{aa}$ is here taken only over the volume of the ath conductor; $\mathscr{F}_{ab}$ can be
written as either of the two expressions, in which the integration is over the volume of the
ath and bth conductor respectively.

When the distribution of the current density over the volume of the conductor is given,
$\mathscr{F}_{aa}$ depends only on the total current $J_a$ passing through a cross-section. Both the current
density $\mathbf{j}_a$ and the field which it produces will be proportional to $J_a$. Hence the integral $\mathscr{F}_{aa}$ is
proportional to $J_a^2$, and we write it

$$\mathscr{F}_{aa} = L_{aa}J_a^2/2c^2, \tag{33.6}$$

where $L_{aa}$ is called the self-inductance of the conductor. Similarly, the interaction energy of
two currents is proportional to the product $J_aJ_b$:

$$\mathscr{F}_{ab} = L_{ab}J_aJ_b/c^2. \tag{33.7}$$

The quantity $L_{ab}$ is called the mutual inductance of the conductors. Thus the total free
energy of a system of currents is

$$\mathscr{F} = \frac{1}{2c^2}\sum_a L_{aa}J_a^2 + \frac{1}{c^2}\sum_{a>b}L_{ab}J_aJ_b = \frac{1}{2c^2}\sum_a\sum_b L_{ab}J_aJ_b. \tag{33.8}$$

The condition that this quadratic form should be positive definite places
certain restrictions on the values of the coefficients. In particular $L_{aa} > 0$ for all a, and
$L_{aa}L_{bb} > L_{ab}^2$.

The calculation of the energy of currents in the general case of arbitrary three- 
dimensional conductors requires a complete solution of the field equations, and is a 
difficult problem. It becomes simpler if the magnetic permeability of both the conductors 
and the surrounding medium can be taken as unity. It should be noted that the energy of 
the currents is then no longer dependent on the thermodynamic state (in particular, on the 
temperature) of the bodies, and hence the free energy in the above formulae may be 
referred to simply as the energy. 
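
The quadratic form for the energy of a system of currents, and the restrictions which positive definiteness places on the inductances, can be sketched as follows. The function names `free_energy` and `satisfies_restrictions`, the nested-list representation of the inductance coefficients and the default `c = 1.0` are assumptions of the example. The pairwise test is only the necessary condition quoted for pairs of conductors, not a complete test of positive definiteness for more than two conductors.

```python
def free_energy(L, J, c=1.0):
    """Total free energy (1/2c^2) * sum_a sum_b L[a][b] * J[a] * J[b].

    L is a symmetric matrix of inductances (nested lists), J a list of currents.
    """
    n = len(J)
    total = sum(L[a][b] * J[a] * J[b] for a in range(n) for b in range(n))
    return total / (2.0 * c * c)


def satisfies_restrictions(L):
    """Check L_aa > 0 for all a and L_aa * L_bb > L_ab**2 for every pair a != b."""
    n = len(L)
    if any(L[a][a] <= 0 for a in range(n)):
        return False
    return all(L[a][a] * L[b][b] > L[a][b] ** 2
               for a in range(n) for b in range(a + 1, n))


# Two circuits with self-inductances 2 and 3 and mutual inductance 1
# (arbitrary illustrative numbers).
L_ok = [[2.0, 1.0], [1.0, 3.0]]
energy = free_energy(L_ok, [1.0, -2.0])

# A matrix violating L_11 * L_22 > L_12**2 allows a negative "energy".
L_bad = [[1.0, 2.0], [2.0, 1.0]]
energy_bad = free_energy(L_bad, [1.0, -1.0])
```

For the first matrix the energy is positive for any choice of currents, while for the second the currents $J_1 = -J_2$ give a negative value, which shows why such coefficients cannot occur.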

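For $\mu = 1$ the vector potential of the field due to the currents j is given by formula (30.12).
Hence the self-energy of the ath conductor is

$$\mathscr{F}_{aa} = \frac{1}{2c^2}\int\!\!\int\frac{\mathbf{j}\cdot\mathbf{j}'}{R}\,dV\,dV', \tag{33.9}$$

where both integrations are taken over the volume of the conductor considered, and R is
the distance between dV and dV'. Similarly, the mutual energy of two conductors is

$$\mathscr{F}_{ab} = \frac{1}{c^2}\int\!\!\int\frac{\mathbf{j}_a\cdot\mathbf{j}_b}{R}\,dV_a\,dV_b, \tag{33.10}$$

where $dV_a$ and $dV_b$ are volume elements in the two conductors.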

The mutual energy of two linear currents is particularly easy to calculate. In formula
(33.10) we change from volume currents to linear ones by replacing $\mathbf{j}_a\,dV_a$ and $\mathbf{j}_b\,dV_b$ by
$J_a\,d\mathbf{l}_a$ and $J_b\,d\mathbf{l}_b$ respectively, and we find that the mutual inductance is $L_{ab} = \oint\oint d\mathbf{l}_a\cdot d\mathbf{l}_b/R$.
In this approximation, therefore, $L_{ab}$ depends only on the shape, size and relative position
of the two currents, and not on the distribution of current over the cross-section of each
wire. It must be emphasized that this simple formula can be obtained for linear currents
without imposing the condition that $\mu = 1$. In the approximation where the thickness of the
wires is neglected, their magnetic properties have no effect on the field which they produce,
and therefore no effect on their mutual energy. If the magnetic permeability $\mu_e$ of the
medium surrounding the wires is different from unity, the vector potential is, by (30.15),
simply multiplied by $\mu_e$, and therefore so is the magnetic induction. The mutual inductance
is therefore multiplied by the same factor, so that

$$L_{ab} = \mu_e\oint\oint d\mathbf{l}_a\cdot d\mathbf{l}_b/R. \tag{33.11}$$

The self-inductance of linear conductors is much more difficult to calculate; we shall
discuss it in §34.
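
The double contour integral for the mutual inductance of two linear currents lends itself to direct numerical evaluation. The sketch below is an illustration only: it approximates two coaxial circular circuits by polygons and sums $d\mathbf{l}_a\cdot d\mathbf{l}_b/R$ over all pairs of segments. The function names, the polygonal discretization and the test geometry are assumptions of the example. For circuits of radii a and b far apart on a common axis the result is compared with the dipole estimate $2\pi^2a^2b^2/d^3$, which follows from the field on the axis of a circular current multiplied by the area of the second circuit.

```python
import math


def circular_loop(radius, z, n):
    """Segments (midpoint, dl) of an n-sided polygon inscribed in a circle.

    The circle lies in the plane of constant z with its centre on the z-axis.
    """
    pts = [(radius * math.cos(2.0 * math.pi * i / n),
            radius * math.sin(2.0 * math.pi * i / n),
            z) for i in range(n)]
    segments = []
    for i in range(n):
        p, q = pts[i], pts[(i + 1) % n]
        mid = tuple((u + v) / 2.0 for u, v in zip(p, q))
        dl = tuple(v - u for u, v in zip(p, q))
        segments.append((mid, dl))
    return segments


def mutual_inductance(loop_a, loop_b, mu_e=1.0):
    """Discretized mu_e * (double contour integral of dl_a . dl_b / R)."""
    total = 0.0
    for mid_a, dl_a in loop_a:
        for mid_b, dl_b in loop_b:
            R = math.dist(mid_a, mid_b)
            total += (dl_a[0] * dl_b[0] + dl_a[1] * dl_b[1] + dl_a[2] * dl_b[2]) / R
    return mu_e * total


# Two coaxial circles of unit radius, a distance d = 50 apart.
loop_a = circular_loop(1.0, 0.0, 120)
loop_b = circular_loop(1.0, 50.0, 120)
L_numeric = mutual_inductance(loop_a, loop_b)
L_dipole = 2.0 * math.pi ** 2 / 50.0 ** 3
```

The two values agree to within a fraction of one per cent at this separation. The dependence on the surrounding medium enters only through the overall factor `mu_e`.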

The total energy of a system of linear currents can be written in still another form. To do
this, we return to the integral (33.2), which for linear currents becomes

$$\mathscr{F} = \frac{1}{2c}\sum_a J_a\oint\mathbf{A}\cdot d\mathbf{l}_a, \tag{33.12}$$

where A is the vector potential of the total field at the element $d\mathbf{l}_a$ of the ath conductor. The
main error in going from (33.2) to (33.12) arises from neglecting the change in the field
(including the field of the current considered) over the cross-section of the wire. Each of the
contour integrals in (33.12) can be transformed into a surface integral:

$$\oint\mathbf{A}\cdot d\mathbf{l}_a = \int\operatorname{curl}\mathbf{A}\cdot d\mathbf{f}_a = \int\mathbf{B}\cdot d\mathbf{f}_a,$$

i.e. it is the flux of the magnetic induction or magnetic flux through the circuit of the ath
current. We denote this flux by $\Phi_a$. Then

$$\mathscr{F} = \frac{1}{2c}\sum_a J_a\Phi_a. \tag{33.13}$$

Similarly, the free energy of a linear current J in an external magnetic field, i.e. the energy
without the self-energy of the field sources, can be expressed in terms of the magnetic flux.
Evidently

$$\mathscr{F} = J\Phi/c, \tag{33.14}$$

where $\Phi$ is the flux of the external field through the circuit of the current J. If the external
field is uniform, and $\mu = 1$ in the external medium, then $\Phi = \boldsymbol{\mathfrak{H}}\cdot\int d\mathbf{f}$. Introducing the
magnetic moment of the current in accordance with (30.18), we have $\mathscr{F} = \boldsymbol{\mathscr{M}}\cdot\boldsymbol{\mathfrak{H}}$.

Knowing the energy of a system of currents as a function of their shape, size and relative
position, we can determine the forces on the conductors by simply differentiating with
respect to the appropriate coordinates. Here, however, the question arises which
characteristics of the currents should be kept constant in the differentiation. It is most
convenient to differentiate at constant current. In this case the free energy is represented by
$\tilde{\mathscr{F}}$, and so the generalized force $F_q$ in the direction of a generalized coordinate q is
$F_q = -(\partial\tilde{\mathscr{F}}/\partial q)_{J,T}$. The suffixes show that the differentiation is effected at constant current
and constant temperature. Since we omit the term independent of the currents in the free
energy, $\mathscr{F}$ and $\tilde{\mathscr{F}}$ differ only in sign, and so

$$F_q = \left(\frac{\partial\mathscr{F}}{\partial q}\right)_J = \frac{1}{2c^2}\sum_a\sum_b J_aJ_b\frac{\partial L_{ab}}{\partial q}; \tag{33.15}$$

here and henceforward the suffix T to the derivatives is omitted, for brevity.

In particular, the forces exerted on a conductor by its own magnetic field are given by the
formula

$$F_q = \frac{J^2}{2c^2}\frac{\partial L}{\partial q}, \tag{33.16}$$

where L is the self-inductance of the conductor. The nature of these forces can be seen as
follows. For given current (and temperature), $\tilde{\mathscr{F}}$ tends to a minimum. Since
$\tilde{\mathscr{F}} = -LJ^2/2c^2$, this means that the forces on the conductor will tend to increase its
self-inductance. The latter, having the dimensions of length, must be proportional to the
dimension of the conductor. Thus the effect of the magnetic field is to increase the size of
the conductor.

For a current in an external magnetic field we have†

$$\tilde{\mathscr{F}} = -\boldsymbol{\mathscr{M}}\cdot\boldsymbol{\mathfrak{H}}. \tag{33.17}$$


† The factor $\tfrac12$ which appears in (32.6) is absent in (33.17) because the magnetic moment of the current in the
latter equation is independent of the field, whereas the magnetic moment in (32.6) is itself due to the field.
In all the above formulae for the energy it is assumed that there is a linear relation
between the magnetic field and induction. In the general case where this relation is
arbitrary, analogous differential relations can be set up. The change in the free energy
resulting from an infinitesimal change in the field (at constant temperature) is, by (31.8),
$\delta\mathscr{F} = \int\mathbf{j}\cdot\delta\mathbf{A}\,dV/c$ or, for a system of linear currents,

$$\delta\mathscr{F} = \frac{1}{c}\sum_a J_a\oint\delta\mathbf{A}\cdot d\mathbf{l}_a.$$

Proceeding as in the derivation of (33.13) from (33.12), we have†

$$\delta\mathscr{F} = \frac{1}{c}\sum_a J_a\,\delta\Phi_a. \tag{33.18}$$

Similarly, we find from (31.9)

$$\delta\tilde{\mathscr{F}} = -\frac{1}{c}\sum_a\Phi_a\,\delta J_a. \tag{33.19}$$

Thus we can say that, for a system of linear currents, $\mathscr{F}$ is the thermodynamic potential
with respect to the magnetic fluxes, and $\tilde{\mathscr{F}}$ with respect to the currents, the two potentials
being related by

$$\tilde{\mathscr{F}} = \mathscr{F} - \frac{1}{c}\sum_a J_a\Phi_a. \tag{33.20}$$

Whatever the magnetic properties of the substance, therefore, the thermodynamic
relations

$$J_a/c = \partial\mathscr{F}/\partial\Phi_a, \qquad \Phi_a/c = -\partial\tilde{\mathscr{F}}/\partial J_a \tag{33.21}$$

hold. If these formulae are applied to the case where the field and induction are linearly
related, so that $\mathscr{F}$ is given by (33.8) and $\tilde{\mathscr{F}} = -\mathscr{F}$, we obtain

$$\Phi_a = \frac{1}{c}\sum_b L_{ab}J_b. \tag{33.22}$$

Thus the inductances are the coefficients of proportionality between the magnetic fluxes
and the currents which produce the magnetic field. The product $L_{ab}J_b/c$ is the magnetic
flux through the circuit of the current $J_a$ due to the current $J_b$ ($b \neq a$), and $L_{aa}J_a/c$ is that
due to the current $J_a$ itself.

† There is an obvious analogy between formulae (33.18) and (10.13) in the magnetic and electric cases
respectively. In the magnetic case, the induction fluxes play the role of charges. The analogy has a clear physical
interpretation. Just as the electric field can be maintained without any external energy supply, by charges on
insulated conductors, the magnetic field can be maintained, without any external energy supply, by
superconducting solenoids across which the magnetic fluxes remain constant. It is therefore not surprising that
the change in the free energy $\mathscr{F}$ in the electric and magnetic cases is governed by the changes in the charges and in
the induction fluxes respectively.

§34. The self-inductance of linear conductors

In calculating the self-inductance of a linear conductor its thickness cannot be entirely
neglected as it was in calculating the mutual inductance of two conductors. If it were, we
should obtain from (33.9) the self-inductance $L = \oint\oint d\mathbf{l}\cdot d\mathbf{l}'/R$, where both integrals are
taken along the same circuit, and this integral is logarithmically divergent as $R \to 0$.

The exact value of the self-inductance of a conductor depends on the distribution of
current in it, which may vary with the manner of excitation of the current, i.e. with the
manner of application of the electromotive force. For a linear conductor, however, the
self-inductance does not, to a fairly high accuracy, depend on the distribution of current over
the cross-section.†

Let us write the self-inductance as $L = L_e + L_i$, where $L_e$ and $L_i$ result from the
magnetic field energy outside and inside the conductor respectively. For a linear conductor,
the “external” part $L_e$ makes the main contribution to the self-inductance. This is because
most of the magnetic energy of a closed linear circuit resides in the field at distances from
the wire large compared with its thickness. For the energy per unit length of an infinite
straight wire is

$$(\mu_e/8\pi)\int H^2\cdot2\pi r\,dr = (\mu_e/8\pi)\int(2J/cr)^2\cdot2\pi r\,dr = (\mu_eJ^2/c^2)\int dr/r,$$

where r is the distance from the axis of the wire and $\mu_e$ the magnetic permeability of the
external medium. This integral diverges logarithmically for large r. For a closed linear
circuit, of course, this divergence disappears, because the integral is “cut off” at distances of
the order of the dimension of the circuit. We obtain an approximate value for the energy on
multiplying this integral by the total length l of the wire, and taking l as the upper limit and
the radius a of the wire as the lower limit. The result is $(\mu_eJ^2l/c^2)\log(l/a)$, and hence the
self-inductance is

$$L = 2\mu_el\log(l/a). \tag{34.1}$$

This expression is said to be of logarithmic accuracy: its relative error is of the order of
$1/\log(l/a)$, and the ratio l/a is assumed to be so large that its logarithm is large.‡
A particular case of a linear conductor is a solenoid, which consists of a wire wound in a
helix, with the turns very close together. Neglecting the thickness of the wire and the
distance between the turns, we have simply a conducting cylindrical surface with a “surface”
conduction current on it. The equation curl H = $4\pi\mathbf{j}/c$ within the conductor is here
replaced by the boundary condition

$$\mathbf{n}\times(\mathbf{H}_2-\mathbf{H}_1) = 4\pi\mathbf{g}/c, \tag{34.2}$$

where g is the surface current density, $\mathbf{H}_1$ and $\mathbf{H}_2$ the fields on each side of the surface, and n
the unit normal vector into medium 2; cf. the derivation of (29.16).

If the solenoid is an infinitely long cylinder, the magnetic field which it produces can be
found very simply. The surface currents flow in circles, and their density $g = nJ$, where J is
the current in the wire and n the number of turns per unit length of the solenoid. The field
outside the cylinder is zero; the field inside is uniform and along the axis of the cylinder, and
is $H = 4\pi nJ/c$. For this field evidently satisfies the equations div H = 0, curl H = 0 in all
space outside the conducting surface, and also the boundary condition (34.2) at that
surface.

Accordingly, the field energy per unit length of the cylinder is

$$(\mu_eH^2/8\pi)\,\pi b^2 = 2\pi^2\mu_eb^2n^2J^2/c^2,$$

where b is the radius of the cylinder and $\mu_e$ pertains to the material within the solenoid.
Neglecting the end effects, we can apply this formula also to a solenoid whose length h is
finite, but large compared with b. Then the self-inductance is

$$L = 4\pi^2\mu_en^2b^2h = 2\pi\mu_enbl, \tag{34.3}$$

where $l = 2\pi bnh$ is the total length of the wire. The greater self-inductance of a solenoid as
compared with that of a straight wire of equal length (cf. (34.1)) is, of course, due to the
mutual induction between adjoining turns.


† More precisely, it is independent of the distribution of current provided that the current density varies
appreciably only over distances comparable with the thickness a of the wire. If, however, the distribution is such
that the current density varies appreciably over distances small compared with a (as happens, for particular
reasons, in the skin effect and in superconductors), then the self-inductance does depend on the distribution.

‡ The assertion made above that the self-inductance is independent of the current distribution actually applies
not only to the approximation (34.1) but also to the next approximation, in which terms not containing the large
logarithm are included (or, what is the same thing, the coefficient of l/a is included in the argument of the
logarithm): see the Problems at the end of this section.


PROBLEMS†


Problem 1. Determine the self-inductance of a closed circuit of thin wire of circular cross-section. 


Solution. The magnetic field in the wire can be taken to be the same as that inside an infinite straight
cylinder: $H = 2Jr/ca^2$, where r is the distance from the axis of the wire and a its radius. Hence we find the internal
part of the self-inductance:

$$L_i = \frac{2c^2}{J^2}\cdot\frac{\mu_i}{8\pi}\int H^2\,dV = \tfrac12\mu_il,$$

where l is the length of the wire.

To calculate $L_e$, we notice that the field outside a thin wire is independent of the distribution of current over its
cross-section. In particular, the energy of the external magnetic field is unchanged if we assume that the current
flows only on the surface of the wire. The field inside the wire is then zero, and $L_e$ may be calculated from the total
energy, using formula (33.2). On account of the assumed surface distribution of the current, the integral in this
formula becomes a line integral along the axis of the wire, and so the external part of the self-inductance is

$$L_e = \frac{2c^2}{J^2}\cdot\frac{J}{2c}\oint[\mathbf{A}]_{r=a}\cdot d\mathbf{l} = \frac{c}{J}\oint[\mathbf{A}]_{r=a}\cdot d\mathbf{l},$$

where the value of A in the integrand is taken at the surface of the wire. In obtaining this formula we also use the
fact that, in the approximation used here, the field is constant along the perimeter of a cross-section.

Having reduced the problem to that of calculating A for r = a, we now make a different assumption concerning
the current distribution, namely that the whole current J flows along the axis of the wire. The field on the surface
of the wire is, in the approximation considered, unchanged by this assumption (nor would it be changed for a
straight wire of circular cross-section). Then, by formula (30.14), we have

$$\mathbf{A} = \frac{J}{c}\oint\frac{d\mathbf{l}}{R},$$

where R is the distance from the element dl of the axis to a given point on the surface of the wire. We divide the
integral into two parts, one for which $R > \Delta$ and the other for which $R < \Delta$, where $\Delta$ is a distance small compared
with the dimension of the circuit but large compared with the radius a of the wire.‡ In the integral where $R > \Delta$, a
may be neglected and R taken simply as the distance between two points on the circuit. The vector integral where
$R < \Delta$ may be assumed to be along the tangent at the point considered. Denoting by t the unit vector in that
direction, we have

$$\int_{R<\Delta}\frac{d\mathbf{l}}{R} = \mathbf{t}\int_{-\Delta}^{\Delta}\frac{dl}{\sqrt{l^2+a^2}} \cong 2\mathbf{t}\log(2\Delta/a).$$

This expression can be written as the integral

$$\int_{\Delta>R>a/2}d\mathbf{l}/R,$$

where R is again the distance between points on the circuit. Adding the integrals for $R > \Delta$ and $R < \Delta$, we
obtain

$$\mathbf{A} = \frac{J}{c}\int_{R>a/2}\frac{d\mathbf{l}}{R},$$

from which the arbitrary parameter $\Delta$ has disappeared, as it should.
The final result is therefore

$$L_e = \oint\oint_{R>a/2}\frac{d\mathbf{l}\cdot d\mathbf{l}'}{R}. \tag{2}$$

The integration here extends over all pairs of points on the circuit whose distance apart exceeds $\tfrac12a$.

† In Problems 1-6 we assume $\mu_e = 1$.

‡ A similar procedure was used to calculate the capacitance of a thin ring in §2, Problem 4.

Problem 2. Determine the self-inductance of a thin wire ring (with radius b) of circular cross-section (with 
radius a). 

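Solution. The integrand in (2), Problem 1, depends only on the central angle $\phi$ subtended by the chord $R$,
and $R = 2b\sin\tfrac{1}{2}\phi$, while $d\mathbf{l}\cdot d\mathbf{l}' = dl\,dl'\cos\phi$. Hence

$$L_e = 2\int_{\phi_0}^{\pi}\frac{\cos\phi\cdot 2\pi b\cdot b\,d\phi}{2b\sin\tfrac{1}{2}\phi} = 4\pi b\left[-\log\tan\tfrac{1}{4}\phi_0 - 2\cos\tfrac{1}{2}\phi_0\right].$$

The lower limit of integration is determined from $2b\sin\tfrac{1}{2}\phi_0 = \tfrac{1}{2}a$, whence $\phi_0 \cong a/2b$. Substituting this value and
adding $L_i = \tfrac{1}{2}\mu_i\cdot 2\pi b = \pi b\mu_i$, we have to the required accuracy

$$L = 4\pi b\left[\log(8b/a) - 2 + \tfrac{1}{4}\mu_i\right].$$

In particular, for $\mu_i = 1$ we obtain

$$L = 4\pi b\left[\log(8b/a) - \tfrac{7}{4}\right].$$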
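The result of Problem 2 can be checked numerically. The following is only a sketch under stated assumptions: it evaluates the angular integral for $L_e$ by Simpson's rule (in the variable $u = \log\phi$, a choice made here to handle the steep behaviour near $\phi_0$) and compares it with the closed expression. The function names and the sample values of $b$ and $a$ are hypothetical, and Gaussian units are used, so that $L$ has the dimensions of length.

```python
import math

def ring_external_inductance(b, a, n=4000):
    """External part L_e for a ring of radius b and wire radius a.

    Evaluates L_e = 2*pi*b * integral_{phi0}^{pi} cos(phi)/sin(phi/2) dphi
    with Simpson's rule in u = log(phi). n must be even.
    """
    phi0 = 2.0 * math.asin(a / (4.0 * b))  # from 2 b sin(phi0/2) = a/2
    u_lo, u_hi = math.log(phi0), math.log(math.pi)
    step = (u_hi - u_lo) / n

    def integrand(u):
        phi = math.exp(u)
        # the extra factor phi comes from dphi = phi du
        return math.cos(phi) / math.sin(0.5 * phi) * phi

    total = integrand(u_lo) + integrand(u_hi)
    for k in range(1, n):
        total += (4.0 if k % 2 else 2.0) * integrand(u_lo + k * step)
    return 2.0 * math.pi * b * total * step / 3.0

def ring_inductance_closed_form(b, a, mu_i=1.0):
    """L = 4 pi b [log(8b/a) - 2 + mu_i/4], valid for a << b."""
    return 4.0 * math.pi * b * (math.log(8.0 * b / a) - 2.0 + 0.25 * mu_i)

b, a = 10.0, 0.1  # hypothetical dimensions (cm)
internal = 0.5 * 1.0 * 2.0 * math.pi * b  # L_i = mu_i * l / 2 with mu_i = 1
print(ring_external_inductance(b, a) + internal)
print(ring_inductance_closed_form(b, a))
```

The two printed values should agree up to terms of relative order $(a/b)^2$, which is the accuracy of the closed formula.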

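Problem 3. Determine the extension of a ring of wire (with $\mu_i = 1$) under the action of the magnetic field of a
current flowing in it.

Solution. The internal stresses $\sigma_\parallel$ along the axis of the wire and $\sigma_\perp$ perpendicular to it are, by (33.16), given by

$$\pi a^2\sigma_\parallel = \frac{J^2}{2c^2}\frac{\partial L}{\partial(2\pi b)},\qquad 2\pi a\cdot 2\pi b\,\sigma_\perp = \frac{J^2}{2c^2}\frac{\partial L}{\partial a}.$$

Substituting $L$ from Problem 2, we have

$$\sigma_\parallel = \frac{J^2}{\pi a^2c^2}\left[\log\frac{8b}{a} - \frac{3}{4}\right],\qquad \sigma_\perp = -\frac{J^2}{2\pi a^2c^2}.$$

Hence the required relative extension of the ring is

$$\frac{\Delta b}{b} = \frac{1}{E}(\sigma_\parallel - 2\sigma\sigma_\perp) = \frac{J^2}{\pi a^2c^2E}\left[\log\frac{8b}{a} - \frac{3}{4} + \sigma\right],$$

where $E$ is Young's modulus and $\sigma$ Poisson's ratio for the wire; see TE, §5.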


Problem 4. Determine the self-inductance per unit length of a system of two parallel straight wires (with
$\mu_i = 1$) having circular cross-sections of radii $a$ and $b$, with their axes a distance $h$ apart, and carrying equal
currents $J$ in opposite directions (Fig. 19).

Fig. 19

Solution. The vector potential of the magnetic field of each current is parallel to the axes of the wires, and so
the two vector potentials can be added algebraically. For the magnetic field of wire 1, with a uniformly distributed
current $+J$, we have in cylindrical polar coordinates

$$A_z = \frac{J}{c}\left(C - \frac{r^2}{a^2}\right)\ \text{for } r < a,\qquad A_z = \frac{J}{c}\left(C - 1 - 2\log\frac{r}{a}\right)\ \text{for } r > a,$$

where $C$ is an arbitrary constant; $A_z$ is continuous at the surface of the wire. The formulae for wire 2 are obtained
by substituting $b$ for $a$ and changing the sign of $J$. Integration over the cross-section of wire 1 in formula (33.2)
gives

$$\frac{1}{2c}\int\left(A_z^{(1)} + A_z^{(2)}\right)j\,df = \frac{J^2}{2c^2}\left[\frac{1}{2} + 2\log\frac{h}{b}\right].$$

The integration over the cross-section of wire 2 gives the same thing with $a$ in place of $b$. The required
self-inductance per unit length of the double wire is therefore

$$L = 1 + 2\log(h^2/ab).$$

Problem 5. Determine the self-inductance of a toroidal solenoid. 


Solution. We regard the solenoid as a toroidal conducting surface carrying surface currents with density
$g = NJ/2\pi r$, where $N$ is the total number of turns and $J$ the current; the coordinates and dimensions are as shown
in Fig. 20 (p. 126). The magnetic field outside the solenoid is zero, and inside the solenoid $H_r = H_z = 0$,
$H_\phi = 2NJ/cr$, where $r$, $z$, $\phi$ are cylindrical polar coordinates; for this solution satisfies the equations
$\operatorname{div}\mathbf{H} = 0$, $\operatorname{curl}\mathbf{H} = 0$ and the boundary condition (34.2).† The energy of the magnetic field in the solenoid is

$$\int(H^2/8\pi)\,dV = (N^2J^2/c^2)\oint z\,dr/r,$$

where the integration is taken along the perimeter of the cross-section, and is easily effected by putting $z = a\sin\theta$,
$r = b + a\cos\theta$. The self-inductance is found to be

$$L = 4\pi N^2\left[b - \sqrt{b^2 - a^2}\right].$$
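As a hedged numerical check of the toroidal result, the perimeter integral can be evaluated with the parametrization $z = a\sin\theta$, $r = b + a\cos\theta$. The sketch below uses the trapezoidal rule on the periodic integrand and the relation $L = 2N^2\int\!\!\int dr\,dz/r$, which follows from equating the field energy to $LJ^2/2c^2$. The function names and sample dimensions are hypothetical.

```python
import math

def toroid_inductance_numeric(n_turns, b, a, m=2000):
    """L = 2 N^2 * (integral of dr dz / r over the circular cross-section).

    The area integral is written as the perimeter integral of |z dr| / r with
    z = a sin(theta), r = b + a cos(theta), and summed by the trapezoidal rule.
    """
    dtheta = 2.0 * math.pi / m
    area_integral = 0.0
    for k in range(m):
        theta = k * dtheta
        area_integral += (a * math.sin(theta)) ** 2 / (b + a * math.cos(theta))
    area_integral *= dtheta
    return 2.0 * n_turns ** 2 * area_integral

def toroid_inductance_closed_form(n_turns, b, a):
    """L = 4 pi N^2 [b - sqrt(b^2 - a^2)] for a circular cross-section."""
    return 4.0 * math.pi * n_turns ** 2 * (b - math.sqrt(b * b - a * a))

N, b, a = 500, 4.0, 2.0  # hypothetical turns and dimensions (cm)
print(toroid_inductance_numeric(N, b, a))
print(toroid_inductance_closed_form(N, b, a))
```

For $a \ll b$ both expressions reduce to $2\pi N^2a^2/b$, which is what the long-solenoid formula gives for a solenoid of length $2\pi b$ and cross-section $\pi a^2$.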

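† It is valid also for an annular solenoid with any cross-section.

Problem 6. Determine the end-effect correction of order $b/h$ to the expression (34.3) (with $\mu_e = 1$) for the
self-inductance of a cylindrical solenoid.

Solution. The self-inductance is calculated as a double integral over the surface of the solenoid:

$$L = \frac{1}{J^2}\int\!\!\int\frac{\mathbf{g}_1\cdot\mathbf{g}_2}{R}\,df_1\,df_2,$$

where $g$ is the surface current density ($g = nJ$). In cylindrical polar coordinates

$$L = 2\pi b^2n^2\int_0^{2\pi}\!\!\int_0^h\!\!\int_0^h\frac{\cos\phi\,d\phi\,dz_1\,dz_2}{\sqrt{(z_2 - z_1)^2 + 4b^2\sin^2\tfrac{1}{2}\phi}} = 4\pi b^2n^2\int_0^{2\pi}\!\!\int_0^h\frac{(h - \zeta)\cos\phi\,d\phi\,d\zeta}{\sqrt{\zeta^2 + 4b^2\sin^2\tfrac{1}{2}\phi}},$$

where $\phi$ is the angle between the diametral planes through $df_1$ and $df_2$, and $\zeta = z_2 - z_1$.
Effecting the integration with respect to $\zeta$, we have for $h \gg b$

$$L \cong 4\pi b^2n^2\int_0^{2\pi}\left[h\log\frac{h}{b\sin\tfrac{1}{2}\phi} - h + 2b\sin\tfrac{1}{2}\phi\right]\cos\phi\,d\phi,$$

and finally

$$L = 4\pi^2b^2n^2\left[h - 8b/3\pi\right].$$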
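The size of the end-effect correction can be illustrated numerically. This is a sketch, not the method of the text: it assumes the double-integral form $L = 4\pi b^2n^2\int_0^{2\pi}\!\int_0^h (h-\zeta)\cos\phi\,d\phi\,d\zeta/\sqrt{\zeta^2 + k^2}$ with $k = 2b\sin\tfrac{1}{2}\phi$, carries out the $\zeta$ integration in closed form, $h\sinh^{-1}(h/k) - \sqrt{h^2 + k^2} + k$, without expanding in $b/h$, and sums over $\phi$ by the midpoint rule. The function name and the sample values are hypothetical.

```python
import math

def solenoid_inductance(b, h, n, n_phi=20000):
    """Self-inductance of a cylindrical solenoid of radius b and length h,
    with n turns per unit length (Gaussian units, mu_e = 1)."""
    dphi = math.pi / n_phi
    total = 0.0
    for i in range(n_phi):
        phi = (i + 0.5) * dphi  # midpoints avoid the logarithmic point phi = 0
        k = 2.0 * b * math.sin(0.5 * phi)
        # closed form of the integral over zeta of (h - zeta)/sqrt(zeta^2 + k^2)
        inner = h * math.asinh(h / k) - math.sqrt(h * h + k * k) + k
        total += inner * math.cos(phi)
    integral = 2.0 * total * dphi  # the integrand is symmetric about phi = pi
    return 4.0 * math.pi * b * b * n * n * integral

b, h, n = 1.0, 50.0, 1.0  # hypothetical values with h >> b
L = solenoid_inductance(b, h, n)
correction = L / (4.0 * math.pi ** 2 * b * b * n * n) - h
print(correction / b, -8.0 / (3.0 * math.pi))
```

The two printed numbers differ only by terms of higher order in $b/h$ and by the discretization error of the midpoint rule.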


Problem 7. Determine the factor by which the self-inductance of a plane circuit changes when it is placed on
the surface of a half-space of magnetic permeability $\mu_e$. The internal part of the self-inductance is neglected.

Solution. It is evident from symmetry that, in the absence of the half-space, the magnetic field of the current
is symmetrical about the plane of the circuit, and the lines of magnetic force cross that plane normally. Let this
field be $\mathbf{H}_0$. We can satisfy the field equations and the boundary conditions on the surface of the half-space by
putting $\mathbf{H} = 2\mu_e\mathbf{H}_0/(\mu_e + 1)$ in the vacuum and $\mathbf{B} = \mu_e\mathbf{H} = 2\mu_e\mathbf{H}_0/(\mu_e + 1)$ in the medium: $B_n$ and $H_t$ are then
continuous at the boundary, and the circulation of $\mathbf{H}$ along any line of force is equal to that of $\mathbf{H}_0$. Hence we
see that, when the medium is present, the total energy of the field, and therefore the self-inductance of the circuit,
are multiplied by $2\mu_e/(\mu_e + 1)$.


§35. Forces in a magnetic field

To determine the forces on matter in a magnetic field hardly any further calculations are
necessary, on account of the complete analogy with electrostatics. The analogy is due
mainly to the fact that the expressions for the thermodynamic quantities in a magnetic field
differ from those for an electric field only in that E and D are replaced by H and B
respectively. In calculating the stress tensor in §15 we used the fact that the electric field
satisfies the equation curl E = 0, and is therefore a potential field. The magnetic field
satisfies the equation

$$\operatorname{curl}\mathbf{H} = 4\pi\mathbf{j}/c, \tag{35.1}$$

which reduces to curl H = 0 only in the absence of conduction currents. In calculating the
stress tensor, however, we must always put j = 0. Since j involves the derivatives of the
magnetic field, an allowance for the currents in calculating the stresses would amount to
adding to the stress tensor the very small corrections due to the non-uniformity of the
field; cf. the second footnote to §15.

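Thus all the formulae obtained in §§15 and 16 for the stress tensor can be applied
immediately for a magnetic field. For example, in a fluid medium with $\mathbf{B} = \mu\mathbf{H}$ we have

$$\sigma_{ik} = -P_0(\rho, T)\delta_{ik} - \frac{H^2}{8\pi}\left[\mu - \rho\left(\frac{\partial\mu}{\partial\rho}\right)_T\right]\delta_{ik} + \frac{\mu H_iH_k}{4\pi}. \tag{35.2}$$

From this the body forces are calculated by the formula $f_i = \partial\sigma_{ik}/\partial x_k$. If the medium is a
conductor carrying a current, the calculation differs from that in §15 in that the equation
curl H = 0 is replaced by (35.1).

Differentiating (35.2) and using also the equation $\operatorname{div}\mathbf{B} = \operatorname{div}(\mu\mathbf{H}) = 0$, we find

$$\mathbf{f} = -\operatorname{grad}P_0 + \frac{1}{8\pi}\operatorname{grad}\left[H^2\rho\left(\frac{\partial\mu}{\partial\rho}\right)_T\right] - \frac{H^2}{8\pi}\operatorname{grad}\mu - \frac{\mu}{8\pi}\operatorname{grad}H^2 + \frac{\mu}{4\pi}(\mathbf{H}\cdot\operatorname{grad})\mathbf{H}.$$

By a well-known formula of vector analysis,

$$(\mathbf{H}\cdot\operatorname{grad})\mathbf{H} = \tfrac{1}{2}\operatorname{grad}H^2 - \mathbf{H}\times\operatorname{curl}\mathbf{H} = \tfrac{1}{2}\operatorname{grad}H^2 + 4\pi\,\mathbf{j}\times\mathbf{H}/c.$$

Hence we have

$$\mathbf{f} = -\operatorname{grad}P_0 + \frac{1}{8\pi}\operatorname{grad}\left[H^2\rho\left(\frac{\partial\mu}{\partial\rho}\right)_T\right] - \frac{H^2}{8\pi}\operatorname{grad}\mu + \frac{\mu}{c}\,\mathbf{j}\times\mathbf{H}. \tag{35.3}$$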


The last term does not appear in the corresponding formula (15.12). It would, however, 
be incorrect to suppose that the presence of this term means that a force can be isolated in f 
which is due to the conduction current. The reason is that, by (35.1), the current j is 
inseparable from non-uniformity of the field, and another term in (35.3) also involves the 
space derivatives of the field. When the magnetic permeability of the medium is appreciably 
different from unity, all the terms in (35.3) are in general of the same order of magnitude. 

If, however, as usually happens, $\mu$ is close to 1, the last term in (35.3) gives the main
contribution to the force when a conduction current is present, and the remaining terms
form only a small correction. In calculating the forces we can then put $\mu = 1$, obtaining
simply

$$\mathbf{f} = \mathbf{j}\times\mathbf{H}/c. \tag{35.4}$$

The term $-\operatorname{grad}P_0$ is of no interest henceforward, and we omit it. For $\mu = 1$ the properties
of the substance have no effect on the magnetic phenomena, and the expression (35.4) for
the force is equally valid for fluid and for solid conductors. The total force exerted by a
magnetic field on a conductor carrying a current is given by the integral

$$\mathbf{F} = \int\mathbf{j}\times\mathbf{H}\,dV/c. \tag{35.5}$$

Formula (35.4) can, of course, be very easily obtained from the familiar expression for the
Lorentz force. The macroscopic force on a body at rest in a magnetic field is just the
averaged Lorentz force exerted on the charged particles in the body by the microscopic
field $\mathbf{h}$: $\mathbf{f} = \overline{\rho\mathbf{v}\times\mathbf{h}}/c$. For $\mu = 1$ the field $\mathbf{h}$ is equal to the mean field $\mathbf{H}$, and the mean value
$\overline{\rho\mathbf{v}}$ is the conduction current density.

When a conductor moves, the forces (35.4) do mechanical work on it. At first sight it 
might appear that this contradicts the result that the Lorentz forces do no work on moving 
charges. In reality, of course, there is no contradiction, since the work done by the Lorentz 
forces in a moving conductor includes not only the mechanical work but also the work 
done by the electromotive forces induced in the conductor during its motion. These two 
quantities of work are equal and opposite; see the second footnote to §63. 

In the expression (35.4) H is the true value of the magnetic field due both to external 
sources and to the currents themselves on which the force (35.4) acts. In calculating the 
total force from (35.5), however, we can take H to be simply the external field $\mathfrak{H}$ in which
the conductor carrying a current is placed. The field of the conductor itself cannot, by the 
law of conservation of momentum, contribute to the total force acting on the conductor. 

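The calculation of the forces is particularly simple for a linear conductor. Its magnetic
properties are of no significance, and, if $\mu = 1$ in the surrounding medium, the total force
on the conductor is given by the line integral

$$\mathbf{F} = J\oint d\mathbf{l}\times\mathfrak{H}/c. \tag{35.6}$$

This expression can be written as an integral over a surface bounded by the current
circuit. Using Stokes' theorem, we replace $d\mathbf{l}$ by the operator $d\mathbf{f}\times\operatorname{grad}$, obtaining
$\oint d\mathbf{l}\times\mathfrak{H} = \int(d\mathbf{f}\times\operatorname{grad})\times\mathfrak{H}$. Now

$$(d\mathbf{f}\times\operatorname{grad})\times\mathfrak{H} = -d\mathbf{f}\operatorname{div}\mathfrak{H} + \operatorname{grad}(d\mathbf{f}\cdot\mathfrak{H}) = -d\mathbf{f}\operatorname{div}\mathfrak{H} + d\mathbf{f}\times\operatorname{curl}\mathfrak{H} + (d\mathbf{f}\cdot\operatorname{grad})\mathfrak{H}.$$

But $\operatorname{div}\mathfrak{H} = 0$, and in the space outside the currents $\operatorname{curl}\mathfrak{H} = 0$ also. Thus

$$\mathbf{F} = J\int(d\mathbf{f}\cdot\operatorname{grad})\mathfrak{H}/c. \tag{35.7}$$

In particular, in an almost uniform external field $\mathfrak{H}$ can be taken outside the integral,
together with the operator grad. With the magnetic moment of the current given by
(30.18), we then have the obvious result

$$\mathbf{F} = (\mathscr{M}\cdot\operatorname{grad})\mathfrak{H}. \tag{35.8}$$

Since $\mathscr{M}$ in this formula is constant, we can also write

$$\mathbf{F} = \operatorname{grad}(\mathscr{M}\cdot\mathfrak{H}), \tag{35.9}$$

in agreement with the expression (33.17) for the energy of the current. The torque acting on
a current in an almost uniform field is easily seen to be given by the usual expression

$$\mathbf{K} = \mathscr{M}\times\mathfrak{H}. \tag{35.10}$$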
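The equivalence of the line integral (35.6) and the moment formula (35.8) can be illustrated by a short numerical sketch. The assumptions are ours and not from the text: the external field is taken to be exactly linear in the coordinates, $\mathfrak{H}_i = H_{0i} + G_{ik}x_k$ with $G_{ik}$ symmetric (zero curl) and traceless (zero divergence), and the circuit is a tilted regular polygon. For such a field the midpoint rule on straight segments is exact, so the two forces agree to rounding error. All names and numerical values are hypothetical.

```python
import math

H0 = (0.0, 0.0, 1.0)
G = ((0.3, 0.1, 0.0),
     (0.1, 0.2, -0.4),
     (0.0, -0.4, -0.5))  # symmetric and traceless: curl H = 0, div H = 0

def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def field(r):
    """External field, linear in the coordinates."""
    return tuple(H0[i] + sum(G[i][j] * r[j] for j in range(3)) for i in range(3))

def polygon_loop(center, radius, tilt, sides=12):
    """Vertices of a regular polygon whose plane is tilted about the x axis."""
    e1 = (1.0, 0.0, 0.0)
    e2 = (0.0, math.cos(tilt), math.sin(tilt))
    pts = []
    for k in range(sides):
        t = 2.0 * math.pi * k / sides
        pts.append(tuple(center[i] + radius * (math.cos(t) * e1[i] + math.sin(t) * e2[i])
                         for i in range(3)))
    return pts

def force_line_integral(points, j_over_c):
    """F = (J/c) * sum of dl x H over the segments, as in (35.6)."""
    total = [0.0, 0.0, 0.0]
    for k in range(len(points)):
        p, q = points[k], points[(k + 1) % len(points)]
        dl = tuple(q[i] - p[i] for i in range(3))
        mid = tuple(0.5 * (p[i] + q[i]) for i in range(3))
        f = cross(dl, field(mid))
        for i in range(3):
            total[i] += j_over_c * f[i]
    return tuple(total)

def magnetic_moment(points, j_over_c):
    """Moment (J/2c) * sum of r x dl, i.e. (J/c) times the vector area."""
    total = [0.0, 0.0, 0.0]
    for k in range(len(points)):
        p, q = points[k], points[(k + 1) % len(points)]
        a = cross(p, q)
        for i in range(3):
            total[i] += 0.5 * j_over_c * a[i]
    return tuple(total)

def force_from_moment(moment):
    """F_i = M_k dH_i/dx_k = G_ik M_k, as in (35.8)."""
    return tuple(sum(G[i][k] * moment[k] for k in range(3)) for i in range(3))

loop = polygon_loop(center=(0.3, -0.2, 0.5), radius=0.4, tilt=0.7)
print(force_line_integral(loop, j_over_c=2.0))
print(force_from_moment(magnetic_moment(loop, j_over_c=2.0)))
```

In a field that is only approximately uniform over the circuit the two expressions would differ by terms involving second derivatives of the field.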


PROBLEM 


Determine the force on a straight wire carrying a current $J$ and parallel to an infinite circular cylinder with
magnetic permeability $\mu$, radius $a$ and axis at a distance $b$ from the wire.

Solution. On account of the relation, mentioned in the second footnote to §30, between two-dimensional
problems of electrostatics and magnetostatics, the field of the current is obtained from the solution of the
corresponding electrostatic problem in §7 by changing the notation. The field in the space round the cylinder is
the same as that produced in a vacuum by the current $J$ and currents $+J'$ and $-J'$ through $A$ and $O'$ (Fig. 12, §7)
respectively, where $J' = (\mu - 1)J/(\mu + 1)$. The field within the cylinder is the same as that produced by
a current $J'' = 2J/(\mu + 1)$ through $O$. The force per unit length of the conductor is

$$F = \frac{2JJ'}{c^2}\left(\frac{1}{OA} - \frac{1}{OO'}\right) = \frac{2J^2a^2(\mu - 1)}{b(b^2 - a^2)(\mu + 1)c^2}.$$

Similarly we find (see §7, Problem 4) that a linear conductor passing through a cylindrical hole in a magnetic
medium is attracted to the nearest surface of the hole by a force

$$F = \frac{2J^2b(\mu - 1)}{(a^2 - b^2)(\mu + 1)c^2}.$$

§36. Gyromagnetic phenomena 

Uniform rotation of a body (having no magnetic structure) causes a magnetization
which is linearly dependent on the angular velocity $\boldsymbol{\Omega}$ (the Barnett effect).
Phenomenologically, a linear relation between the vectors $\mathscr{M}$ and $\boldsymbol{\Omega}$ is possible because
both change sign under time reversal. Since both are axial vectors, the relation can hold
even in an isotropic body, where it reduces to a simple proportionality between $\mathscr{M}$ and $\boldsymbol{\Omega}$.

There must also be an inverse effect: a freely suspended body, on being magnetized, 
begins to rotate (the Einstein-de Haas effect). There is a simple thermodynamic relation 
between the two effects; it can be derived as follows. 

As we know (see SP 1, §26), the thermodynamic potential with respect to the angular
velocity (for given temperature and volume of the body) is the free energy $\mathscr{F}'$ of the body
in a system of coordinates rotating with it. The angular momentum $\mathbf{L}$ of the body is

$$\mathbf{L} = -\partial\mathscr{F}'/\partial\boldsymbol{\Omega}. \tag{36.1}$$

The gyromagnetic phenomena are described by adding to the free energy a further term,
which is the first term containing both $\boldsymbol{\Omega}$ and $\mathbf{M}$ in an expansion in powers of $\boldsymbol{\Omega}$ and of the
magnetization $\mathbf{M}$ at each point in the body. This term is linear in both, i.e. it is

$$\mathscr{F}'_{\text{gyro}} = -\int\lambda_{ik}\Omega_iM_k\,dV = -\lambda_{ik}\Omega_i\mathscr{M}_k, \tag{36.2}$$

where $\lambda_{ik}$ is a constant tensor, in general unsymmetrical.

According to (36.1) and (36.2) the angular momentum acquired by the body as a result of
magnetization is related to its total magnetic moment by $L_{\text{gyro},i} = \lambda_{ik}\mathscr{M}_k$. It is usual to
replace $\lambda_{ik}$ by the inverse tensor, defined as $g_{ik} = (2mc/e)\lambda^{-1}{}_{ik}$, where $e$ and $m$ are the
electron charge and mass. The dimensionless quantities $g_{ik}$ are called gyromagnetic
coefficients. Then

$$\mathscr{M}_i = (e/2mc)g_{ik}L_{\text{gyro},k}. \tag{36.3}$$

The expression (36.2) also shows that, as regards its magnetic effect, the rotation of the
body is equivalent to an external field $\mathfrak{H}_k = \lambda_{ik}\Omega_i$ or

$$\mathfrak{H}_i = (2mc/e)g^{-1}{}_{ki}\Omega_k. \tag{36.4}$$

We thus have the possibility, in principle, of calculating the magnetization caused by the
rotation. For example, if the magnetic susceptibility $\chi_{ik}$ of the body is small, the magnetic
moment which it acquires (per unit volume) is independent of its shape and is

$$M_i = \chi_{ik}\mathfrak{H}_k = (2mc/e)\chi_{ik}g^{-1}{}_{lk}\Omega_l.$$

Formulae (36.3) and (36.4) represent respectively the Einstein-de Haas and Barnett effects.
We see that both effects are determined by the same tensor $g_{ik}$.
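To give a feeling for the orders of magnitude in (36.4), the following sketch evaluates the equivalent field for an isotropic body, for which $g_{ik} = g\delta_{ik}$ and the field has magnitude $(2mc/|e|)\Omega/g$. The value $g = 2$ and the rotation rate are assumptions chosen only for illustration, and only the magnitude of the electron charge is used; the function name is hypothetical.

```python
import math

# Gaussian (CGS) values of the constants
M_E = 9.1093837e-28        # electron mass, g
C_LIGHT = 2.99792458e10    # velocity of light, cm/s
E_CHARGE = 4.80320471e-10  # magnitude of the electron charge, esu

def barnett_field(omega, g=2.0):
    """Magnitude (in oersted) of the field equivalent to rotation at omega (rad/s)
    for an isotropic body with gyromagnetic coefficient g."""
    return (2.0 * M_E * C_LIGHT / E_CHARGE) * omega / g

omega = 2.0 * math.pi * 100.0  # hypothetical: 100 revolutions per second
print(barnett_field(omega))
```

The equivalent field is of the order of $10^{-5}$ Oe even at this rotation rate, which shows how small the Barnett effect is.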
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FERROMAGNETISM AND 
ANTIFERROMAGNETISM 


§37. Magnetic symmetry of crystals 

There is a profound difference between the electric properties of crystals and their 
magnetic properties, which results from a difference in the behaviour of charges and 
currents with respect to time reversal. 

The invariance of the equations of motion with respect to this change means that the 
formal substitution $t \to -t$, on being applied to any state of thermodynamic equilibrium
of a body, must give some possible equilibrium state. There are then two possibilities: 
either the state obtained by changing the sign of t is the same as the original state, or it is 
not. 

In this section we denote by $\rho(x, y, z)$ and j(x, y, z) the true (microscopic) charge and
current densities at any given point in the crystal, averaged only over time, and not over 
“physically infinitesimal” volumes as in the macroscopic theory. These are the functions 
which determine the electric and magnetic structure of the crystal respectively. 

When t is replaced by −t, j changes sign. If the state of the body remains unchanged, it
follows that j = — j, i.e. j = 0. Thus there is a reason why bodies can exist in which the 
function j (x, y, z) is identically zero. In such bodies, not only the current density but also 
the (time) average magnetic field and magnetic moment vanish at every point (we are 
speaking, of course, of states in the absence of an external magnetic field). Such bodies may 
be said to have no “magnetic structure”, and indeed the great majority of bodies fall into 
this category. 

The charge density $\rho$, on the other hand, is unchanged when $t \to -t$. There is therefore
no reason why this function should be identically zero. In other words, there are no
crystals without “electric structure”, and herein lies the essential difference, mentioned at
the beginning of this section, between the electric and the magnetic properties of crystals.

Let us now consider crystals for which the change from t to −t results in a change of
state, so that $\mathbf{j} \neq 0$. We shall say that such bodies have a magnetic structure. First of all, we
note that, although j is not zero, there can be no total current in an equilibrium state of the
body, i.e. the integral $\int\mathbf{j}\,dV$ taken over a unit cell must always be zero.† Otherwise the
current would produce a macroscopic magnetic field, and the crystal would have a
magnetic energy per unit volume increasing rapidly with its dimensions. Since such a state
is energetically unfavourable, it could not correspond to thermodynamic equilibrium.

The currents j may, however, produce a non-zero macroscopic magnetic moment, i.e.
the integral $\int\mathbf{r}\times\mathbf{j}\,dV$, again taken over a unit cell, need not be zero. Accordingly, the bodies
for which $\mathbf{j} \neq 0$ may be divided into two types: those in which the macroscopic magnetic
moment is not zero, called ferromagnets, and those in which it is zero, called
antiferromagnets.

† It should be emphasized that the cell spoken of here is the true unit cell whose definition involves the
magnetic structure of the crystal, and this “magnetic cell” may be different from the purely crystallographic cell,
which relates only to the symmetry of the charge distribution in the lattice (cf. §38 below).

What are the possible symmetry groups of the current distribution j(x, y, z)? This
symmetry contains, first of all, the usual rotations, reflections and translations, and so the
possible symmetry groups of j always include the usual 230 crystallographic space
symmetry groups. These, however, are by no means all. As has already been mentioned, the
substitution $t \to -t$ changes the sign of the vector j. For this reason a new symmetry
element comes in, namely that resulting from the reversal of all currents; we shall denote
this transformation by R. If the current distribution itself has the symmetry R, it follows that
j = - j, i.e. j = 0, and the body has no magnetic structure. A non-zero function j (x, y, z) 
may, however, be symmetrical with respect to various combinations of R with the other 
symmetry elements (rotations, reflections and translations). Thus the problem of 
determining the possible types of symmetry of the current distribution (the magnetic space 
groups) amounts to the enumeration of all possible groups containing both the 
transformations of the ordinary space groups and the combinations of these with R. 

If the symmetry of the current distribution is given, the crystallographic symmetry of
the particle distribution, which is also the symmetry of the function $\rho(x, y, z)$, is
determined. It is the symmetry of the space group which is obtained from the symmetry
group of j by formally regarding the transformation R as the identity (as it is with respect
to the function $\rho$).

If only the macroscopic properties of the body are of interest, however, it is not necessary 
to know the complete symmetry group of the function j (x, y, z). These properties depend 
only on the direction in the crystal, and the translational symmetry of the lattice does not 
affect them. As regards crystallographic structure, the “symmetry of directions” is specified 
by the 32 crystal classes. These are the symmetry groups consisting of rotations and 
reflections only, and are obtained from the space groups by regarding every translation as 
the identity, and the screw axes and glide planes as simple axes and planes of symmetry. As 
regards the magnetic properties, the macroscopic symmetry can be classified by groups 
(consisting of rotations, reflections and combinations of these with R) which may be called 
the magnetic crystal classes. They are related to the magnetic space groups in the same way
as are the ordinary crystal classes to the ordinary space groups. They include, firstly, the
usual 32 classes, and those classes augmented by the element R. These augmented classes
are, in particular, the macroscopic symmetry groups for all bodies having no magnetic
structure, but they occur also in bodies with magnetic structure. This happens if the
magnetic space symmetry group of such bodies includes R only in combination with
translations, and not alone.

There are also 58 classes in which R enters only in combination with rotations or
reflections. Each of these becomes one of the ordinary crystal classes if R is replaced by the
identity.

It should be noted that the occurrence of magnetic structure (ferromagnetic or
antiferromagnetic) always involves comparatively weak interactions.† Hence the crystal
structure of a magnetic body is only a slight modification of that in the non-magnetic
phase, which usually changes into the magnetic phase when the temperature is reduced. In
this respect ferromagnets, in particular, differ from ordinary pyroelectrics, but are
analogous to ferroelectrics.

† The exchange interaction between the magnetic moments of atoms usually results in the saturation of the
valency bonds and the formation of non-magnetic structures. A magnetic structure results only from the relatively
weak exchange interactions between deep-lying d and f electrons of atoms of elements in the intermediate groups
of the periodic system.

If the magnetic crystal class of a body is specified, its macroscopic magnetic properties 
are qualitatively determined. The most important of these is the presence or absence of a
macroscopic magnetic moment, i.e. of spontaneous magnetization in the absence of an
external field. The magnetic moment M is a vector, behaving as an axial vector (the vector
product of two polar vectors) under rotation and reflection, and changing sign under the
operation R. The crystal will possess spontaneous magnetization if it has one or more
directions such that a vector M in that direction and having the above-mentioned
properties is invariant under all transformations belonging to the magnetic crystal class
concerned. 

We must again emphasize the difference between these (macroscopic) properties and the
corresponding ones in electrostatics. The latter are qualitatively determined by the
ordinary crystal class. In particular, a body is pyroelectric if its crystal class admits the
existence of a polar vector P (the polarization). It would, however, be entirely wrong to
base conclusions about the existence or otherwise of a macroscopic magnetic moment on
the behaviour of the axial vector M with respect to the transformations of the purely
structural crystal class of the body concerned, which corresponds to the symmetry of the
function $\rho(x, y, z)$; we shall return to this problem in §38, after actually constructing the
magnetic classes.

Instead of the symmetry of the function j(x, y, z), we can use that of the microscopic
magnetic moment density distribution M(x, y, z) = r × j(x, y, z). This in turn may be
regarded as the symmetry of the configuration and orientation of the (time) averaged
values of the magnetic moments $\boldsymbol{\mu}$ of the atoms or ions in the lattice. In a body without
magnetic symmetry these averaged values are zero. In a ferromagnet the sum of the atomic
moments in each unit cell is non-zero, but in an antiferromagnet it is zero.

A set of atoms in the lattice having equal values of $\boldsymbol{\mu}$ is called a magnetic sub-lattice. An
antiferromagnet will evidently contain at least two sub-lattices with equal and opposite
values of $\boldsymbol{\mu}$. If the directions of the moments $\boldsymbol{\mu}$ are parallel or antiparallel for all sub-lattices,
such a body is said to be collinear; in the contrary case, it is a non-collinear antiferromagnet.

A ferromagnet also may contain more than one sub-lattice. In the narrow sense, the term
“ferromagnetic” applies to bodies in which all the mean atomic magnetic moments are
parallel. A crystal is said to be ferrimagnetic if it contains two or more sub-lattices with
atomic moments that differ in direction or magnitude. In contrast to the antiferromagnetic
case, in these substances the vector sum M of the magnetic moments of the sub-lattices is
not zero. A ferromagnet may be collinear, or not, according as the magnetic moments of all
its sub-lattices are parallel or antiparallel, or not.

§38. Magnetic classes and space groups

We shall now show how the magnetic symmetry groups are constructed, beginning with
the magnetic classes.

As already mentioned in §37, the magnetic classes fall into three types. Type I includes
the 32 ordinary crystal classes, which do not involve the element R. Type II includes the
same 32 classes augmented by R. Each such class contains all the elements of the ordinary
class (of the point group G) and also all these elements multiplied by R; if the magnetic class
is denoted by M, then we can write

$$M = G + RG. \tag{38.1}$$

The transformation R commutes, of course, with all spatial rotations and reflections; hence
$RG = GR$, where $G$ here denotes any element of the group G.

These two types of class are, in a sense, trivial. The non-trivial type III includes the 58 
magnetic classes, in which R enters only in combination with rotations or reflections. Each 
of these becomes one of the ordinary crystal classes of G if R is replaced by the identity. The 
construction of all magnetic classes of this type is carried out as follows. 

Let H denote the set of elements of the group G which are not multiplied by R when the 
magnetic class M is formed. Such a set includes, by definition, the unit element E, since 
otherwise M would contain the element R itself, i.e., would belong to type II. The products 
of any two of the elements in this set are also members of the set. Thus H is a sub-group of 
G. All other elements of G appear in M multiplied by R; since $R^2 = E$, all products of pairs
of these elements belong to H. It follows that H is a sub-group of G (and therefore of M)
with index 2.† The structure of a type III magnetic class M may therefore be written as

$$M = H + RG_1H, \tag{38.2}$$

where $G_1$ is any element of G that is not an element of H. It is evident that the groups M and
$G = H + G_1H$ are isomorphous.

The problem of constructing all the magnetic classes thus amounts to finding the sub¬ 
groups with index 2 in all the crystal classes. This is easily done by means of the character 
tables for the irreducible representations of the point groups. Every one-dimensional 
(other than the unit) representation of the group contains an equal number of characters 
+1 and −1; the elements with characters +1 form a sub-group with index 2. On changing
to the magnetic class, these elements remain the same, while all others are multiplied by R. 

The procedure may be illustrated with reference to the point group $C_{4v}$. The character
table for its irreducible representations is (see QM, §95)

           E     C_2    2C_4    2σ_v    2σ'_v
    A_1    1      1      1       1       1
    A_2    1      1      1      −1      −1
    B_1    1      1     −1       1      −1
    B_2    1      1     −1      −1       1
    E      2     −2      0       0       0

The one-dimensional (other than the unit) representations are $A_2$, $B_1$, $B_2$. In the
representation $A_2$, the elements with characters +1 form the sub-group $C_4$. The
corresponding magnetic class, denoted by $C_{4v}(C_4)$, consists of the elements $E$, $C_2$, $2C_4$,
$2R\sigma_v$, $2R\sigma'_v$. In the representations $B_1$ and $B_2$, the elements with characters +1 form the
sub-groups $C_{2v}$, which differ only in the position of the planes $\sigma_v$ relative to fixed
coordinates. These sub-groups are crystallographically indistinguishable, and correspond
to the same magnetic class $C_{4v}(C_{2v})$, whose elements are $E$, $C_2$, $2RC_4$, $2\sigma_v$, $2R\sigma'_v$.

† This means that H contains half as many elements as G. The statement is a consequence of a general theorem
which is fairly obvious: a sub-group H of G has index 2 if and only if the product of any two elements of G that do
not belong to H is an element of H.
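The construction just described for $C_{4v}$ can be reproduced by direct enumeration. The sketch below is not the character-table method of the text but an equivalent brute-force search, given under stated assumptions: the group is realised as $2\times2$ integer matrices acting on $(x, y)$, every subset of half the order that contains $E$ is tested for closure, and each magnetic class is stored as pairs (element, flag) with flag 1 meaning "multiplied by R". All names are hypothetical.

```python
from itertools import combinations

E = ((1, 0), (0, 1))
C4 = ((0, -1), (1, 0))       # rotation through 90 degrees about the z axis
SIGMA_V = ((1, 0), (0, -1))  # reflection in the vertical xz plane

def matmul(a, b):
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )

def generate(generators):
    """All products of the generators (a finite group)."""
    group, frontier = {E}, [E]
    while frontier:
        g = frontier.pop()
        for s in generators:
            p = matmul(g, s)
            if p not in group:
                group.add(p)
                frontier.append(p)
    return group

def index2_subgroups(group):
    """Subsets of half the order that contain E and are closed under products."""
    others = [g for g in group if g != E]
    found = []
    for combo in combinations(others, len(group) // 2 - 1):
        h = set(combo) | {E}
        if all(matmul(x, y) in h for x in h for y in h):
            found.append(frozenset(h))
    return found

def magnetic_class(group, subgroup):
    """M = H + R G_1 H: elements outside H carry the flag 1 (multiplied by R)."""
    return {(g, 0 if g in subgroup else 1) for g in group}

def is_closed(mclass):
    """Closure of M, using R^2 = E and the fact that R commutes with everything."""
    return all(
        (matmul(a, b), (ra + rb) % 2) in mclass
        for a, ra in mclass
        for b, rb in mclass
    )

C4V = generate([C4, SIGMA_V])
SUBGROUPS = index2_subgroups(C4V)
for h in SUBGROUPS:
    label = "C4v(C4)" if C4 in h else "C4v(C2v)"
    print(label, is_closed(magnetic_class(C4V, h)))
```

The search finds three sub-groups of index 2: one containing the rotation $C_4$ and two of type $C_{2v}$, the latter pair giving the same magnetic class, in agreement with the discussion of $A_2$, $B_1$ and $B_2$ above.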
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Table 1 Magnetic classes 


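$C_i(C_1)$
$C_s(C_1)$
$C_2(C_1)$
$C_{2h}(C_i, C_2, C_s)$
$C_{2v}(C_s, C_2)$
$D_2(C_2)$
$D_{2h}(D_2, C_{2h}, C_{2v})$
$C_4(C_2)$
$S_4(C_2)$
$D_{2d}(S_4, D_2, C_{2v})$
$D_4(C_4, D_2)$
$C_{4v}(C_4, C_{2v})$
$C_{4h}(C_4, C_{2h}, S_4)$
$D_{4h}(D_4, C_{4h}, D_{2h}, C_{4v}, D_{2d})$
$S_6(C_3)$
$C_{3v}(C_3)$
$D_3(C_3)$
$D_{3d}(D_3, S_6, C_{3v})$
$C_{3h}(C_3)$
$C_6(C_3)$
$D_{3h}(C_{3h}, C_{3v}, D_3)$
$C_{6h}(C_6, S_6, C_{3h})$
$C_{6v}(C_6, C_{3v})$
$D_6(C_6, D_3)$
$D_{6h}(D_6, C_{6h}, C_{6v}, D_{3d}, D_{3h})$
$T_h(T)$
$O(T)$
$T_d(T)$
$O_h(O, T_h, T_d)$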


On going through all the 32 crystal classes in this way, we obtain the 58 magnetic classes
of type III, as listed in Table 1. Each class G(H) is defined by an original point group G and
a sub-group H thereof which is one of those listed in parentheses after the symbol for the
particular group G. The crystal classes $C_1$, $C_3$ and $T$ have no sub-groups with index 2, and so
they give rise to no magnetic classes. The rotation $C_3$, moreover, never appears multiplied
by R in a (non-trivial) magnetic class: the rotation $C_3R$ thrice repeated would give R, which
is not a member of such classes.†

It has been mentioned in §37 that the crystal class concerned does not determine whether
or not ferromagnetism can exist. To illustrate this, let us consider a tetragonal lattice of
identical atoms, with magnetic moments parallel to the tetragonal axis.‡ Its magnetic
crystal class is $D_{4h}(C_{4h})$, containing the transformations§ $E$, $C_2$, $2C_4$, $2U_2R$, $2U_2'R$, $I$, $\sigma_h$,
$2S_4$, $2\sigma_vR$, $2\sigma_v'R$. All these transformations leave invariant the axial vector M, which is
parallel to the fourfold axis. The crystal class $D_{4h}$ itself, however, would not allow the
existence of an axial vector: all the components $M_x$, $M_y$, $M_z$ would change sign in a
rotation about some twofold axis, for example.
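The statement about the class $D_{4h}(C_{4h})$ can be verified element by element. The following is a sketch under assumptions of our own choosing: the point group is generated from $3\times3$ integer matrices for $C_4$ (about $z$), $U_2$ (about $x$) and the inversion $I$; an axial vector transforms as $\det(A)\,A\mathbf{M}$, and the operation R reverses its sign. The names are hypothetical.

```python
IDENTITY = ((1, 0, 0), (0, 1, 0), (0, 0, 1))
C4Z = ((0, -1, 0), (1, 0, 0), (0, 0, 1))         # fourfold rotation about z
U2X = ((1, 0, 0), (0, -1, 0), (0, 0, -1))        # twofold rotation about x
INVERSION = ((-1, 0, 0), (0, -1, 0), (0, 0, -1))

def matmul(a, b):
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3))
        for i in range(3)
    )

def det(a):
    return (a[0][0] * (a[1][1] * a[2][2] - a[1][2] * a[2][1])
            - a[0][1] * (a[1][0] * a[2][2] - a[1][2] * a[2][0])
            + a[0][2] * (a[1][0] * a[2][1] - a[1][1] * a[2][0]))

def generate(generators):
    """All products of the generators (a finite group)."""
    group, frontier = {IDENTITY}, [IDENTITY]
    while frontier:
        g = frontier.pop()
        for s in generators:
            p = matmul(g, s)
            if p not in group:
                group.add(p)
                frontier.append(p)
    return group

def transform_axial(a, with_r, m):
    """Axial vector: M -> det(A) A M, with an extra sign change if R is applied."""
    sign = det(a) * (-1 if with_r else 1)
    return tuple(sign * sum(a[i][j] * m[j] for j in range(3)) for i in range(3))

def is_invariant(elements, m):
    return all(transform_axial(a, with_r, m) == m for a, with_r in elements)

D4H = generate([C4Z, U2X, INVERSION])
C4H = generate([C4Z, INVERSION])
MAGNETIC = [(g, g not in C4H) for g in D4H]  # elements outside C4h carry R
ORDINARY = [(g, False) for g in D4H]         # the crystal class D4h itself
M_AXIS = (0, 0, 1)

print(len(D4H), len(C4H))
print(is_invariant(MAGNETIC, M_AXIS), is_invariant(ORDINARY, M_AXIS))
```

The first line printed is `16 8` and the second `True False`: the vector along the fourfold axis survives in the magnetic class but not in the ordinary class $D_{4h}$.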

Let us now turn to the magnetic space groups. These are in the same relation to t e 
ordinary crystal space groups as the magnetic classes are to the crystal classes, reducing to 
these if R is replaced by the identity. The total number of magnetic space groups is 1651- 
like the magnetic classes, they fall into three types. 

Type I contains 230 groups which coincide with the crystal groups and do not involve > 
type II has the same 230 groups augmented by R. 

The non-trivial type III contains 1191 groups in which R enters only in combination wi 
rotations, reflections or translations. It has the structure (38.2), where H is any sub-gro 


† In abstract symmetry theory, magnetic symmetry is called antisymmetry. This concept was
proposed by H. Heesch (1929) and A. V. Shubnikov (1945). The antisymmetry classes were found by
Shubnikov (1951) as the symmetry groups of geometric figures (polyhedra) with faces of two colours;
the element R corresponds to changing the colour of each face. These classes were derived as
magnetic symmetry groups by B. A. Tavger and V. M. Zaitsev (1956). The method of derivation given
here is due to V. L. Indenbom.

‡ Such, for instance, is the lattice of iron in its ferromagnetic phase. Crystallographically, it is a
cubic lattice slightly distorted along one of the fourfold symmetry axes. The distortion is the result
of magnetostriction, i.e. it arises simply from the presence of the magnetic structure.

§ The notation for the symmetry elements is everywhere as in QM, §§93 and 94. In particular, $U_2$
and $U_2'$ are rotations through 180° about horizontal axes perpendicular to the fourfold axis,
$\sigma_h$ a reflection in the horizontal plane, and $\sigma_v$ and $\sigma_v'$ are reflections in
vertical planes passing through the $C_4$ axis and the $U_2$, $U_2'$ axes. The symbols $U_2R$,
etc. denote symmetry planes and axes which appear in this class in combination with R.

with index 2 of the crystal space group G, and $G_1$ is an element of G that is not an element of
H. The sub-group H must evidently coincide with the original space group G as regards
either translational symmetry or class, having respectively either half as many “rotational”
elements (rotations and reflections) as G, or half as many translations. Accordingly, type III
may be subdivided into two sub-types.

Sub-type IIIa contains the magnetic space groups for which $G_1$ in (38.2) is a “rotational”
transformation of the crystal group $G = H + G_1H$ which does not belong to H. The
translational symmetry (Bravais lattice) of a space group M of this type is the same as that
of G; that is, the unit cell of the magnetic structure is the same as the purely crystallographic
one. These magnetic space groups, 674 in number, belong to magnetic classes of type III.

Sub-type IIIb contains the magnetic space groups for which $G_1$ in (38.2) may be taken to
be a pure translation through one of the basic periods of G. The unit cell of the magnetic
structure has a volume twice that of the crystallographic unit cell. The set of pure
translations and translations multiplied by R forms the magnetic Bravais lattice; there are
22 different lattices of this kind. The magnetic space groups of sub-type IIIb, 517 in
number, belong to magnetic classes of type II.†
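
The counts quoted for the three types can be checked by simple addition. The following Python sketch does only that; the variable names are ours and have no significance beyond the illustration.

```python
# Numbers of magnetic space groups of each type, as quoted in the text.
TYPE_I = 230      # coincide with the ordinary crystal space groups
TYPE_II = 230     # the same groups augmented by R
TYPE_IIIA = 674   # R combined with a "rotational" element
TYPE_IIIB = 517   # R combined with a pure translation

type_iii = TYPE_IIIA + TYPE_IIIB
total = TYPE_I + TYPE_II + type_iii

print("type III:", type_iii)
print("all magnetic space groups:", total)
```

The two sums reproduce the figures 1191 and 1651 given above.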


PROBLEM 

List the magnetic classes that allow ferromagnetism. 

Solution. The classes of type II do not allow ferromagnetism. It must be emphasized that the same is
therefore true of every space group of type II or IIIb that contains translations multiplied by R; these
transformations certainly change the sign of M. In other words, for ferromagnetism to occur, it is always
necessary that the magnetic unit cell should coincide with the crystallographic one.‡

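Of the classes of type I (which are the same as the ordinary crystal classes), the following allow the existence of
an axial vector M: $C_1$ and $C_i$ with M in any direction; $C_s$ with M perpendicular to the plane of symmetry
(a reflection leaves unchanged only the normal component of an axial vector); $C_2$, $C_{2h}$, $C_4$, $S_4$,
$C_{4h}$, $C_3$, $S_6$, $C_6$, $C_{3h}$, $C_{6h}$ with M parallel to the axis of symmetry.

For ferromagnetism to exist in classes of type III, they must never contain the element IR, which changes the
sign of the vector M whatever the direction of the latter. Of such classes, the following allow ferromagnetism:
$C_2(C_1)$ with M perpendicular to the $C_2R$ axis; $C_s(C_1)$ with M in the $\sigma R$ plane; $C_{2h}(C_i)$ with M in the
$\sigma R$ plane and perpendicular to the $C_2R$ axis; $D_2(C_2)$, $C_{2v}(C_2)$, $D_{2h}(C_{2h})$ with M parallel to the axis;
$C_{2v}(C_s)$ with M perpendicular to the plane of symmetry $\sigma$; $D_4(C_4)$, $C_{4v}(C_4)$, $D_{2d}(S_4)$, $D_{4h}(C_{4h})$,
$D_3(C_3)$, $C_{3v}(C_3)$, $D_{3d}(S_6)$, $D_{3h}(C_{3h})$, $D_6(C_6)$, $C_{6v}(C_6)$, $D_{6h}(C_{6h})$ with M parallel to
the principal axis.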
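
The rule used in this solution (an axial vector transforms like a polar vector multiplied by the determinant of the transformation, and R reverses its sign) is easily tried out on a few small groups. The Python sketch below is an illustration only: the helper names are ours, each element is written as a pair (matrix, combined with R or not), and only the three coordinate axes are tested as candidate directions of M.

```python
def det3(a):
    """Determinant of a 3x3 matrix given as nested lists."""
    return (a[0][0] * (a[1][1] * a[2][2] - a[1][2] * a[2][1])
            - a[0][1] * (a[1][0] * a[2][2] - a[1][2] * a[2][0])
            + a[0][2] * (a[1][0] * a[2][1] - a[1][1] * a[2][0]))


def act_on_axial(op, vec):
    """Transform an axial vector by op = (matrix, with_R).

    The vector is rotated by the matrix, multiplied by det(matrix),
    and reversed once more if the element is combined with R.
    """
    matrix, with_r = op
    sign = det3(matrix) * (-1 if with_r else 1)
    return tuple(sign * sum(matrix[i][j] * vec[j] for j in range(3))
                 for i in range(3))


def allowed_axes(ops):
    """Coordinate axes along which M is invariant under every element."""
    basis = {"x": (1, 0, 0), "y": (0, 1, 0), "z": (0, 0, 1)}
    return [name for name, v in basis.items()
            if all(act_on_axial(op, v) == v for op in ops)]


E = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
INV = [[-1, 0, 0], [0, -1, 0], [0, 0, -1]]      # inversion I
C2Z = [[-1, 0, 0], [0, -1, 0], [0, 0, 1]]       # rotation through 180 deg about z
SIGMA_Z = [[1, 0, 0], [0, 1, 0], [0, 0, -1]]    # reflection in the xy-plane

c_s = [(E, False), (SIGMA_Z, False)]            # type I class C_s
c_s_c1 = [(E, False), (SIGMA_Z, True)]          # type III class C_s(C_1)
c_2h_ci = [(E, False), (INV, False), (C2Z, True), (SIGMA_Z, True)]
with_ir = [(E, False), (INV, True)]             # any class containing IR

for label, group in [("C_s", c_s), ("C_s(C_1)", c_s_c1),
                     ("C_2h(C_i)", c_2h_ci), ("contains IR", with_ir)]:
    print(label, allowed_axes(group))
```

With the reflection plane taken as the xy-plane, the class $C_s$ admits M only along the normal z, whereas $C_s(C_1)$ and $C_{2h}(C_i)$ admit M in the plane; a group containing IR admits no direction at all.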

§39. Ferromagnets near the Curie point 

There is a close analogy between the magnetic properties of ferromagnets and the electric
properties of ferroelectrics. Both exhibit spontaneous polarization, magnetic or electric, in
macroscopic volumes. In each case, this polarization vanishes at a temperature corresponding
to a second-order phase transition (the transition point between the ferromagnetic and
paramagnetic phases is called the Curie point).

There are also, however, important differences between ferromagnetic and ferroelectric
phenomena, arising from the difference in the microscopic interaction forces which bring


† The magnetic space groups were constructed (as antisymmetry groups) by A. M. Zamorzaev (1953) and by
N. V. Belov, N. N. Neronova, and T. S. Smirnova (1955); the latter authors' tables are given by A. V. Shubnikov and
N. V. Belov, Colored Symmetry, Pergamon Press, Oxford, 1964. The most complete tables of the magnetic space
groups are given by V. A. Koptsik, Shubnikov groups (Shubnikovskie gruppy), Moscow University Press.

‡ This refers (see the last footnote but two) to the symmetry of a lattice already distorted by the very existence
of the magnetic structure.

about the spontaneous polarization. In ferroelectrics, the interaction between the
molecules in the crystal lattice is essentially anisotropic, and consequently the spontaneous
polarization vector is fairly closely related to certain directions in the crystal. The
formation of a magnetic (including ferromagnetic) structure, on the other hand, is due
mainly to the exchange interaction of the atoms, which is quite independent of the
direction of the total magnetic moment relative to the lattice.† It is true that, together with
the exchange interaction, there is also a direct magnetic interaction between the magnetic
moments of the atoms. This interaction, however, is an effect of order $v^2/c^2$ (v being the
atomic velocities), since the magnetic moments themselves contain a factor v/c. The effects
of this order include also the interaction of the magnetic moments of the atoms with the
electric field of the crystal lattice. All these interactions, which may be called relativistic by
virtue of the factor $1/c^2$ in them, are weak in comparison with the exchange interaction, so
that they can result only in a comparatively slight dependence of the energy of the crystal
on the direction of magnetization. This relationship between the exchange and relativistic
interactions will be assumed throughout the rest of the chapter.‡

Consequently, the magnetization of a ferromagnet is a quantity which, in the first 
approximation (i.e. on the basis of the exchange interaction), is conserved. This fact endows 
with greater physical significance the thermodynamic theory, in which the magnetization 
M is regarded as an independent variable, the actual value of which (as a function of 
temperature, field, etc.) is afterwards determined by the appropriate conditions of thermal 
equilibrium. § 

We denote by $\tilde\Phi(\mathbf{M}, \mathbf{H})$ the thermodynamic potential per unit volume of the substance,
regarded as a function of the independent variable M (and of the other thermodynamic
variables). We shall, for the present, neglect the relativistic interactions, i.e. we shall take
into account only the exchange interaction. Then $\Phi(\mathbf{M}, 0)$ may be a function of the
magnitude of M, but not of its direction.

In order to find the thermodynamic quantities when the field H is not zero, we proceed
exactly as in the derivation of (19.3), starting from the relation
$\partial\tilde\Phi/\partial\mathbf{H} = -\mathbf{B}/4\pi$. This gives

    $\tilde\Phi(\mathbf{M}, \mathbf{H}) = \Phi(\mathbf{M}, 0) - \mathbf{M}\cdot\mathbf{H} - H^2/8\pi$.    (39.1)

Hence the potential is

    $\Phi(\mathbf{M}, \mathbf{B}) = \tilde\Phi + \mathbf{H}\cdot\mathbf{B}/4\pi$
        $= \Phi(\mathbf{M}, 0) + H^2/8\pi$
        $= \Phi(\mathbf{M}, 0) + (\mathbf{B} - 4\pi\mathbf{M})^2/8\pi$.    (39.2)

When the magnetic anisotropy of the ferromagnet is neglected, the directions of the vectors
M and H are, of course, the same, and so the vectors in formulae (39.1) and (39.2) can be
replaced by their magnitudes.

Near the Curie point, the magnetization M is small. We shall consider the properties of a
ferromagnet in this range in the general Landau theory of second-order phase transitions.†
In accordance with this theory, we expand $\Phi(M, 0)$ as a series in powers of the vector M,
which acts as an order parameter. The expansion of an isotropic function in powers of a
vector quantity can contain only even powers:

    $\tilde\Phi = \Phi_0 + AM^2 + BM^4 - MH - H^2/8\pi$,    (39.3)

where $\Phi_0$, A and B are functions only of temperature and pressure.‡

The Curie point $T = T_c(P)$ is given by the vanishing of the coefficient A: A > 0 for $T > T_c$
and A < 0 for $T < T_c$. (This relation of the phases occurs in all known ferromagnets,
although it is not thermodynamically necessary.) Near the Curie point,


    $A = a(T - T_c)$,    (39.4)

where a is a positive quantity independent of temperature. The expressions (39.3) and (39.4)
differ from (19.3) only in that M and H replace $P_z$ and $E_z$. The conclusions which follow
from these expressions will therefore be stated without repeating the arguments given in
§19.

The spontaneous magnetization in the ferromagnetic phase varies with temperature
according to

    $M = \sqrt{a(T_c - T)/2B}$.    (39.5)

Above the Curie point there is no spontaneous magnetization, and the magnetic
susceptibility is

    $\chi = 1/[2a(T - T_c)]$,    (39.6)

i.e. there is paramagnetism with susceptibility inversely proportional to $T - T_c$ (the
Curie-Weiss law). Below the Curie point,

    $\chi = (\partial M/\partial H)_{H \to 0} = 1/[4a(T_c - T)]$.    (39.7)

It should be pointed out, however, that this quantity is here not the susceptibility in the
ordinary sense of the word (i.e. the coefficient of proportionality between M and H), since
$M \neq 0$ even when H = 0.
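
The formulae (39.5)-(39.7) follow from minimizing (39.3) with respect to M, i.e. from the condition $2AM + 4BM^3 = H$. As an illustration, the Python sketch below solves this condition numerically by bisection and forms the susceptibility as a finite difference. The function names, the bracketing interval and the numerical values of a, B and $T_c$ are assumptions made for the example only, and the field is taken to be non-negative.

```python
def equilibrium_m(a, b, t_c, t, h=0.0):
    """Stable root M >= 0 of 2*A*M + 4*B*M**3 = H, with A = a*(T - T_c).

    Assumes a > 0, b > 0 and h >= 0.  The root is bracketed on the branch
    where the left-hand side increases with M, and found by bisection.
    """
    big_a = a * (t - t_c)

    def g(m):
        return 4.0 * b * m ** 3 + 2.0 * big_a * m - h

    lo = (max(-big_a, 0.0) / (6.0 * b)) ** 0.5      # here dg/dM = 0 (or M = 0)
    hi = (abs(big_a) / b) ** 0.5 + (h / b) ** (1.0 / 3.0) + 1.0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if g(mid) > 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


def susceptibility(a, b, t_c, t, dh=1e-6):
    """Finite-difference estimate of (dM/dH) as H -> 0."""
    return (equilibrium_m(a, b, t_c, t, dh) - equilibrium_m(a, b, t_c, t)) / dh


a, b, t_c = 1.0, 1.0, 1.0          # illustrative values in arbitrary units
print("M below T_c:", equilibrium_m(a, b, t_c, 0.9))
print("chi above T_c:", susceptibility(a, b, t_c, 1.1))
print("chi below T_c:", susceptibility(a, b, t_c, 0.9))
```

At equal distances from the Curie point the susceptibility found below $T_c$ is half that found above it, in accordance with the factors 4a and 2a in (39.7) and (39.6).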

The susceptibility (39.7) can actually attain values of the order of unity only in the
immediate neighbourhood of the Curie point. Except in this region, we may suppose that
the magnetization M changes only very slightly with the magnetic field and may be
regarded as a constant for any given temperature. In the following sections we shall assume
this to be true.

This constitutes a further difference between ferromagnets and ferroelectrics: for the
latter, dP/dE is in general not small even far from the Curie point. The reason again lies in
the smallness of the magnetic moments of the atoms in comparison with the electric dipole
moments of the molecules.

It has been mentioned in §19 that the application of an electric field blurs the discrete
second-order phase transition point in ferroelectrics. The same is true, of course, for a
ferromagnet in a magnetic field. Since M and H have the same direction in the exchange 
approximation, the blurring of the transition occurs in that approximation whatever the 
crystallographic direction of H. 


§40. The magnetic anisotropy energy 

As already mentioned, the anisotropy of the magnetic properties of ferromagnets is 
due to the relativistic interactions between their atoms, and these interactions are 
comparatively weak. In the macroscopic theory, the anisotropy is described by the addition 
to the thermodynamic potential of the magnetic anisotropy energy, which depends on the 
direction of magnetization. 

The calculation of the anisotropy energy from the microscopic theory would require the 
use of quantum perturbation theory, the energy of the perturbation being represented by 
the terms in the Hamiltonian of the crystal which pertain to the relativistic interactions. 
The general form of the desired expressions, however, can be deduced without such 
calculations, from simple arguments concerning symmetry. 

The Hamiltonian of the relativistic interactions contains terms of the first and second
powers in the electron spin vector operators; these are respectively the spin-orbit and
spin-spin interactions. Both kinds are small in proportion to $v^2/c^2$, where v is of the order
of magnitude of the velocities of atomic electrons, and c is the velocity of light. The
perturbation-theory series for the anisotropy energy is an expansion in powers of this small
quantity, but, because of the above-mentioned dependence of the perturbation operator
on the spin operators, the anisotropy energy is automatically obtained as a series of powers
of the direction cosines of the magnetization vector, i.e. the components of a unit vector m
in the direction of M. The anisotropy energy, like the potential $\Phi$ itself (cf. the last
footnote), is invariant under time reversal, whereas the magnetization M changes sign.
Hence it follows that the anisotropy energy must be an even function of the components of m.

For uniaxial and biaxial crystals, the expansion of the anisotropy energy begins with
squares of these components, and may be written

    $U_{\text{aniso}} = K_{ik}m_im_k$,    (40.1)

where $K_{ik}$ is a symmetrical tensor of rank two, whose components, like $U_{\text{aniso}}$ itself, have
the dimensions of energy density. In uniaxial and biaxial crystals, such a tensor has
respectively two and three independent components. In the present case, however, it must
also be borne in mind that one quadratic combination, namely $m_x^2 + m_y^2 + m_z^2 = 1$, is
independent of the direction of m and so cannot appear in the anisotropy energy. Hence
the expression (40.1) for uniaxial and biaxial crystals contains only one and two
independent coefficients respectively.

For example, in uniaxial crystals the anisotropy energy can be written

    $U_{\text{aniso}} = K(m_x^2 + m_y^2) = K\sin^2\theta$    (40.2)

or, equivalently,†

    $U_{\text{aniso}} = -Km_z^2 = -K\cos^2\theta$,

† The change from one of these expressions to the other implies that the constant K, by which
they differ, is included in the isotropic part of the expansion (39.3).

where θ is the angle between m and the z-axis, taken to be along the principal axis of
symmetry of the crystal. If K (a function of temperature) is positive, the anisotropy energy
is least for magnetization in the z-direction; this will then be the direction of easy
magnetization. Such a ferromagnet is said to be of the “easy-axis” type. If K is negative, the
direction of easy magnetization lies in the xy-plane (the basal plane of the crystal); such a
ferromagnet is said to be of the “easy-plane” type.† The expression (40.2) is isotropic in the
xy-plane. This isotropy, however, does not occur in the higher-order terms in the
expansion of $U_{\text{aniso}}$, and these determine (when K < 0) the direction of easy magnetization
in the xy-plane. The form of such terms depends on the particular crystal system to which
the crystal belongs.

For tetragonal crystals, the fourth-order terms include two independent invariants,
$(m_x^2 + m_y^2)^2$ and $m_x^2m_y^2$ (the x and y axes being taken along the two twofold axes in the
basal plane). The second of these leads to anisotropy in the basal plane.

In a hexagonal crystal, the anisotropy energy contains only one fourth-order term,
proportional to $(m_x^2 + m_y^2)^2$; in this approximation,

    $U_{\text{aniso}} = K_1\sin^2\theta + K_2\sin^4\theta$,    (40.3)

the coefficient K in (40.2) being here denoted by $K_1$. The anisotropy in the basal plane,
however, occurs only in the sixth-order terms; the anisotropic invariant of this order is

    $\tfrac12[(m_x + im_y)^6 + (m_x - im_y)^6] = \sin^6\theta\cos 6\phi$,    (40.4)

the x-axis being taken along one of the twofold axes in the basal plane, from which the
azimuthal angle $\phi$ is measured.

Lastly, rhombohedral symmetry allows two fourth-order terms, with the invariants

    $(m_x^2 + m_y^2)^2 = \sin^4\theta$,    (40.5)

    $\tfrac12 m_z[(m_x + im_y)^3 + (m_x - im_y)^3] = \cos\theta\sin^3\theta\cos 3\phi$,    (40.6)

the y-axis being along one of the twofold axes, and $\phi$ being measured from the x-axis. The
presence of the factor $\cos\theta$ in the second invariant means that the direction of easy
magnetization is at a small angle to the basal plane; since this angle is small, the determination of
this angle would require the simultaneous inclusion of both fourth-order and sixth-order
terms, one of the latter being the term with the invariant (40.4).
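
The right-hand sides of (40.4) and (40.6) follow from $m_x \pm im_y = \sin\theta\,e^{\pm i\phi}$ and $m_z = \cos\theta$. A numerical spot check is sketched below in Python; the function names and the sample angles are ours and are chosen only for illustration.

```python
import math


def planar_component(theta, phi):
    """The complex combination m_x + i*m_y for a unit vector m."""
    return complex(math.sin(theta) * math.cos(phi),
                   math.sin(theta) * math.sin(phi))


def hexagonal_invariant(theta, phi):
    """Left-hand side of (40.4)."""
    w = planar_component(theta, phi)
    return 0.5 * (w ** 6 + w.conjugate() ** 6).real


def rhombohedral_invariant(theta, phi):
    """Left-hand side of (40.6)."""
    w = planar_component(theta, phi)
    return 0.5 * math.cos(theta) * (w ** 3 + w.conjugate() ** 3).real


for theta, phi in [(0.3, 0.7), (1.2, 2.5), (2.0, 4.1)]:
    lhs6 = hexagonal_invariant(theta, phi)
    rhs6 = math.sin(theta) ** 6 * math.cos(6.0 * phi)
    lhs3 = rhombohedral_invariant(theta, phi)
    rhs3 = math.cos(theta) * math.sin(theta) ** 3 * math.cos(3.0 * phi)
    print(abs(lhs6 - rhs6) < 1e-12, abs(lhs3 - rhs3) < 1e-12)
```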

Let us now consider ferromagnetic crystals of the cubic system. Their properties differ
considerably from those of uniaxial (and biaxial) crystals. This is because the only
quadratic combination which is invariant under the cubic symmetry transformations and
can be formed from the components of the vector m is $m_x^2 + m_y^2 + m_z^2 = 1$. The first
non-vanishing term in the expansion of the anisotropy energy for a cubic crystal is therefore
of the fourth, not the second, order. For this reason, the magnetic anisotropy effects in
cubic crystals are in general weaker than in uniaxial and biaxial crystals.

Cubic symmetry allows only one independent fourth-order invariant which depends on
the direction of m. The anisotropy energy of a cubic ferromagnet can therefore be
expressed as

    $U_{\text{aniso}} = K(m_x^2m_y^2 + m_x^2m_z^2 + m_y^2m_z^2)$,

or as

    $U_{\text{aniso}} = -\tfrac12 K(m_x^4 + m_y^4 + m_z^4)$;    (40.7)

these two expressions are evidently equivalent, since their difference is $\tfrac12 K$, which is
independent of direction.

If K > 0, as for example in iron, the anisotropy energy has equal minimum values for 
three positions of the vector m, namely parallel to the edges of the cube (the x, y, z axes, 
with crystallographic directions [100], [010], [001]). In this case, therefore, the crystal has 
three equivalent axes of easy magnetization. 

If, on the other hand, K < 0, as for example in nickel, then the anisotropy energy is least
when $m_x^2 = m_y^2 = m_z^2 = \tfrac13$, i.e. when m is parallel to any of the four spatial diagonals of
the cube, with crystallographic directions [111], [$\bar{1}$11], etc. These are then the directions of
easy magnetization.†
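
The two cases K > 0 and K < 0 can be illustrated by evaluating the fourth-order cubic invariant $K(m_x^2m_y^2 + m_x^2m_z^2 + m_y^2m_z^2)$ along the principal crystallographic directions. The Python sketch below is an illustration under that assumed form of the energy; the function names and the short list of directions compared are ours.

```python
import math


def cubic_anisotropy(k, direction):
    """K*(mx^2 my^2 + mx^2 mz^2 + my^2 mz^2) for m along `direction`."""
    norm = math.sqrt(sum(c * c for c in direction))
    mx, my, mz = (c / norm for c in direction)
    return k * (mx * mx * my * my + mx * mx * mz * mz + my * my * mz * mz)


DIRECTIONS = {"[100]": (1, 0, 0), "[110]": (1, 1, 0), "[111]": (1, 1, 1)}


def easy_direction(k):
    """Direction of lowest anisotropy energy among those listed above."""
    return min(DIRECTIONS, key=lambda name: cubic_anisotropy(k, DIRECTIONS[name]))


for k in (1.0, -1.0):
    energies = {name: round(cubic_anisotropy(k, d), 4)
                for name, d in DIRECTIONS.items()}
    print("K =", k, energies, "easy direction:", easy_direction(k))
```

The invariant takes the values 0, K/4 and K/3 along a cube edge, a face diagonal and a spatial diagonal respectively, so that the edge is favoured for K > 0 and the spatial diagonal for K < 0.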

The next approximation after (40.7) for the anisotropy energy of a cubic crystal
corresponds to the sixth-order terms. Omitting from these the direction-independent
invariant $(m^2)^3$ and an expression which differs from (40.7) only by the factor $m^2$, we are
left with just one invariant, which may be taken as $m_x^2m_y^2m_z^2$. Then

    $U_{\text{aniso}} = K_1(m_x^2m_y^2 + m_x^2m_z^2 + m_y^2m_z^2) + K_2m_x^2m_y^2m_z^2$.    (40.8)


It should be noted that, strictly speaking, a ferromagnetic cubic crystal, when
spontaneously magnetized along one of the directions of easy magnetization, ceases to
possess cubic symmetry, and so there is a displacement of the atoms, i.e. a distortion of the
crystal lattice. Such a crystal, when magnetized parallel to an edge of the cube, becomes
slightly tetragonal, while one magnetized parallel to a spatial diagonal becomes
rhombohedral. In this respect cubic crystals differ from uniaxial crystals with the direction
of easy magnetization along the principal axis of symmetry, where a magnetization in this
direction evidently does not change the symmetry of the crystal.

We must again emphasize that the above expansion of the anisotropy energy of a
ferromagnet in powers of the components of the unit vector m is not one in powers of the
magnetization M itself (which is not small far from the Curie point); the series converges
only because the relativistic interactions are weak. Near the Curie point, however, where M
is small, it does become an expansion in powers of M. In the Landau theory, this would
imply that the ratio $K/M^2$ in a uniaxial crystal, with K from (40.2), must tend to a constant
limit as $T \to T_c$. To illustrate this statement, let us consider, for example, the transition from
a paramagnetic phase to a ferromagnetic phase of the easy-axis type. According to (39.3)
and (40.2), the quadratic terms in the expansion of the thermodynamic potential in powers
of the components of M are $AM_z^2 + (A + K/M^2)(M_x^2 + M_y^2)$. The transition point is
given by the vanishing of the coefficient of $M_z^2$; the coefficient of $M_x^2 + M_y^2$ does
not vanish there.

Similarly, in a cubic ferromagnet the ratio $K/M^4$, with K from (40.7), must tend to a non-zero
limit.

In the fluctuation range, however, this behaviour of the anisotropy coefficients does not hold.

§41. The magnetization curve of ferromagnets

Let us examine the relation between the magnetization of a uniaxial ferromagnet and the
magnetic field in it, taking the particular case of an easy-axis material. The anisotropy
energy may here be conveniently written in the form

    $U_{\text{aniso}} = \tfrac12\beta M^2\sin^2\theta$,    (41.1)

with the dimensionless coefficient β > 0 (and $K = \tfrac12\beta M^2$).†

It should be recalled that we suppose the magnitude of M to be independent of H, so that
only rotations of M are considered.‡ It is evident from symmetry that the vector M lies in a
plane through the z-axis and the direction of H (if terms of higher order, anisotropic in the
xy-plane, are neglected in the anisotropy energy). We take this as the xz-plane. The
thermodynamic potential, including the anisotropy energy, is§

    $\tilde\Phi = \Phi_0(M) + U_{\text{aniso}} - \mathbf{M}\cdot\mathbf{H} - H^2/8\pi$
        $= \Phi_0(M) + \tfrac12\beta M^2\sin^2\theta - M(H_x\sin\theta + H_z\cos\theta) - H^2/8\pi$.    (41.2)

The dependence of M on H is given by the equilibrium condition $\partial\tilde\Phi/\partial\theta = 0$, whence

    $\beta M\sin\theta\cos\theta = H_x\cos\theta - H_z\sin\theta$.    (41.3)

This is an algebraic equation of the fourth degree for $\xi = \sin\theta$:

    $(H_x - \beta M\xi)^2(1 - \xi^2) = H_z^2\xi^2$,

in which the coefficients of odd powers of ξ are not zero. This equation has either two or
four real roots (all less than unity). Since all such roots correspond to extrema of $\tilde\Phi(\theta)$, it is
clear that, in the first case, this function has one minimum and one maximum and, in the
second case, two minima and two maxima. In other words, the number of possible
directions of the magnetization M for a given field H is one and two respectively. In the
second case, one direction (corresponding to the lower minimum of $\tilde\Phi$) is thermodynamically
completely stable, while the other (corresponding to the higher minimum) is
thermodynamically metastable.

Either of the two cases can occur, depending on the values of $H_x$ and $H_z$. When these two
parameters vary continuously, one case passes into the other at the point where one
maximum and one minimum coalesce. The curve of $\tilde\Phi(\theta)$ then has a point of inflection
instead of an extremum, i.e. both $\partial\tilde\Phi/\partial\theta$ and $\partial^2\tilde\Phi/\partial\theta^2$ are zero. Writing equation (41.3) in
the form

    $H_x/\sin\theta - H_z/\cos\theta = \beta M$


† We use here the expression (41.1) for the anisotropy energy. It should be mentioned,
however, that the expansion of which (41.1) is the first term is usually not rapidly convergent in practice. For a
satisfactory quantitative description of the phenomena, it is therefore necessary to include also the term of the
next (fourth) order.

‡ As well as the rotation of M considered here, another process can occur in ferrites in very strong fields,
in which the magnetic moments of the sublattices turn towards each other and become parallel. This is found, however,
only in exchange fields $H \sim T_c/\mu$; for instance, in the ferrite FeO·Fe2O3 ($T_c \sim 580$ °K) these fields are


- 10’ Oe. 

§ By including $U_{\text{aniso}}$ in the thermodynamic potential $\tilde\Phi$, we imply that the anisotropy constants are
defined for given elastic stresses.

and differentiating with respect to θ, we have $H_x/\sin^3\theta = -H_z/\cos^3\theta$. Eliminating θ from
these two equations gives

    $H_x^{2/3} + H_z^{2/3} = (\beta M)^{2/3}$.    (41.4)

In the $H_xH_z$-plane equation (41.4) represents a closed astroid curve of the kind shown in
Fig. 21. It divides the plane into two parts, in one of which metastable states can exist, while
in the other they cannot. It is evident without further investigation that the region where
metastable states do not exist is that outside the curve, because for $H \to \infty$ only one
direction of M can be stable, namely that of the field H.
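
The division of the $H_xH_z$-plane by the astroid can be illustrated numerically. In the Python sketch below the potential (41.2) is written in the reduced form $\tfrac12\sin^2\theta - h_x\sin\theta - h_z\cos\theta$, with the field measured in units of βM, and its minima are counted on a uniform grid of angles. The function names, the grid size and the sample fields are assumptions made for the example; the grid count is a rough numerical device, not an exact root count.

```python
import math


def reduced_potential(theta, hx, hz):
    """The theta-dependent part of (41.2) divided by beta*M**2."""
    return 0.5 * math.sin(theta) ** 2 - hx * math.sin(theta) - hz * math.cos(theta)


def count_minima(hx, hz, n=20000):
    """Number of local minima over 0 <= theta < 2*pi, found on a periodic grid."""
    values = [reduced_potential(2.0 * math.pi * i / n, hx, hz) for i in range(n)]
    return sum(1 for i in range(n)
               if values[i] < values[i - 1] and values[i] <= values[(i + 1) % n])


def inside_astroid(hx, hz):
    """True if the reduced field lies inside the curve (41.4)."""
    return abs(hx) ** (2.0 / 3.0) + abs(hz) ** (2.0 / 3.0) < 1.0


for hx, hz in [(0.1, 0.1), (0.3, 0.3), (0.4, 0.4), (1.0, 1.0)]:
    print((hx, hz), "inside astroid:", inside_astroid(hx, hz),
          "minima:", count_minima(hx, hz))
```

For the sample fields inside the curve two minima are found (one stable and one metastable direction of M), and for those outside it only one.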



The existence of metastable states means that what is called hysteresis can occur; this is
an irreversible change of magnetization on passing through these states when the external
magnetic field is varied. The curve shown in Fig. 21 is therefore the absolute limit of
hysteresis; this phenomenon cannot occur for fields outside the curve.†

States in which the field H is perpendicular to the direction of easy magnetization
($H_x = H$, $H_z = 0$) require special consideration. The thermodynamic potential is

    $\tilde\Phi = \Phi_0 - H^2/8\pi + \tfrac12\beta M^2\sin^2\theta - HM\sin\theta$.

If H > βM, $\tilde\Phi$ has only one minimum, at $\theta = \tfrac12\pi$, i.e. the magnetization is parallel to the
field. If, however, H < βM, then $\tilde\Phi$ has a minimum when

    $M_x = M\sin\theta = H/\beta$,

to which there correspond two possible positions of the vector M (at angles θ and π − θ),
symmetrical about the x-axis. Thus in this case there are two equilibrium states, which have
the same value of $\tilde\Phi$ and are therefore equally stable.

This result is very important, since it means that two phases can exist in contact, in which
the field H is the same but the magnetization M (and therefore the induction B) is different.
Thus a new possibility appears for reducing the total thermodynamic potential of the body:
its volume may be divided into separate regions, in each of which the magnetization has
one of its two possible directions. These regions are called regions of spontaneous
magnetization or domains. The actual determination of the thermodynamic equilibrium
structure of a ferromagnet requires a consideration of the shape and size of the body as a
whole. We shall return to this problem in §44.

Let us consider a portion of the body which is small compared with the total volume but
large compared with the domains. The field $H_x$ can be regarded as constant in this portion;
we denote by $\bar{\mathbf{M}}$ and $\bar{\mathbf{B}}$ the values of M and B averaged over its volume. As well as $H_x$, the
transverse component $M_x = H_x/\beta$ of the magnetization is constant. The longitudinal
component $M_z$, however, has opposite signs in different domains, so that its mean value
certainly cannot exceed $|M_z|$. Since $H_z = 0$ everywhere, the mean induction is therefore

    $\bar{B}_x = H_x + 4\pi H_x/\beta$,   $|\bar{B}_z| \le 4\pi\sqrt{M^2 - H_x^2/\beta^2}$.    (41.7)

These formulae give the range of values of the mean induction corresponding to the
domain structure of a uniaxial ferromagnet.

The relation between M and H for a cubic crystal can in principle be investigated in the 
same way as was done above for a uniaxial crystal. However, we shall not pause to discuss 
this, because the equations are more complex, and explicit analytical formulae cannot be 
obtained. 

PROBLEMS 


Problem 1. A uniaxial ferromagnetic crystal is in the shape of a spheroid, the axis of easy magnetization
being the axis of revolution, and is placed in an external magnetic field $\mathfrak{H}$. Determine the range of values of $\mathfrak{H}$ for
which the body has a domain structure.

Solution. From the general properties of an ellipsoid in a uniform external field (§8), the induction $\bar{\mathbf{B}}$
and field H (= $\bar{\mathbf{H}}$) averaged over the domain structure are related to $\mathfrak{H}$ by

    $n\bar{B}_z + (1 - n)H_z = \mathfrak{H}_z$,   $\tfrac12(1 - n)\bar{B}_x + \tfrac12(1 + n)H_x = \mathfrak{H}_x$,

where n is the demagnetizing factor in the direction of the axis of revolution (taken as the z-axis). Putting $H_z = 0$
and using formulae (41.7), we obtain

    $H_x = \mathfrak{H}_x/[1 + 2\pi(1 - n)/\beta]$,   $|\mathfrak{H}_z| \le 4\pi n\sqrt{M^2 - H_x^2/\beta^2}$.

Elimination of $H_x$ gives the required inequality

    $\mathfrak{H}_z^2/(4\pi n)^2 + \mathfrak{H}_x^2/[\beta + 2\pi(1 - n)]^2 \le M^2$

for the range in which there is a domain structure.
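
As an illustration, the inequality $\mathfrak{H}_z^2/(4\pi n)^2 + \mathfrak{H}_x^2/[\beta + 2\pi(1 - n)]^2 \le M^2$ can be put into the form of a simple test. The Python sketch below assumes Gaussian units; the function name and the numerical values (M = 1, β = 1 and a sphere, n = 1/3) are ours and serve only as an example.

```python
import math


def has_domain_structure(hx, hz, m, beta, n):
    """True if the external field (hx, hz) satisfies the inequality of Problem 1.

    hx, hz : components of the external field across and along the easy axis
    m      : magnitude of the magnetization
    beta   : dimensionless anisotropy coefficient of (41.1)
    n      : demagnetizing factor along the axis of revolution
    """
    term_z = (hz / (4.0 * math.pi * n)) ** 2
    term_x = (hx / (beta + 2.0 * math.pi * (1.0 - n))) ** 2
    return term_z + term_x <= m * m


# Illustrative values: a sphere with M = 1 and beta = 1.
for hx, hz in [(0.0, 4.0), (0.0, 4.5), (5.0, 0.0), (5.5, 0.0)]:
    print((hx, hz), has_domain_structure(hx, hz, 1.0, 1.0, 1.0 / 3.0))
```

The region so defined is the interior of an ellipse in the $\mathfrak{H}_x\mathfrak{H}_z$-plane with semi-axes $[\beta + 2\pi(1 - n)]M$ and $4\pi nM$.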

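Problem 2. Determine the magnetization averaged over the crystallites (which have uniaxial symmetry) for
a polycrystalline body in a strong magnetic field ($H \gg 4\pi M$).

Solution. Let θ and ψ be the angles between the direction of easy magnetization and the directions of M and H
respectively. In a strong field, M is almost in the H direction, i.e. $\vartheta = \theta - \psi$ is small. Writing the part of the
thermodynamic potential which depends on the direction of M as
$\tfrac12\beta M^2\sin^2(\psi + \vartheta) - MH\cos\vartheta$ and equating to zero the derivative $\partial\tilde\Phi/\partial\vartheta$, we
have $\vartheta \approx -(\beta M/H)\sin\psi\cos\psi$. The average magnetization is clearly parallel to H, and is

    $\bar{M} = M\overline{\cos\vartheta} \approx M(1 - \tfrac12\overline{\vartheta^2}) = M[1 - (\beta^2M^2/2H^2)\,\overline{\sin^2\psi\cos^2\psi}]$,

the bar denoting averaging over the crystallites. Assuming that all directions of the axis of easy magnetization of
the crystallites are equally probable, we have

    $\bar{M} = M[1 - \beta^2M^2/15H^2]$.

Problem 3. The same as Problem 2, but for the case where the crystallites have cubic symmetry.

Solution. The conditions for a minimum of the expression

    $-\tfrac14\beta(M_x^4 + M_y^4 + M_z^4) - (H_xM_x + H_yM_y + H_zM_z)$,

with $K = \tfrac12\beta M^4$ in (40.7) and the subsidiary condition $M_x^2 + M_y^2 + M_z^2$ = constant, are

    $\beta M_x^3 + H_x = \lambda M_x$,   $\beta M_y^3 + H_y = \lambda M_y$,   $\beta M_z^3 + H_z = \lambda M_z$,

where λ is an undetermined Lagrange multiplier. For large H, we therefore have

    $M_x = H_x/\lambda + \beta H_x^3/\lambda^4 + \ldots$, etc.;

adding the squares of the three equations, we obtain $M^2 \approx H^2/\lambda^2$, i.e. $\lambda \approx H/M$. The angle ϑ between
M and H is found from

    $\sin^2\vartheta \approx \vartheta^2 = (\mathbf{M}\times\mathbf{H})^2/M^2H^2 = (\beta^2M^6/H^{10})\sum H_x^2H_y^2(H_x^2 - H_y^2)^2$,

where the summation is over cyclic permutations of the suffixes x, y, z. Averaging this expression over the
orientations of the crystallites is equivalent to averaging over directions of the vector H. The latter averaging is
effected by integrating over the angles which specify the direction of H, and the result is

    $\bar{M} = M(1 - \tfrac12\overline{\vartheta^2}) = M[1 - 2\beta^2M^6/105H^2]$.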
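
The numerical coefficients in the answers to Problems 2 and 3 come from angular averages: the mean of $\sin^2\psi\cos^2\psi$ over all directions, and the mean of $\sum h_x^2h_y^2(h_x^2 - h_y^2)^2$ over all directions of a unit vector h (the sum being over cyclic permutations). The Python sketch below estimates both by midpoint-rule integration over the sphere and then forms the mean magnetization as $M(1 - \tfrac12\overline{\vartheta^2})$. The function names and grid sizes are assumptions made for the illustration, and the small angles are taken as $\vartheta \approx (\beta M/H)\sin\psi\cos\psi$ in magnitude for the uniaxial case and $\vartheta^2 \approx (\beta^2M^6/H^2)\sum h_x^2h_y^2(h_x^2 - h_y^2)^2$ for the cubic case.

```python
import math


def average_uniaxial(n=2000):
    """Mean of sin^2(psi)*cos^2(psi) over all directions of the easy axis."""
    step = math.pi / n
    total = 0.0
    for i in range(n):
        psi = (i + 0.5) * step
        # (1/2) sin(psi) d(psi) is the normalized weight on 0 <= psi <= pi.
        total += math.sin(psi) ** 2 * math.cos(psi) ** 2 * 0.5 * math.sin(psi) * step
    return total


def average_cubic(n_theta=200, n_phi=200):
    """Mean over the unit sphere of sum h_x^2 h_y^2 (h_x^2 - h_y^2)^2 (cyclic)."""
    d_theta = math.pi / n_theta
    d_phi = 2.0 * math.pi / n_phi
    total = 0.0
    for i in range(n_theta):
        theta = (i + 0.5) * d_theta
        s, c = math.sin(theta), math.cos(theta)
        for j in range(n_phi):
            phi = (j + 0.5) * d_phi
            x2, y2, z2 = (s * math.cos(phi)) ** 2, (s * math.sin(phi)) ** 2, c * c
            f = (x2 * y2 * (x2 - y2) ** 2 + y2 * z2 * (y2 - z2) ** 2
                 + z2 * x2 * (z2 - x2) ** 2)
            total += f * s * d_theta * d_phi
    return total / (4.0 * math.pi)


def mean_m_uniaxial(m, beta, h):
    """Mean magnetization M*(1 - <theta^2>/2) for uniaxial crystallites."""
    return m * (1.0 - 0.5 * (beta * m / h) ** 2 * average_uniaxial())


def mean_m_cubic(m, beta, h):
    """Mean magnetization M*(1 - <theta^2>/2) for cubic crystallites."""
    return m * (1.0 - 0.5 * beta ** 2 * m ** 6 / h ** 2 * average_cubic())


print(average_uniaxial(), 2.0 / 15.0)
print(average_cubic(), 4.0 / 105.0)
```

The two averages agree with 2/15 and 4/105 to within the accuracy of the quadrature, which gives the factors 1/15 and 2/105 in the mean magnetization.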


§42. Magnetostriction of ferromagnets 

A change in the magnetization of a ferromagnet in a magnetic field causes a deformation 
in it; this phenomenon is called magnetostriction, and may be due to either exchange 
interactions or relativistic interactions in the body. Since the exchange energy depends only 
on the magnitude M of the magnetization, its value can change only when M changes in the 
magnetic field. Although the latter change is, in general, very small, the exchange energy is 
large compared with the anisotropy energy. Hence the magnetostriction effects from each 
type of interaction may be of comparable magnitude. 

This happens, for instance, in uniaxial crystals. Marked deformations resulting from a
change in the direction of M occur in fields $H \sim \beta M$; the change in the magnitude M is
considerable when $H \sim 4\pi M$. If these two values of H are almost the same, it is in general
necessary to take account of both effects in discussing the magnetostriction of uniaxial
ferromagnets. We shall not pause here to derive the formulae, which are fairly complex.

In cubic crystals the situation is different, because the anisotropy energy is of the fourth 
order and therefore relatively small. A considerable magnetostriction, due to the change in 
the direction of M, occurs even in comparatively weak fields, where the change in the 
magnitude M may be entirely neglected. Let us consider these effects. 

The change in the relativistic interaction energy in the deformed body is described by the
inclusion in the thermodynamic potential $\tilde\Phi$ of magnetoelastic terms depending on the
components of the elastic stress tensor $\sigma_{ik}$ and the direction of the vector M (N. S. Akulov,
1928). The first such terms which do not vanish are linear in $\sigma_{ik}$, and quadratic in the
direction cosines of M because of the symmetry with respect to time reversal. In general,
therefore, the magnetoelastic energy is given by an expression of the form

    $U_{\text{m-el}} = -a_{iklm}\sigma_{ik}m_lm_m$,    (42.1)

where $a_{iklm}$ is a dimensionless tensor of rank four, symmetrical with respect to the pairs of
suffixes i, k and l, m (but not with respect to interchange of the two pairs). Near the Curie
point, where the expansion in powers of the direction cosines of the vector M is equivalent
to one in powers of its components, the quantities $a_{iklm}/M^2$ tend to constants.

In calculating the number of independent components of the tensor $a_{iklm}$ it must be
borne in mind that the terms in (42.1) which involve the components of m in the form
$m_x^2 + m_y^2 + m_z^2$ are independent of the direction of m, and so may be omitted from the
magnetoelastic energy.† Thus we find that, in a cubic crystal, the magnetoelastic energy
contains two independent coefficients; we shall write it as

    $U_{\text{m-el}} = -a_1(\sigma_{xx}m_x^2 + \sigma_{yy}m_y^2 + \sigma_{zz}m_z^2)$
        $- 2a_2(\sigma_{xy}m_xm_y + \sigma_{xz}m_xm_z + \sigma_{yz}m_ym_z)$.    (42.2)

The strain tensor is obtained by differentiating $\tilde\Phi$ with respect to the various components
$\sigma_{ik}$: $u_{ik} = -\partial\tilde\Phi/\partial\sigma_{ik}$, where $\tilde\Phi$ includes also the ordinary elastic energy (with reversed sign;
see the first footnote to §17). For a cubic crystal, the latter energy involves three
independent elastic coefficients, and can be put in the form

    $U_{\text{el}} = \tfrac12\mu_1(\sigma_{xx}^2 + \sigma_{yy}^2 + \sigma_{zz}^2) + \tfrac12\mu_2(\sigma_{xx} + \sigma_{yy} + \sigma_{zz})^2$
        $+ \mu_3(\sigma_{xy}^2 + \sigma_{xz}^2 + \sigma_{yz}^2)$,    (42.3)

where $\mu_1$ and $\mu_3$ are positive. The strain tensor is‡

    $u_{xx} = (\mu_1 + \mu_2)\sigma_{xx} + \mu_2(\sigma_{yy} + \sigma_{zz}) + a_1m_x^2$,
    $u_{xy} = \mu_3\sigma_{xy} + a_2m_xm_y$,    (42.4)

and similarly for the other components.

These formulae give all the magnetostriction effects in the range of fields considered. In
particular, if there are no internal stresses the change in the deformation resulting from a
change in the direction of magnetization is given by

    $u_{xx} = a_1m_x^2$,   $u_{xy} = a_2m_xm_y$, etc.    (42.5)

It should be recalled that the magnitude of the deformation itself is to some extent
arbitrary, because the direction of m for which the deformation is supposed zero is
arbitrarily chosen.

The stress tensor found by solving a specific problem (e.g. for a clamped crystal) is in
order of magnitude $a/\mu$, where a and μ are the orders of magnitude of the coefficients
$a_{iklm}$ and the elastic coefficients respectively. In this sense, the magnetoelastic energy (per
unit volume, as usual) is of the order of $a^2/\mu$. The coefficients a are of the first order in the
relativistic spin-spin interaction, and the magnetoelastic energy is therefore of the second
order in this interaction. In uniaxial crystals, the anisotropy energy is of the first order in the
relativistic interaction, and is therefore usually much larger than the magnetoelastic energy. In cubic
crystals, the anisotropy energy is of the second order in this interaction, and is therefore in

† There is consequently some arbitrariness in the choice of the $a_{iklm}$, which simply reflects the
arbitrariness as to which state of the body (external mechanical forces being absent) we regard as undeformed.

‡ In differentiating, the third footnote to §17 should be recalled.

general comparable with the magnetoelastic energy.† It may therefore be necessary to take
into account simultaneously both forms of energy (for instance, in considering the
magnetization curve), which greatly complicates the problem.

Let us now consider magnetostriction in fields so strong ($H \gg 4\pi M$) that the anisotropy
energy is unimportant and there is no domain structure, so that the directions of M and H
may be assumed to coincide.

Since the anisotropy energy is neglected, the particular symmetry of the crystal is of no 
importance, and the formulae given below are valid for any ferromagnet. 

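Let the body be placed in a uniform external magnetic field $\mathfrak{H}$. Its total thermodynamic
potential $\tilde{\varPhi}$ is‡

    $\tilde{\varPhi} = -\mathscr{M}\mathfrak{H} = -MV\mathfrak{H}$,    (42.6)

where $\mathscr{M} = MV$ is the total magnetic moment of a body uniformly magnetized in the
direction of the field; we omit the term $\varPhi_0$ which is unrelated to the magnetic field. The
strain tensor averaged over the volume of the body is $-(1/V)\partial\tilde{\varPhi}/\partial\sigma_{ik}$, whence

    $\bar{u}_{ik} = \frac{\mathfrak{H}}{V}\frac{\partial(MV)}{\partial\sigma_{ik}}$.    (42.7)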

Thus the deformation is determined by the dependence of the magnetization on the 
internal stresses. 

For cubic symmetry, any symmetrical tensor of rank two characterizing the properties of
the crystal reduces to a scalar multiple of $\delta_{ik}$. This is true, in particular, of the tensor
$\partial(MV)/\partial\sigma_{ik}$, so that the magnetostriction deformation amounts in this case to a uniform
compression or extension.

If we are interested only in the change $\delta V$ in the total volume of the body, we can obtain
it by simply differentiating $\tilde{\wp}$ with respect to the pressure:

$\delta V = \partial\tilde{\wp}/\partial P = -\mathfrak{H}\,\partial(MV)/\partial P,$ (42.8)

where P is to be regarded as a uniform pressure applied to the surface of the body.


PROBLEMS 

Problem 1. Find the relative extension of a ferromagnetic cubic crystal as a function of the direction of the 
magnetization m and the direction of measurement n. 

Solution. The relative extension in the direction of the unit vector n is given in terms of the strain tensor by
$\delta l/l = u_{ik}n_in_k$. Substituting $u_{ik}$ (in the absence of internal stresses) from (42.5), we have
$\delta l/l = a_1(m_x^2n_x^2 + m_y^2n_y^2 + m_z^2n_z^2) + 2a_2(m_xm_yn_xn_y + m_xm_zn_xn_z + m_ym_zn_yn_z).$

Only the difference in the values of this quantity for different directions m and n has absolute significance. For
instance, if m is along the x-axis, the difference in the values of $\delta l/l$ along the x and y axes is $a_1$. If m is along one of
the spatial diagonals, the difference in the values of $\delta l/l$ in that direction and along the other three spatial
diagonals is $4a_2/9$.

Problem 2. Determine the change in volume in magnetostriction of a ferromagnetic ellipsoid in an external
field $\mathfrak{H} \sim 4\pi M$ parallel to one of its axes. The ferromagnet is assumed to be a cubic crystal.§

† In cubic crystals too, however, the magnetoelastic energy may be much smaller than the anisotropy energy.
In iron at room temperature, for example, the ratio is ~ 10“^. 

‡ Here the definition of $\tilde{\wp}$ is that given in §12, which is applicable except when the deformation of the body is
appreciably inhomogeneous.

§ In a uniaxial ferromagnet with $\mathfrak{H} \sim 4\pi M$ the anisotropy energy would have to be taken into account, but this
is not necessary in a cubic crystal.
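Solution. When the anisotropy energy is neglected, the range for which the domain structure exists is given
by $\bar{B} < 4\pi M$ when $\bar{H} = 0$. The bar denotes averaging over the volume of the body; cf. §41. In an ellipsoid
$n\bar{B} + (1-n)\bar{H} = \mathfrak{H}$; putting $\bar{H} = 0$ we find that the domain structure exists when $\mathfrak{H} < 4\pi nM$. Since
$n\bar{B} = 4\pi n\bar{M} = \mathfrak{H}$, the mean magnetization is $\bar{M} = \mathfrak{H}/4\pi n$. Hence the thermodynamic potential is

$\tilde{\wp} = -V\int\bar{M}\,d\mathfrak{H} = -\mathfrak{H}^2V/8\pi n.$ (1)

If $\mathfrak{H} > 4\pi nM$, the ellipsoid is magnetized entirely in the direction of the field, and $\bar{M} = M$. Then

$\tilde{\wp} = -M\mathfrak{H}V + 2\pi M^2Vn.$ (2)

The expressions (1) and (2) are the same for $\mathfrak{H} = 4\pi Mn$.

The required change in volume is obtained by differentiating $\tilde{\wp}$ with respect to pressure:

$\delta V = -\dfrac{\mathfrak{H}^2}{8\pi n}\dfrac{\partial V}{\partial P}$ for $\mathfrak{H} < 4\pi nM$, $\quad \delta V = -\mathfrak{H}\dfrac{\partial(MV)}{\partial P} + 2\pi n\dfrac{\partial(M^2V)}{\partial P}$ for $\mathfrak{H} > 4\pi nM$.

For $\mathfrak{H} \gg 4\pi nM$ we obtain (42.8).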
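
As a quick consistency check on the two regimes (a sketch only, assuming that the demagnetizing factor n does not depend on the field), the potentials (1) and (2) and the total moment $-\partial\tilde{\wp}/\partial\mathfrak{H}$ can be compared at the boundary $\mathfrak{H} = 4\pi nM$:

```latex
\begin{aligned}
\text{(1) at } \mathfrak{H}=4\pi nM:\quad & -\frac{(4\pi nM)^2V}{8\pi n} = -2\pi nM^2V,\\
\text{(2) at } \mathfrak{H}=4\pi nM:\quad & -M(4\pi nM)V + 2\pi nM^2V = -2\pi nM^2V,\\
\text{moment from (1):}\quad & -\frac{\partial\tilde{\wp}}{\partial\mathfrak{H}}
   = \frac{\mathfrak{H}V}{4\pi n}\bigg|_{\mathfrak{H}=4\pi nM} = MV .
\end{aligned}
```

Both the potential and the total magnetic moment are thus continuous at the field where the domain structure disappears.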


§43. Surface tension of a domain wall 

As already mentioned in §41, there exists a wide range of states in which a ferromagnet 
must have a domain structure, i.e. must consist of regions with different directions of 
magnetization. This is true, in particular, of a ferromagnet which is not in an external 
magnetic field. 

Thermodynamically, adjacent domains are different phases of a ferromagnet, with 
different directions of their spontaneous magnetization. Let us consider, first of all, the 
properties of the phase boundaries (domain walls) as such, and calculate their surface
tension (L. D. Landau and E. M. Lifshitz, 1935). 

The phase boundaries are in reality fairly narrow transition layers in which the direction 
of the magnetization varies continuously between its directions in the two adjoining 
domains. The “width” of such a layer and the manner in which M varies within it are given 
by the conditions of thermodynamic equilibrium. The additional energy due to the
non-uniformity of the magnetization must be taken into account. The largest contribution to
this non-uniformity energy is given by the exchange interaction. Macroscopically, this 
energy can be expressed in terms of the derivatives of M with respect to the coordinates. 
This can be done in a general form if the gradient of the direction of M is supposed 
relatively small, i.e. if the change in the direction of the magnetic moments occurs over 
distances large compared with the interatomic distances. In the present case, this condition 
is evidently fulfilled, because a considerable difference in the directions of the magnetic 
moments of adjoining atoms would lead to a very large increase in the exchange energy, 
and is therefore thermodynamically unfavourable. 

We denote the non-uniformity energy density by $U_{\text{non-u}}$. There can be no terms of the
form $a_{ik}(M)\,\partial M_i/\partial x_k$ linear in the first derivatives, because of the requirement of symmetry
under time reversal, which changes the sign of M but leaves the energy unchanged. The
crystal symmetry might allow the existence of terms containing products of the derivatives
$\partial M_i/\partial x_k$ with the components of M itself. We are interested here in the non-uniformity
exchange energy; the corresponding terms in $U_{\text{non-u}}$ must be invariant when the vectors M
are simultaneously rotated by the same amount throughout all space (with no change in
the coordinates used).† The terms involving such products can then only have the form

$a_i(M)\,M_k\,\partial M_k/\partial x_i = \tfrac{1}{2}\,\mathbf{a}(M)\cdot\operatorname{grad}M^2.$

Such terms cannot, however, actually represent the non-uniformity energy, even if the 
magnitude as well as the direction of M is assumed to change within the crystal. The reason 
is that not $U_{\text{non-u}}$ itself but only its integral over the volume of the body has a real
thermodynamic significance, and such an integration would reduce terms of the above
form to expressions depending only on the values of the magnetization on the surface of
the body, not on its variation within the body.‡

For the same reason, we need not include in $U_{\text{non-u}}$, among the terms of the next order of
smallness, those linear in the second derivatives of M with respect to the coordinates, since 
on integration over the volume of the body they become expressions quadratic in the first 
derivatives. The quadratic expressions are the principal non-vanishing terms in the 
expansion of the non-uniformity exchange energy. 

The most general form of these terms is 

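$U_{\text{non-u}} = \tfrac{1}{2}\alpha_{ik}\,\dfrac{\partial\mathbf{M}}{\partial x_i}\cdot\dfrac{\partial\mathbf{M}}{\partial x_k},$ (43.1)

where $\alpha_{ik}$ is a symmetrical tensor. For the ferromagnetic order to be stable, this expression
must be positive definite, i.e. the principal values of the tensor $\alpha_{ik}$ must be positive. In a
cubic crystal, this tensor reduces to a scalar ($\alpha_{ik} = \alpha\delta_{ik}$, $\alpha > 0$), so that§

$U_{\text{non-u}} = \tfrac{1}{2}\alpha\,\dfrac{\partial\mathbf{M}}{\partial x_i}\cdot\dfrac{\partial\mathbf{M}}{\partial x_i}.$ (43.2)

In a uniaxial crystal, $\alpha_{ik}$ has two independent components, and the non-uniformity energy
is of the form

$U_{\text{non-u}} = \tfrac{1}{2}\alpha_1\left[(\partial\mathbf{M}/\partial x)^2 + (\partial\mathbf{M}/\partial y)^2\right] + \tfrac{1}{2}\alpha_2(\partial\mathbf{M}/\partial z)^2.$ (43.3)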

It must be emphasized that, far from the Curie point, the expressions (43.1)–(43.3) are
to be regarded as the first terms in an expansion in powers of the derivatives of the unit
vector m = M/M, not of the magnetization M itself. Only near the Curie point do they
become an expansion in powers of the derivatives of M. Accordingly, in the Landau theory
the coefficients in these expressions should tend, as $T \to T_c$, to non-zero limits. (See §47
regarding the role of fluctuations here.)

As an example, let us consider the boundary between phases in a uniaxial crystal of the 
easy-axis type, assuming that the vector M is parallel (or antiparallel) to the direction of 
easy magnetization (the z-axis). 

The structure of the transition layer is determined by the condition that the total free 
energy of the layer be a minimum.‖ Here the exchange energy tends to increase the width of


† This means that the scalar expression $U_{\text{non-u}}$ must be so constructed that the magnetic and coordinate vector
suffixes are each interchanged among themselves but not with each other.

‡ The symmetry of the crystal may, however, allow the existence of terms having the form $M_i\,\partial M_k/\partial x_l$ and
not of an exchange nature. This would alter the nature of the ferromagnetic order in the crystal; see §52.

§ In order of magnitude (in iron, for example), a ~ 10“cm. 

‖ When the magnetostriction is neglected, there is no need to distinguish the free energy $\tilde{\mathcal{F}}$ from the
thermodynamic potential $\tilde{\wp}$. If, however, we intend to take account of the elastic and magnetoelastic energies of
the non-uniform deformation (as may sometimes be necessary; see Problem 2), the total free energy must be used.
Here it should be remembered that the equilibrium equations of the medium are found by varying its total free
energy with respect to the components of the displacement vector u; cf. the derivation for a fluid at the end of §15,
the equilibrium equation f = 0 being obtained by equating to zero the variation $\delta\mathcal{F}$.
the layer (i.e. to make the direction of M vary less rapidly). The anisotropy energy has the
opposite effect, because any deviation of M from the direction of easy magnetization
increases this energy.

We take the x-axis perpendicular to the plane of the layer; the direction of M depends
only on x. The rotation of the vector M across the layer must take place in the yz-plane, i.e.
$M_x = 0$ everywhere. This is seen as follows. The non-uniformity and anisotropy energies in
a uniaxial crystal are independent of the plane in which the rotation of the magnetization
takes place. The presence of a non-zero component $M_x$ would necessarily result in a
magnetic field which was thermodynamically unfavourable, because of the additional
magnetic energy. For it follows from the equation div B = $dB_x/dx$ = 0 that
$B_x$ = constant throughout the transition layer, and, since $M_x = 0$ and $H_x = 0$ within the
domains, we find that $B_x = 0$ everywhere. Hence, together with the component $M_x \neq 0$, a
field $H_x = -4\pi M_x$ must occur. Correspondingly, the free energy $\tilde{F}$ contains a term†
$-M_xH_x - H_x^2/8\pi = 2\pi M_x^2 > 0$.

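Let $\theta$ be the angle between M and the z-axis. Then the components of M are $M_x = 0$,
$M_y = M\sin\theta$, $M_z = M\cos\theta$. The sum of the non-uniformity energy (43.3) and the
anisotropy energy (41.1) is the integral


$\int(U_{\text{non-u}} + U_{\text{aniso}})\,dx = \tfrac{1}{2}M^2\int(\alpha_1\theta'^2 + \beta\sin^2\theta)\,dx,$ (43.4)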


where the prime denotes differentiation with respect to x. The remaining terms in the free 
energy are independent of the structure of the layer, and so can be omitted here. 

To determine the function $\theta(x)$ which makes this integral a minimum, we write down the
corresponding Euler's equation $\alpha_1\theta'' - \beta\sin\theta\cos\theta = 0$. Assuming the width of the
transition layer small compared with that of the domains themselves, we can write the
boundary conditions on this equation as

$\theta(+\infty) = 0, \quad \theta(-\infty) = \pi, \quad \theta'(\pm\infty) = 0.$ (43.5)

These state that adjoining domains are magnetized in opposite directions. The first integral
of Euler's equation which satisfies these conditions is‡

$\theta'^2 - (\beta/\alpha_1)\sin^2\theta = 0.$ (43.6)

A second integration gives

$\cos\theta = \tanh\left[x\sqrt{\beta/\alpha_1}\right],$ (43.7)

which describes the manner of variation of the magnetization in the transition layer. The
width of this layer is $\delta \sim \sqrt{\alpha_1/\beta}$.


† More precisely, we should use here the thermodynamic potential $\tilde{F}'$; cf. the end of §44. For a given value of
$B_x$ this potential must have a minimum with respect to $M_x$ or $H_x$. When $B_x = 0$, however, $\tilde{F}$ and $\tilde{F}'$ are the
same.

‡ This expression can, of course, be written down directly (without using Euler's equation) if we notice that the
integral (43.4) is formally identical with the action integral for a one-dimensional motion of a particle in a field whose
potential energy is $-\beta\sin^2\theta$, with $\theta$ acting as the coordinate and x as the time. Then (43.6) expresses the
conservation of energy for the particle.
With (43.6), the integral (43.4) becomes

$M^2\alpha_1\int\theta'^2\,dx = M^2\sqrt{\alpha_1\beta}\int_0^{\pi}\sin\theta\,d\theta = 2M^2\sqrt{\alpha_1\beta}.$

If the boundary between domains is regarded as a geometrical surface, this quantity is the
surface tension which must be ascribed to the boundary in order to take account of the
energy needed to create it. The surface tension of a domain wall may be denoted by $\Delta M^2$,
where $\Delta$ has the dimensions of length. Then

$\Delta = 2\sqrt{\alpha_1\beta}.$
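
The value $\Delta = 2\sqrt{\alpha_1\beta}$ can be checked numerically. The following is a minimal sketch, not part of the original text: it evaluates the integral (43.4), divided by $M^2$, on the profile (43.7) with a midpoint rule. The function names and the sample values of $\alpha_1$ and $\beta$ are illustrative assumptions.

```python
import math


def wall_angle(x, alpha1, beta):
    """theta(x) from (43.7): cos(theta) = tanh(x * sqrt(beta / alpha1))."""
    return math.acos(math.tanh(x * math.sqrt(beta / alpha1)))


def wall_tension_numeric(alpha1, beta, half_width=40.0, steps=40000):
    """Midpoint-rule value of (1/2) * integral(alpha1*theta'^2 + beta*sin^2(theta)) dx,
    i.e. the integral (43.4) divided by M^2, evaluated on the profile (43.7)."""
    k = math.sqrt(beta / alpha1)      # inverse wall width, 1/delta
    length = half_width / k           # integrate over |x| <= half_width * delta
    h = 2.0 * length / steps
    total = 0.0
    for i in range(steps):
        x = -length + (i + 0.5) * h
        sin_t = 1.0 / math.cosh(k * x)    # sin(theta) = sech(kx) when cos(theta) = tanh(kx)
        dtheta = -k * sin_t               # theta' from the first integral (43.6)
        total += 0.5 * (alpha1 * dtheta ** 2 + beta * sin_t ** 2) * h
    return total


if __name__ == "__main__":
    a1, b = 2.0, 0.5                      # hypothetical sample values
    print(wall_tension_numeric(a1, b), 2.0 * math.sqrt(a1 * b))
```

With these sample values both printed numbers agree to within the quadrature error, and `wall_angle` runs from $\pi$ at large negative x to 0 at large positive x, as required by (43.5).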

PROBLEMS 

PROBLEM 1. Determine the surface tension of a domain wall in a cubic ferromagnet with the dire^-^ 
Lifshitz, 1944; L. Neel, 1944). 

Solution. (a) The domain wall is parallel to the yz-plane; all quantities in it depend only on the coordinate x.

Using the anisotropy energy (with $K = \tfrac{1}{2}\beta M^2$), we find as the free energy of the wall
$\tfrac{1}{2}M^2\int(\alpha\theta'^2 + \beta\sin^2\theta\cos^2\theta)\,dx,$

where $\theta$ is the angle between M and the z-axis. The first integral of Euler's equation

issiill-sssss 

which is not zero even with the above approximations: 


$\Delta_{(100)} = \sqrt{\alpha\beta}\cdot 2\int_0^{\pi/2}\sin\theta\cos\theta\,d\theta = \sqrt{\alpha\beta}.$




$U_{\text{aniso}} = \tfrac{1}{8}\beta M^2\sin^2\theta\,(3\cos^2\theta + 1).$


;e the first integral of Euler’s equation for the variational problem: 
$\theta'^2 = A\sin^2\theta\,(\cos^2\theta + B),$ (1)


the wall plane. Hence, again using t ' 


g the conditions (43.5), we find the equation for the wall structure: 

$\sinh\left[x\sqrt{A(1+B)}\right] = \sqrt{\dfrac{1+B}{B}}\,\cot\theta,$ (2)
$\sinh\left[x\sqrt{\beta/\alpha}\right] = 2\cot\theta.$

The surface tension is

$\Delta = \alpha\sqrt{A}\,\{\sqrt{1+B} + B\sinh^{-1}B^{-1/2}\},$ (3)

with the above values of A and B,

$\Delta_{(110)} = 1.38\sqrt{\alpha\beta}.$
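
The numerical coefficient 1.38 can be reproduced directly. The sketch below is an illustration only: it assumes that formula (3) has the form $\Delta = \alpha\sqrt{A}\{\sqrt{1+B} + B\sinh^{-1}B^{-1/2}\}$, which is what integrating $\alpha\int\theta'\,d\theta$ over $0 \le \theta \le \pi$ gives for a first integral $\theta'^2 = A\sin^2\theta(\cos^2\theta + B)$, and that $A = 3\beta/4\alpha$, $B = 1/3$ for this wall. The function names are hypothetical.

```python
import math


def tension_formula(alpha, A, B):
    """Closed form alpha*sqrt(A)*(sqrt(1+B) + B*asinh(B**-0.5)); requires B > 0."""
    return alpha * math.sqrt(A) * (math.sqrt(1.0 + B) + B * math.asinh(B ** -0.5))


def tension_quadrature(alpha, A, B, steps=20000):
    """Midpoint-rule value of alpha*sqrt(A) * integral_0^pi sin(t)*sqrt(cos(t)^2 + B) dt."""
    h = math.pi / steps
    total = 0.0
    for i in range(steps):
        t = (i + 0.5) * h
        total += math.sin(t) * math.sqrt(math.cos(t) ** 2 + B)
    return alpha * math.sqrt(A) * total * h


if __name__ == "__main__":
    alpha, beta = 1.0, 1.0                      # units in which sqrt(alpha*beta) = 1
    A, B = 3.0 * beta / (4.0 * alpha), 1.0 / 3.0
    print(tension_formula(alpha, A, B), tension_quadrature(alpha, A, B))
```

In the limit of small B with $A = \beta/\alpha$ the same closed form tends to $\sqrt{\alpha\beta}$.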

Problem 2. Find the structure of the domain wall in the (100) plane whose surface tension was calculated in 
Problem 1(a) (E. M. Lifshitz, 1944). 

Solution. As already mentioned, a non-zero width of this wall occurs only when the magnetostriction
energy is taken into account. The structure of the wall is determined by the condition for the free energy to be a
minimum; the free energy density F is to be expressed in terms of $u_{ik}$ (cf. the fourth footnote to §43). The
corresponding expressions for the magnetoelastic and elastic energies are analogous to (42.2) and (42.3) with
different coefficients:


-el = fc. + u„mf) + 2fc, 

= 5^1 + u,, -1- u„)^ -I- Aj (u,/ + + u^f); 

here we have put = 0. 

Not only the magnetization distribution but also the strain in the transition layer must be a function of x only. 
It follows that the y and z components of the displacement vector u must have the form = constant x y, 
u, = constant x z; if the constants here were replaced by functions of x, then and would^depend on y or z! 
Thus Uj,, and are constants. From the general equations of elastic equilibrium, dajdx^ = 0, it follows that 
(J,; = 0; at X = + 00, where the strain is zero, we must have Oji = 0, and therefore = = G 

everywhere. Calculating these components of the stress tensor as the derivatives = dF/du^^, we find that 
% - = 0. “j(x = constant. Thus all the Uji are in fact constant. It is therefore sufficient to calculate their values 

at infinity, where all the rr,.* = 0, = 0, = ± 1. The equations = 0, rT„ - rr,, = 0 give 

Omitting the constant terms from and we find that the further term sin^fi is to be 

R _ 01,*2 ^™’n-u+ Oaniso- The determination of 0(x) thus amounts to solving equation (1) with A = p/a, 
- 2b, /A, pM . The constant B, which represents the ratio of the magnetostriction and anisotropy energies, is 
small.† Putting B = 0 in (3), we obtain the value of $\Delta_{(100)}$ already found in Problem 1(a). From (2) we find as the
distribution of magnetization in the wall

sinh ^{p/a)x = f{/.ipMf2bf)cote. 

The width of this distribution. 


S'-f {a/P) log [X.pMfbf), 

depends considerably on the magnetostriction constant.‡

3 crystal of the same kind as in Problem 2, find the surface tension of a domain wall between 
P'ane ^ T) directions, in two cases: (a) the wall is parallel to the (100) 

(b) the wall is parallel to the (011) plane (S. V. Vonsovskiĭ, 1944; L. Néel, 1944).§

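Solution. In both cases, the magnetoelastic energy may be neglected.

(a) The only difference from Problem 1(a) lies in the boundary conditions

$\theta(-\infty) = 0, \quad \theta(+\infty) = \tfrac{1}{2}\pi, \quad \theta'(\pm\infty) = 0.$

The wall structure is given by the solution
$\tan\theta = \exp\left[x\sqrt{\beta/\alpha}\right],$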

j ^or example, in iron at room temperature B ~ 2 x 10“^. 

‡ A 180-degree wall (as regards the angle of rotation of the vector M in it) decays as it were into two 90-degree
walls, separated by a region with $\theta = \tfrac{1}{2}\pi$ which tends to infinity.

§ In a cubic ferromagnet (unlike a uniaxial one; see the second footnote to §44), a 90-degree wall is a true phase
boundary, since both domains are stable phases and each is magnetized in one of the easy directions.
and the surface tension is $\Delta_{(100)} = \tfrac{1}{2}\sqrt{\alpha\beta}$,

half the value for a 180-degree wall.

(b) Together with the crystallographic x, y, z axes, we use axes x, η, ζ as shown in Fig. 22: the x-axis is
perpendicular to the plane of the diagram, and the arrows show the directions of M in domains separated by the
plane η = 0. In the transition layer, M rotates over half a circular cone with its axis along the η-axis;
$M_\eta = \text{constant} = M/\sqrt{2}$, so that div M = $M_\eta'$ = 0, as it should be (the prime denotes differentiation with respect
to η). Let $\phi$ denote the angle between the projection of M on the xζ-plane and the ζ-axis; $\phi$ varies from 0 to π.
Then $m_\eta = 1/\sqrt{2}$, $m_\zeta = (1/\sqrt{2})\cos\phi$, $m_x = (1/\sqrt{2})\sin\phi$, $m_y = \tfrac{1}{2}(1-\cos\phi)$, $m_z = \tfrac{1}{2}(1+\cos\phi)$. The
non-uniformity and anisotropy energies are

$U_{\text{non-u}} = \tfrac{1}{4}\alpha M^2\phi'^2, \quad U_{\text{aniso}} = \tfrac{1}{4}\beta M^2(\sin^2\phi - \tfrac{3}{8}\sin^4\phi).$



Fig. 22 

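The surface tension is

$\Delta_{(011)} = \tfrac{1}{2}\sqrt{\alpha\beta}\cdot 2\int_0^{\pi/2}\sin\theta\,[1 - \tfrac{3}{8}\sin^2\theta]^{1/2}\,d\theta = 1.73\times\tfrac{1}{2}\sqrt{\alpha\beta}.$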
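
The factor 1.73 and the form of the anisotropy energy on the cone can be checked with a short calculation. This is a hedged sketch rather than part of the original solution: it assumes the cubic anisotropy energy $K(m_x^2m_y^2 + m_x^2m_z^2 + m_y^2m_z^2)$ with $K = \tfrac{1}{2}\beta M^2$ and the components of m quoted above; the function names are hypothetical.

```python
import math


def anisotropy_invariant_on_cone(phi):
    """m_x^2 m_y^2 + m_x^2 m_z^2 + m_y^2 m_z^2 for m on the half-cone of the solution."""
    mx = math.sin(phi) / math.sqrt(2.0)
    my = 0.5 * (1.0 - math.cos(phi))
    mz = 0.5 * (1.0 + math.cos(phi))
    return mx * mx * my * my + mx * mx * mz * mz + my * my * mz * mz


def reduced_form(phi):
    """(1/2) * (sin^2(phi) - (3/8) sin^4(phi)), the closed form of the invariant above."""
    s2 = math.sin(phi) ** 2
    return 0.5 * (s2 - 0.375 * s2 * s2)


def cone_wall_factor(steps=20000):
    """2 * integral_0^{pi/2} sin(t) * sqrt(1 - (3/8) sin(t)^2) dt by the midpoint rule."""
    h = (math.pi / 2.0) / steps
    total = 0.0
    for i in range(steps):
        t = (i + 0.5) * h
        total += math.sin(t) * math.sqrt(1.0 - 0.375 * math.sin(t) ** 2)
    return 2.0 * total * h


if __name__ == "__main__":
    print(cone_wall_factor())   # multiplies (1/2)*sqrt(alpha*beta)
```

Multiplying the invariant by $K = \tfrac{1}{2}\beta M^2$ reproduces the quoted $U_{\text{aniso}}$, and the integral rounds to 1.73.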

Problem 4. Find the surface tension of a domain wall in a uniaxial crystal when the transition between 
domains takes place by a change in the magnitude of M without rotation, its direction being reversed as M passes 
through zero. The dependence of the free energy on M (when H = 0) is taken in the form of the expansion (39.3), 
appropriate near the Curie point (V. A. Zhirnov, 1958).

Solution. Throughout the transition layer, $M_z$ is equal to M, and varies in the x-direction, which is
perpendicular to the wall plane. The free energy density is, taking into account the non-uniformity energy,

$F = F_0 - |A|M^2 + BM^4 + \tfrac{1}{2}\alpha_1M'^2.$ (1)

The equilibrium value of the magnetization within the domains is here denoted by $M_0$: $M_0^2 = |A|/2B$; see (39.5).

With $m = M/M_0$ ($|m| \le 1$), we can write the free energy of the wall as

$\tfrac{1}{2}|A|M_0^2\int\left[(1-m^2)^2 + (\alpha_1/|A|)\,m'^2\right]dx;$

the additive constant in F is chosen so that F is zero within the domains. This integral is to be minimized with the
boundary conditions $m(+\infty) = 1$, $m(-\infty) = -1$, $m'(\pm\infty) = 0$. The first integral of Euler's equation for this
variational problem is $(\alpha_1/|A|)\,m'^2 = (1-m^2)^2$. Hence

$m(x) = \tanh\left[x\sqrt{|A|/\alpha_1}\right],$

the calculation of the integral gives for the surface tension $M_0^2\Delta$ the value

$\Delta = \tfrac{4}{3}\sqrt{\alpha_1|A|}.$ (2)

This wall structure can in principle occur sufficiently close to the Curie point (if $\beta/|A|$ tends to infinity as $T \to T_c$),
where a change in the magnitude of M becomes energetically more favourable than a deviation of M from the
direction of easy magnetization.
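
The coefficient in the surface tension of this wall can be verified numerically. The sketch below is illustrative only: it assumes the wall free energy $\tfrac{1}{2}|A|M_0^2\int[(1-m^2)^2 + (\alpha_1/|A|)m'^2]\,dx$ and the profile $m(x) = \tanh[x\sqrt{|A|/\alpha_1}]$, and compares the result (divided by $M_0^2$) with $\tfrac{4}{3}\sqrt{\alpha_1|A|}$. Function names and sample values are hypothetical.

```python
import math


def order_parameter(x, alpha1, abs_a):
    """m(x) = tanh(x * sqrt(|A| / alpha1))."""
    return math.tanh(x * math.sqrt(abs_a / alpha1))


def wall_delta(alpha1, abs_a, half_width=30.0, steps=60000):
    """Midpoint-rule value of (1/2)|A| * integral[(1-m^2)^2 + (alpha1/|A|) m'^2] dx."""
    k = math.sqrt(abs_a / alpha1)
    length = half_width / k
    h = 2.0 * length / steps
    total = 0.0
    for i in range(steps):
        x = -length + (i + 0.5) * h
        m = order_parameter(x, alpha1, abs_a)
        dm = k * (1.0 - m * m)              # derivative of tanh(kx)
        total += ((1.0 - m * m) ** 2 + (alpha1 / abs_a) * dm * dm) * h
    return 0.5 * abs_a * total


if __name__ == "__main__":
    a1, abs_a = 3.0, 0.75                   # hypothetical values, sqrt(a1*|A|) = 1.5
    print(wall_delta(a1, abs_a), 4.0 / 3.0 * math.sqrt(a1 * abs_a))
```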

§44. The domain structure of ferromagnets

Let us now ascertain the actual shape and size of the domains.†
Some conclusions concerning the shape of the surfaces separating the domains may be 
obtained directly from the boundary conditions on the magnetic field. Since the field H is 
the same in adjoining domains, the condition of continuity of the normal induction $B_n$
reduces to the continuity of $M_n$. In uniaxial crystals, the sign of $M_z$ is different in different
domains, but $M_x$ and $M_y$ are the same. Under these conditions the continuity of $M_n$ means
that the surface of separation must be parallel to the z-axis, i.e. to the direction of easy 
magnetization. 

The shape and size of the domains in thermodynamic equilibrium are given by the 
condition that the total free energy should be a minimum. They depend considerably on 
the actual shape and size of the body. In the simplest case, that of a ferromagnet in the form 
of a flat plate, the domains may in principle form parallel layers across the body from one
surface to the other. In what follows we shall take this case. 

The formation of an entire new boundary between domains results in an increase in the 
total surface tension energy. This energy consequently tends to reduce the number of 
domains, i.e. to increase their thickness. The excess energy near the outer surface of the 
body, where the domains emerge, has the opposite effect. In the body the magnetic field 
H = 0, and the anisotropy energy is also zero, because the vector M is in a direction of easy 
magnetization. Near the surface, however, this is not so.

The way in which the domains emerge at the surface of the body is different in the
limiting cases of large and small magnetic anisotropy. A natural measure of this quantity in
the present case is not the coefficient $\beta$ itself in (41.1) but $\beta/4\pi$. This is seen from the
expression $1 + 4\pi/\beta$ (41.7) for the transverse magnetic permeability of a uniaxial
ferromagnet.

When the magnetic anisotropy is large, the layers must emerge at the surface with no
change in the direction of M (Fig. 23a, where for simplicity we suppose that the
surface of the body is perpendicular to the direction of easy magnetization). Near the surface there is a
magnetic field which penetrates into the surrounding space, and into the body, to distances
of the order of the layer thickness a.

In the opposite case of weak anisotropy, a more favourable disposition is that where
there is no magnetic field, and M deviates from the direction of easy magnetization. For
H = 0 we must have everywhere div B = 4π div M = 0, and $M_n$ must be continuous at all


† The concept of domains was first put forward by P. Weiss (1907). The thermodynamic theory of domains was
developed by L. D. Landau and E. M. Lifshitz (1935).

Fig. 23


domain boundaries and at the free surface. This is achieved by the setting up of domains of 
closure with triangular cross-section (Fig. 23b), in which the magnetization is parallel to the 
surface of the body. The total volume of these regions, and therefore the anisotropy energy 
in them, are proportional to the layer thickness a.†

Thus in all cases the emergence of the domains at the surface of the body results in an 
excess energy which increases with the thickness of the domains. This effect therefore tends 
to reduce the domain thickness. 

The actual thickness of the domains is determined by the equilibrium of the two 
oppositely acting effects, namely the surface energy of the domain walls and the energy of 
emergence of the domains at the surface of the body. The number of plane-layer domains in 
a plate is proportional to 1/a, while the surface-tension energy at the surfaces separating
them is proportional to their total area, i.e. to l/a, where l is the thickness of the plate. The
energy of emergence is proportional to a. The sum of these two energies has, as a function
of a, a minimum when a has a value proportional to $\sqrt{l}$.

For example, in the case of weak anisotropy (Fig. 23b), the energy of emergence per unit
area of the two sides of the plate is $\sim \beta M^2a$; the surface energy is $M^2\Delta l/a$ (the plate
thickness l is, of course, assumed much greater than the domain thickness). Hence

$a \sim \sqrt{l\Delta/\beta} \sim \sqrt{l\delta}.$ (44.1)
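
The minimization behind (44.1) can be written out in one line. The following is an order-of-magnitude sketch, taking the energy of emergence as $\beta M^2a$ (anisotropy energy density $\sim\beta M^2$ in closure domains whose volume per unit area is $\sim a$) and using $\Delta \sim \sqrt{\alpha_1\beta}$, $\delta \sim \sqrt{\alpha_1/\beta}$ from §43:

```latex
\begin{aligned}
E(a) &\sim \beta M^2 a + \frac{M^2\Delta\, l}{a},\\
\frac{dE}{da} = 0 \;&\Longrightarrow\; \beta M^2 = \frac{M^2\Delta\, l}{a^2}
  \;\Longrightarrow\; a \sim \sqrt{l\Delta/\beta},\\
\frac{\Delta}{\beta} &\sim \frac{\sqrt{\alpha_1\beta}}{\beta} = \sqrt{\alpha_1/\beta} \sim \delta
  \;\Longrightarrow\; a \sim \sqrt{l\delta}.
\end{aligned}
```

The domain thickness is thus the geometric mean of the plate thickness and the wall width, up to a numerical factor.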

Thus the thickness of the domains increases with the dimension of the body, but the
quantitative law $a \propto \sqrt{l}$ of this increase is based on the assumption that the domains are of


† It should be emphasized that the boundary between the principal domains and the domains of closure, which
separates regions where the directions of M differ by 90°, is not in this case (i.e. in uniaxial crystals) a phase
boundary in the literal sense, as is seen from the fact that a state with magnetization perpendicular to the easy axis
(when H = 0) is in itself unstable and not a possible phase of matter. Strictly speaking, the distribution of
magnetization shown in Fig. 23b relates only to the limit $\beta \to 0$. Even in the first approximation with respect to the
small quantity $\beta/4\pi$, deviations from this pattern and transitions between principal domains and domains of
closure occur at distances less than the domain thickness a by this small factor, and fields $\sim \beta M$ occur (although
this does not affect the estimate of the energy of domain emergence at the surface of the body).
constant thickness, and clearly cannot be valid for all values of l. The reason is that the
thickness of the domains at the surface of the body cannot exceed some limiting value
which depends on the properties of the ferromagnet but not on the shape and size of the
body. This limiting value is determined by the point at which, as a increases, the splitting of the
domain near the surface to a depth ~ a becomes thermodynamically favourable. Such a
point must necessarily be reached, since the energy of the emergence of one domain
increases as $a^2$, whereas the excess surface-tension energy resulting from the splitting of the
domain increases only as a.

Thus we conclude that, as the size of the body, and therefore the domain thickness, 
increase, a progressive branching of the domains occurs as they approach the surface of the 
body (E. M. Lifshitz, 1944).† In principle, when l becomes sufficiently great, the branching
continues until the thickness of the branches at the surface itself becomes comparable with
the domain wall width $\delta$.

The function a(l) can be found for this limiting case. In making estimates, we shall take
the particular case of weak anisotropy. 

Figure 24 shows the domain branching pattern. The energy of emergence of the domains 
is here made up of the additional surface energy of wedge-shaped domains and the 
anisotropy energy due to the deviation of M from the z-axis as a result of branching; here 
there are no triangular domains of closure. Assuming rapid convergence of the sum over 
successive branchings, it is sufficient to consider the first wedge, whose length is denoted by 
h. It is evident from energy favourability arguments that h is much greater than the
thickness of the wedge (or, equivalently as regards order of magnitude, than a), i.e. the angle
$\vartheta$ between the wedge boundary and the z-axis is $\vartheta \sim a/h \ll 1$. The surface energy of the
wedge is $\sim hM^2\Delta$, and the anisotropy energy due to the wedge is $\sim ha\beta M^2\vartheta^2
\sim a^3\beta M^2/h$; the energy of emergence, the sum of these two, is least when $h^2 \sim a^3\beta/\Delta$, i.e.
when

$a/h \sim \sqrt{\Delta/a\beta} \sim \sqrt{\delta/a} \ll 1$ (44.2)



† The point at which branching begins varies greatly according to the nature of the magnetic
anisotropy and according to the configuration of the sample surface in relation to the crystallographic axes. The
initial stages of domain branching have been investigated by E. M. Lifshitz (Journal of Physics 8, 337, 1944).
(with the assumption, of course, that $a \gg \delta$). The energy of emergence per unit area of the
plate surface is then $\sim M^2\Delta h/a \sim M^2\sqrt{\beta\Delta a}$. The surface energy of the principal domain
boundaries is $\sim M^2\Delta l/a$. Minimizing the sum of these two expressions gives

$a \sim (\Delta/\beta)^{1/3}\,l^{2/3} \sim \delta^{1/3}l^{2/3}$ (44.3)

(I. A. Privorotskiĭ, 1970).†
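
The scaling of the domain thickness with plate thickness in the branched regime can be illustrated numerically. This is a minimal sketch under stated assumptions, not the author's calculation: all numerical factors are set to unity, the energy per unit area (in units of $M^2$) is taken as $\sqrt{\beta\Delta a} + \Delta l/a$, and the function names and the golden-section search are illustrative choices.

```python
import math


def energy_per_area(a, l, delta_cap, beta):
    """Order-of-magnitude energy in units of M^2: branched emergence term
    sqrt(beta*Delta*a) plus principal-wall term Delta*l/a."""
    return math.sqrt(beta * delta_cap * a) + delta_cap * l / a


def optimal_thickness(l, delta_cap, beta, lo=1e-12, hi=1e12, iters=200):
    """Golden-section search for the minimizing a, carried out in log(a)."""
    ratio = (math.sqrt(5.0) - 1.0) / 2.0
    u_lo, u_hi = math.log(lo), math.log(hi)
    for _ in range(iters):
        u1 = u_hi - ratio * (u_hi - u_lo)
        u2 = u_lo + ratio * (u_hi - u_lo)
        if energy_per_area(math.exp(u1), l, delta_cap, beta) < energy_per_area(math.exp(u2), l, delta_cap, beta):
            u_hi = u2      # minimum lies in [u_lo, u2]
        else:
            u_lo = u1      # minimum lies in [u1, u_hi]
    return math.exp(0.5 * (u_lo + u_hi))


def closed_form_thickness(l, delta_cap, beta):
    """Stationary point of energy_per_area: a = (2l)^(2/3) * (Delta/beta)^(1/3)."""
    return (2.0 * l) ** (2.0 / 3.0) * (delta_cap / beta) ** (1.0 / 3.0)


if __name__ == "__main__":
    # Increasing l eightfold should increase a fourfold (a proportional to l^(2/3)).
    print(optimal_thickness(80.0, 0.01, 0.5) / optimal_thickness(10.0, 0.01, 0.5))
```

The factor $2^{2/3}$ in the closed form has no significance at the level of accuracy of these estimates; only the exponents in (44.3) are meaningful.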

To conclude this section, let us consider the phase equilibrium conditions in a 
ferromagnet from a somewhat more general standpoint than was needed for the above 
discussion, by no longer assuming that all the components of H are equal in the two phases; 
this formulation may be appropriate, for instance, in the treatment of curved domain 
boundaries. 

First of all, the general magnetostatic conditions (29.13) must be satisfied:

$B_{1n} = B_{2n}, \quad \mathbf{H}_{1t} = \mathbf{H}_{2t};$ (44.4)

these follow from Maxwell's equations div B = 0, curl H = 0. In addition, the thermodynamic
condition must be satisfied which expresses the equilibrium with regard to a
movement of the interface (in the direction normal to it), i.e. with regard to a transfer of
matter from one phase to the other. This condition is represented by the equality in the two
phases of the thermodynamic potentials relative to $B_n$ and $\mathbf{H}_t$.‡ To find these, it is sufficient
to rewrite the product $-\mathbf{B}\cdot d\mathbf{H}$ in the expression for the differential $d\tilde{F}$ (31.6) (again
neglecting magnetostriction) as

$-\mathbf{B}\cdot d\mathbf{H} = -\mathbf{B}_t\cdot d\mathbf{H}_t - B_n\,dH_n$

$= -d(B_nH_n) - \mathbf{B}_t\cdot d\mathbf{H}_t + H_n\,dB_n.$

From this it is clear that the required thermodynamic potential $\tilde{F}'$ is

$\tilde{F}' = \tilde{F} + B_nH_n/4\pi,$ (44.5)

and the boundary condition sought is

$\tilde{F}'_1 = \tilde{F}'_2$ (44.6)

(I. A. Privorotskiĭ and M. Ya. Azbel', 1969).


PROBLEM 

Determine the energy of the magnetic field near the surface of a ferromagnet at which plane-parallel domains 
perpendicular to the surface emerge without change in the direction of magnetization (Fig. 23a). 

Solution. The problem of determining the magnetic field near such a surface is equivalent to the
electrostatic problem of the field due to a plane divided into strips charged alternately positively and negatively
with surface charge density $\sigma = \pm M$.

Let the surface of the body be the plane z = 0, and let the x-axis be perpendicular to the plane of the domains.
The “surface charge density” $\sigma(x)$ is a periodic function with period 2a (a being the width of the domains), and its


† The idea of domain branching and the result that $a \propto l^{2/3}$ in the limit were first put forward by L. D. Landau
for the intermediate state of superconductors (§57).

‡ This statement is justified in a similar way to the derivation of the ordinary condition of phase equilibrium,
the equality of the chemical potentials of the two phases, in which the independent variables are the temperature
and the pressure, these being equal in the two phases; see SP 1, §81.
value in a typical period is $\sigma = -M$ for $-a < x < 0$, $\sigma = +M$ for $0 < x < a$. Its expansion in Fourier series is

$\sigma(x) = \sum_{n=0}^{\infty}c_n\sin\dfrac{(2n+1)\pi x}{a}, \quad c_n = 4M/(2n+1)\pi.$

The field potential satisfies Laplace's equation

$\dfrac{\partial^2\phi}{\partial x^2} + \dfrac{\partial^2\phi}{\partial z^2} = 0;$

its solution is sought in the form

$\phi(x, z) = \sum_{n=0}^{\infty}b_n\sin\dfrac{(2n+1)\pi x}{a}\,e^{\mp(2n+1)\pi z/a},$

where the two signs in the exponent relate to the half-spaces z > 0 and z < 0. The coefficients $b_n$ are given by the
boundary condition

$-[\partial\phi/\partial z]_{z=0+} + [\partial\phi/\partial z]_{z=0-} = 4\pi\sigma,$

whence $b_n = 2ac_n/(2n+1)$.

The required field energy can be calculated as the integral $\tfrac{1}{2}\int\sigma\phi\,df$ over the “charged surface”. The energy per
unit area of the surface is

$\dfrac{1}{4a}\int_0^{2a}[\sigma\phi]_{z=0}\,dx = \tfrac{1}{4}\sum_{n=0}^{\infty}c_nb_n = \dfrac{8aM^2}{\pi^2}\sum_{n=0}^{\infty}\dfrac{1}{(2n+1)^3} = \dfrac{7aM^2}{\pi^2}\zeta(3).$

With the zeta function $\zeta(3) = 1.202$, the result is equal to $0.852\,aM^2$. The width of the domains in the plate is
found by minimizing the sum $1.7\,aM^2 + M^2\Delta l/a$, the first term being the energy of emergence of domains on both
sides of the plate, and the second term the surface energy. Hence $a = 0.8\sqrt{\Delta l}$ (C. Kittel, 1946).
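
The numerical coefficients in this solution follow from a single sum over odd integers. The sketch below, with hypothetical function names, evaluates $(8/\pi^2)\sum 1/(2n+1)^3$ by direct summation and then the prefactor of $\sqrt{\Delta l}$ obtained by minimizing $2c\,aM^2 + M^2\Delta l/a$, where c is that sum; the truncation at a finite number of terms is an assumption of the sketch.

```python
import math


def odd_inverse_cubes(terms=200000):
    """Partial sum of 1/(2n+1)^3 for n = 0 .. terms-1 (converges to (7/8)*zeta(3))."""
    return sum(1.0 / (2 * n + 1) ** 3 for n in range(terms))


def emergence_energy_coefficient():
    """Coefficient c in the field energy c*a*M^2 per unit area of one surface."""
    return 8.0 / math.pi ** 2 * odd_inverse_cubes()


def domain_width_coefficient():
    """Prefactor k in a = k*sqrt(Delta*l), from d/da [2*c*a + Delta*l/a] = 0."""
    return 1.0 / math.sqrt(2.0 * emergence_energy_coefficient())


if __name__ == "__main__":
    print(emergence_energy_coefficient(), domain_width_coefficient())
```

The first number reproduces 0.852; the second is the unrounded prefactor of $\sqrt{\Delta l}$.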


§45. Single-domain particles 

As the dimensions of the body decrease, the formation of any domains at all ultimately 
becomes thermodynamically unfavourable, so that sufficiently small ferromagnetic 
particles are uniformly magnetized single domains. The criterion giving their dimension l is
obtained by comparing the magnetic energy of a uniformly magnetized particle with the
non-uniformity energy which would result if there were considerable non-uniformity in
the distribution of the magnetization over its volume. The former energy is of the order
of $M^2V$, and the latter of the order of $\alpha M^2V/l^2$. The condition for a single domain to be
formed is therefore†

$l \lesssim \sqrt{\alpha}.$ (45.1)

In order to ascertain the behaviour of a uniformly magnetized particle in an external
magnetic field $\mathfrak{H}$, we must consider its total free energy, substituting in (32.7) for $\tilde{F}$ the sum of
•1) and the anisotropy energyrf 

$\tilde{\mathcal{F}} = VU_{\text{aniso}} - \tfrac{1}{2}\mathbf{M}\cdot\int(\mathbf{H} + \mathfrak{H})\,dV,$ (45.2)

the integration being taken only over the volume of the body, and the unimportant


† The term micromagnetism is sometimes used to refer to the properties of assemblies of such particles.
‡ Neglecting the magnetostriction, we make no distinction between the thermodynamic potential and the
free energy, considering the latter for a given volume V of the body.
constant being omitted. Let the particle be ellipsoidal. Then the field H within it is
given by (29.14), or

$H_i = \mathfrak{H}_i - 4\pi n_{ik}M_k;$ (45.3)

the second term here is the “demagnetizing field” due to the body. We thus find 

^ = Inna, M; L- LM • § + (45.4) 

The first term is called the intrinsic magnetostatic energy of the magnetized particle; the 
second term is the energy of the particle in the external field. 

The direction of magnetization of the particle in the external field $\mathfrak{H}$ is determined by the
condition that $\tilde{\mathscr{F}}$ be a minimum as a function of the direction of M. For a cubic crystal, the
anisotropy energy may be neglected in (45.4). For a uniaxial crystal, writing the anisotropy
energy in the form $\tfrac12\beta_{ik}M_iM_k$, we have

\[ \tilde{\mathscr{F}} = \tfrac12 V(4\pi n_{ik} + \beta_{ik})M_iM_k - V\mathfrak{H}\cdot\mathbf{M}. \tag{45.5} \]

The problem thus formulated is mathematically identical with the one in §41 concerning
the dependence of the local magnetization M on the local field H, the only difference being
that H is replaced by $\mathfrak{H}$, and $\beta_{ik}$ by $4\pi n_{ik}$ or by $4\pi n_{ik} + \beta_{ik}$.
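
As an illustration of how (45.5) fixes the direction of M, here is a minimal numerical sketch. It assumes, purely for illustration, a spheroidal particle whose symmetry axis coincides with the crystal axis z, so that $n_{ik}$ is diagonal with values `n_perp`, `n_perp`, `n_par`, and a uniaxial anisotropy $\tfrac12\beta(M_x^2 + M_y^2)$; M and the external field are taken in one plane through the axis. The function names are hypothetical, and the minimum is located by a plain grid search.

```python
import math


def energy_density(theta, M, h, psi, n_perp, n_par, beta):
    """Free energy (45.5) per unit volume for M at angle theta to the z-axis.

    h and psi are the magnitude of the external field and its angle to z.
    """
    m_par = M * math.cos(theta)
    m_perp = M * math.sin(theta)
    quadratic = 0.5 * (4 * math.pi * n_par * m_par ** 2
                       + (4 * math.pi * n_perp + beta) * m_perp ** 2)
    field_term = -h * M * math.cos(theta - psi)
    return quadratic + field_term


def equilibrium_angle(M, h, psi, n_perp, n_par, beta, steps=3600):
    """Angle theta in [0, 2*pi) that minimizes energy_density on a grid."""
    thetas = [2 * math.pi * i / steps for i in range(steps)]
    return min(thetas,
               key=lambda th: energy_density(th, M, h, psi, n_perp, n_par, beta))


# Sphere (all demagnetizing factors 1/3) in a field perpendicular to the axis.
theta = equilibrium_angle(M=1.0, h=1.0, psi=math.pi / 2,
                          n_perp=1 / 3, n_par=1 / 3, beta=2.0)
print("sin(theta) =", math.sin(theta))
```

For a field perpendicular to the axis and weaker than $[4\pi(n_\perp - n_\parallel) + \beta]M$, setting the derivative of the energy to zero gives $\sin\theta = \mathfrak{H}/[4\pi(n_\perp - n_\parallel) + \beta]M$, which the grid search reproduces.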

Lastly, let us derive the equation which must be satisfied by the distribution of the 
magnetization in a single-domain sample, under conditions such that this distribution 
cannot be supposed uniform. To do so, we must require that the total free energy of the 
body be a minimum; we write it as the integral 

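\[ \tilde{\mathscr{F}} = \int\tilde{F}\,dV
   = \int\{F_0(M) + U_{\mathrm{non\text{-}u}} + U_{\mathrm{aniso}} - \mathbf{M}\cdot\mathbf{H} - H^2/8\pi\}\,dV, \tag{45.6} \]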

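taken over all space. The variation is with respect to M, which is here a function of the
coordinates, the value of H being specified at every point; the magnitude of M is fixed, and
only its direction varies. Omitting from the integrand terms which depend only on M or on
H, we vary the integral

\[ \int\Bigl\{\tfrac12\alpha_{ik}\frac{\partial\mathbf{M}}{\partial x_i}\cdot\frac{\partial\mathbf{M}}{\partial x_k}
   + U_{\mathrm{aniso}} - \mathbf{M}\cdot\mathbf{H}\Bigr\}\,dV, \]

which is now taken only over the volume of the body (where M ≠ 0). Integrating the first
term by parts after the variation, we find

\[ \delta\tilde{\mathscr{F}} = \int\Bigl\{-\alpha_{ik}M^2\frac{\partial^2\mathbf{m}}{\partial x_i\,\partial x_k}
   + \frac{\partial U_{\mathrm{aniso}}}{\partial\mathbf{m}} - M\mathbf{H}\Bigr\}\cdot\delta\mathbf{m}\,dV
   + \oint\alpha_{ik}M^2\Bigl(\frac{\partial\mathbf{m}}{\partial x_k}\cdot\delta\mathbf{m}\Bigr)\,df_i; \tag{45.7} \]

here m = M/M is the unit vector in the direction of M, and the second integral is taken over
the surface of the body. Since $\mathbf{m}^2 = 1$, we have $\mathbf{m}\cdot\delta\mathbf{m} = 0$, i.e. the variation is of the form
$\delta\mathbf{m} = \delta\boldsymbol{\omega}\times\mathbf{m}$, where $\delta\boldsymbol{\omega}$ is an arbitrary (small) function of the coordinates. The condition
$\delta\tilde{\mathscr{F}} = 0$ gives the required equation† when the coefficient of $\delta\boldsymbol{\omega}$ in the volume integrand
is put equal to zero:

\[ \mathbf{m}\times\Bigl(\alpha_{ik}M\frac{\partial^2\mathbf{m}}{\partial x_i\,\partial x_k}
   - \frac{1}{M}\frac{\partial U_{\mathrm{aniso}}}{\partial\mathbf{m}} + \mathbf{H}\Bigr) = 0. \tag{45.8} \]

Equating the surface integral to zero, we obtain the boundary condition on this equation;
for example, when $\alpha_{ik} = \alpha\delta_{ik}$ the condition is

\[ \mathbf{m}\times\partial\mathbf{m}/\partial n = 0, \tag{45.9} \]

where n is the direction of the normal to the surface of the body.

† It is, of course, the same as the equation of motion (precession) of the magnetic moment in a ferromagnet if
the rate of change $\partial\mathbf{M}/\partial t$ is put equal to zero (see SP 2, §69).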

Together with (45.8), Maxwell’s equations 

\[ \mathrm{div}\,(\mathbf{H} + 4\pi M\mathbf{m}) = 0, \qquad \mathrm{curl}\,\mathbf{H} = 0 \tag{45.10} \]

must of course be satisfied in all space, with the usual boundary conditions on the surface
of the body and the condition $\mathbf{H}\to\mathfrak{H}$ at infinity.‡

For a uniformly magnetized (ellipsoidal) body the first term in the parentheses in (45.8) is 
zero. The remaining equation, with H from (45.3), is the same as the condition for a 
minimum of the free energy (45.5). 


§46. Orientational transitions 


The anisotropy constant of a ferromagnet is a function of temperature, and so can 
change sign at any point. The direction of the spontaneous magnetization then changes also, 
and therefore so does the symmetry of the magnetic structure. The resulting transitions 
between different phases of a magnetic substance are called orientational transitions. Let us 
see how such transitions occur in a uniaxial hexagonal ferromagnetic crystal (H. Horner 
and C. M. Varma, 1968). 

Since we have in mind to consider the neighbourhood of a point where the anisotropy
constant $K_1$ becomes zero, it is necessary to take into account also the next term in the
expansion of the anisotropy energy; for a hexagonal ferromagnet, such an expression for
$U_{\mathrm{aniso}}$ is given by (40.3).

Let us first suppose that $K_2 > 0$. The minimum of $U_{\mathrm{aniso}}$ then corresponds to the
following phases, depending on the values of $K_1$ and $K_2$:

\[ \begin{aligned}
&\text{(I)}   & \theta &= 0,\ \pi                   & &\text{for } K_1 > 0, \\
&\text{(II)}  & \sin\theta &= \pm\sqrt{-K_1/2K_2}   & &\text{for } -2K_2 < K_1 < 0, \\
&\text{(III)} & \theta &= \tfrac12\pi               & &\text{for } K_1 < -2K_2.
\end{aligned} \tag{46.1} \]


Phases I and III are of the easy axis and easy plane type respectively. In phase II, the
magnetization vector does not have a fixed orientation as in phases I and III; as the
temperature varies, its direction changes continuously between θ = 0 (or π) and θ = ½π.
The symmetry of this phase (sometimes called the angular phase) is lower than that of both
phase I and phase III. The transitions between phases I and II, and II and III, take place as
second-order phase transitions at temperatures $T_1$ and $T_2$ given by the conditions

\[ K_1(T_1) = 0, \qquad K_1(T_2) + 2K_2(T_2) = 0. \tag{46.2} \]

We shall take the particular case where $K_1 > 0$ for $T > T_1$. Then $T_2 < T_1$ if $K_2 > 0$.

‡ The question may arise whether it is correct to vary the integral (45.6) with respect to m for constant H, even
though these are related by the first equation (45.10). In fact, however, if we put (from the second equation)
$\mathbf{H} = -\mathrm{grad}\,\phi$ and calculate the variation of the integral with respect to $\phi$, the result is zero by virtue of the first
equation. Thus the variation with respect to H gives no contribution to $\delta\tilde{\mathscr{F}}$.

In the presence of a magnetic field, the thermodynamic potential is

\[ \tilde\Phi = K_1\sin^2\theta + K_2\sin^4\theta - \mathbf{H}\cdot\mathbf{M}. \tag{46.3} \]

Only the terms which depend on the direction of M are shown here. Near the transition
point between phases I and II, the order parameter is the small quantity $\sin\theta \approx \theta = \eta$. In
this range, the thermodynamic potential in a weak transverse magnetic field $H = H_x$ is

\[ \tilde\Phi = K_1\eta^2 + K_2\eta^4 - MH_x\eta, \]

with $K_1$ = constant × $(T - T_1)$. In the usual way (cf. §39), we deduce that at the transition
point the transverse susceptibility

\[ \chi_\perp = (\partial M_x/\partial H_x)_{H_x\to 0} \]

tends to infinity in a similar manner to that given by (39.6) and (39.7). Analogously, near the
transition point between phases II and III, the order parameter is the small angle
$\eta = \tfrac12\pi - \theta$. In a longitudinal field $H = H_z$, the thermodynamic potential contains a term
$-MH_z\eta$, and at the transition point the longitudinal susceptibility $\chi_\parallel$ becomes infinite.
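
The divergence of the transverse susceptibility can be made explicit by minimizing the potential $K_1\eta^2 + K_2\eta^4 - MH_x\eta$ on either side of $T_1$. The following is a short worked sketch, assuming $K_2 > 0$, a vanishingly weak field, and $M_x = M\eta$:

```latex
\begin{aligned}
K_1 > 0:\quad & \frac{\partial\tilde\Phi}{\partial\eta} = 2K_1\eta - MH_x = 0
   \;\Rightarrow\; \eta = \frac{MH_x}{2K_1},
   \qquad \chi_\perp = \frac{M^2}{2K_1}; \\
K_1 < 0:\quad & \eta_0^2 = -\frac{K_1}{2K_2}, \qquad
   \left.\frac{\partial^2\tilde\Phi}{\partial\eta^2}\right|_{\eta_0}
   = 2K_1 + 12K_2\eta_0^2 = 4|K_1|
   \;\Rightarrow\; \delta\eta = \frac{MH_x}{4|K_1|},
   \qquad \chi_\perp = \frac{M^2}{4|K_1|}.
\end{aligned}
```

Since $K_1 \propto T - T_1$, both expressions diverge as $1/|T - T_1|$, with a coefficient twice as large on the side $K_1 > 0$ as on the side $K_1 < 0$.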

The above argument is based on the Landau theory of phase transitions. For 
orientational transitions, this theory is valid almost without restrictions. The closeness to 
the transition point that is allowable in the Landau theory is determined by a condition (see
(47.1) below) having in the denominator the cube of the coefficient α in the term (43.1) of the
thermodynamic potential, which arises from the non-uniform distribution of the order
parameter. In the present case, this term is due to the exchange interactions in the
ferromagnet, whereas the expansion terms relative to the parameter η itself are due to the
relativistic interactions. This is the cause of the extreme narrowness of the temperature 
range near the transition point within which the Landau theory is not valid. 

Now let $K_2 < 0$. Then phase II is unstable ($U_{\mathrm{aniso}}$ has a maximum and not a minimum),
so that the easy-axis phase I must pass directly into the easy-plane phase III. It is clear from
the start that this cannot be a second-order transition, since neither of the symmetry
groups of phases I and III is a sub-group of the other. The transition takes place as a
first-order phase transition at the point $T = T_0$ which is given by the equality of the
thermodynamic potentials in the two phases (i.e., of the values of $U_{\mathrm{aniso}}$):

\[ K_1(T_0) + K_2(T_0) = 0. \tag{46.4} \]

The point $T_0$ lies between $T_1$ and $T_2$ as given by equations (46.2) (where now $T_2 > T_1$). In
this case, the temperatures $T_1$ and $T_2$ determine the limits of metastability of phases I and III
respectively; beyond these limits, $U_{\mathrm{aniso}}$ has a maximum and not a minimum at θ = 0 or ½π.
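
The phase assignments (46.1) and the first-order condition (46.4) can be cross-checked numerically. The sketch below assumes the anisotropy energy has the form $U_{\mathrm{aniso}} = K_1\sin^2\theta + K_2\sin^4\theta$, as in (46.3); the function names are illustrative. One function applies the analytic criteria, the other simply searches for the minimum on a grid in θ.

```python
import math


def anisotropy_energy(theta, K1, K2):
    """U_aniso = K1*sin^2(theta) + K2*sin^4(theta) (assumed form)."""
    s2 = math.sin(theta) ** 2
    return K1 * s2 + K2 * s2 * s2


def equilibrium_phase(K1, K2):
    """Phase label from (46.1) for K2 > 0, from the sign of K1 + K2 otherwise."""
    if K2 > 0:
        if K1 > 0:
            return "I"
        if K1 > -2 * K2:
            return "II"
        return "III"
    # For K2 <= 0 the angular phase is not a minimum; compare U at 0 and pi/2.
    return "I" if K1 + K2 > 0 else "III"


def equilibrium_theta(K1, K2, steps=9000):
    """Grid search for the minimizing angle in [0, pi/2]."""
    thetas = [0.5 * math.pi * i / steps for i in range(steps + 1)]
    return min(thetas, key=lambda th: anisotropy_energy(th, K1, K2))


for K1, K2 in [(1.0, 1.0), (-1.0, 1.0), (-3.0, 1.0), (2.0, -1.0), (0.5, -1.0)]:
    print(K1, K2, equilibrium_phase(K1, K2), equilibrium_theta(K1, K2))
```

For $K_1 = -1$, $K_2 = 1$ the grid search returns θ = π/4, in agreement with $\sin\theta = \sqrt{-K_1/2K_2}$.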

Orientational transitions can occur not only when the temperature varies (spontaneous 
transitions) but also when the magnetic field applied to the body changes (field-induced 
transitions). The transition points occupy curves in the (H,T) phase diagrams (for a given 
crystallographic orientation of H). As an example, let us consider the phase diagrams of the 
same uniaxial ferromagnet in a field parallel to the hexagonal axis. 

A longitudinal field does not affect the symmetry of the easy-axis phase (EAP in Fig. 25).
The easy-plane phase, however, becomes an angular phase (AP), since the field moves the
magnetization M out of the basal plane.

Let us first consider the case where $K_2 > 0$. The regions of the two phases are separated
by second-order phase transition curves beginning from $T_1$ on the abscissa axis (dashed
curves in Fig. 25a). The upper and lower parts of the diagram correspond to two opposite
directions of the longitudinal field, and accordingly to opposite signs of the longitudinal
component $M_z$. Near the curve $AT_1$ the angle θ is small (near $A'T_1$, π − θ is small). As far as
terms of the fourth order in θ, we have from (46.3)

\[ \tilde\Phi = (K_1 + \tfrac12 MH_z)\,\theta^2 + (K_2 - \tfrac13 K_1 - \tfrac1{24}MH_z)\,\theta^4. \tag{46.5} \]

The equation of the curve $AT_1$ is found by equating to zero the coefficient of $\theta^2$:

\[ K_1(T) + \tfrac12 MH_z = 0 \tag{46.6} \]

($K_1 < 0$ when $T < T_1$); the curve $A'T_1$ is obviously given by the same equation with the sign
of the second term reversed, and is symmetrical to the curve $AT_1$.

The segment $T_2T_c$ of the abscissa axis is a first-order phase transition line, on which two
phases with opposite signs of $M_z$ are in equilibrium. The second-order phase transition
which occurs at $T_2$ in the absence of the field disappears when the field is present; $T_2$ is a
critical point in the (H,T) phase diagram, at which a first-order transition line terminates.
Another termination point of this line is the Curie point $T_c$ (at which the magnetization
vanishes when H = 0).

When $K_2 < 0$ (Fig. 25b), the initial part of the boundary $ABT_0$ between the two phases in
the (H,T) diagram (and similarly $A'B'T_0$) is a first-order phase transition curve (the
continuous curve $BT_0$), on either side of which are regions of metastability of the two
phases bounded by the dashed curves $BT_1$ and $BT_2$.† At B (a tricritical point) the first-order
transition curve becomes a second-order one (the dashed curve BA, whose equation is
(46.6)). The coordinates of this point are given by the simultaneous vanishing of the
coefficients of $\theta^2$ and $\theta^4$ in the thermodynamic potential (46.5), i.e. by the equations‡

\[ MH_z = 2|K_1(T)|, \qquad K_1(T) = 4K_2(T). \tag{46.7} \]

Lastly, the segment $T_0T_c$ is a line of first-order transitions between phases with oppositely
directed magnetizations $M_z = \pm M$.

† It is assumed that the interface between two coexistent phases is parallel to the magnetic field, so that H is
continuous at this interface.
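
The coefficients in the fourth-order expansion (46.5) and the tricritical conditions (46.7) can be verified numerically. The sketch below is an illustration with hypothetical function names: it expands $K_1\sin^2\theta + K_2\sin^4\theta - MH_z\cos\theta$ using $\sin^2\theta \approx \theta^2 - \theta^4/3$ and $\cos\theta \approx 1 - \theta^2/2 + \theta^4/24$, restores the constant $-MH_z$ that (46.5) omits, and compares the result with the exact expression at a small angle.

```python
import math


def exact_potential(theta, K1, K2, MH):
    """K1 sin^2 + K2 sin^4 - M*H_z*cos(theta); MH denotes the product M*H_z."""
    s2 = math.sin(theta) ** 2
    return K1 * s2 + K2 * s2 * s2 - MH * math.cos(theta)


def expansion_coefficients(K1, K2, MH):
    """Coefficients of theta^2 and theta^4 in the small-angle expansion."""
    c2 = K1 + 0.5 * MH
    c4 = K2 - K1 / 3.0 - MH / 24.0
    return c2, c4


def truncated_potential(theta, K1, K2, MH):
    """Fourth-order expansion, including the constant term -MH."""
    c2, c4 = expansion_coefficients(K1, K2, MH)
    return -MH + c2 * theta ** 2 + c4 * theta ** 4


# The difference should be of sixth order in theta.
theta = 0.05
print(exact_potential(theta, -0.7, 0.4, 1.1) - truncated_potential(theta, -0.7, 0.4, 1.1))

# Tricritical point: K1 = 4*K2 (with K2 < 0) and M*H_z = 2*|K1|.
K2 = -0.25
K1 = 4 * K2
print(expansion_coefficients(K1, K2, 2 * abs(K1)))
```

Both coefficients vanish at the second set of parameters, as (46.7) requires.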


§47. Fluctuations in ferromagnets 

The Landau theory of phase transitions, on which the discussion in §39 was based, does 
not take account of fluctuations of the order parameter, and thus becomes invalid 
sufficiently near the Curie point. The range of applicability of this theory is determined by 
the condition 

\[ |t|/T_c \gg T_cB^2/a\alpha^3, \tag{47.1} \]

where $t = T - T_c$, a and B are the coefficients in the expansion (39.3), (39.4), and α is of the
order of the components of the tensor $\alpha_{ik}$ in (43.1). We must also, of course, have $|t| \ll T_c$ as
the condition of closeness to the Curie point.§

When the inequality sign in (47.1) is reversed, i.e. in the fluctuation region, the order
parameter fluctuations are of decisive importance. The rigorous statistical theory for the
Curie point of a ferromagnet in this region would have to be based on the effective
Hamiltonian

\[ \mathscr{H}_{\mathrm{eff}} = \int\Bigl\{atM^2 + B(M^2)^2
   + \tfrac12\alpha_0\frac{\partial\mathbf{M}}{\partial x_i}\cdot\frac{\partial\mathbf{M}}{\partial x_i}
   - \mathbf{H}\cdot\mathbf{M}\Bigr\}\,dV. \tag{47.2} \]

The function M(r) is here assumed to vary only slowly, in the sense that its Fourier
expansion contains only wave numbers much less than the characteristic reciprocal atomic
distance. The coefficients a, B, and $\alpha_0$ are the same as the expansion coefficients in the
Landau theory, i.e. are independent of temperature; for simplicity, the derivative term is
written on the assumption that the crystal has cubic symmetry, the coefficient being
denoted by $\alpha_0$ to distinguish it from the true coefficient α (see below). The effective
Hamiltonian (47.2) corresponds to the exchange approximation, neglecting the anisotropy
energy; in that approximation, we should also neglect the fluctuations of the magnetic field
that are due to those of the magnetization (i.e. the magnetostatic energy of the fluctuations,
cf. SP 2, §70). The exchange approximation is characterized by a “degeneracy”: the order
parameter M has three components, but the effective Hamiltonian is invariant when this
vector is rotated through the same angle in all space.

‡ Near the tricritical point in the (H, T) phase diagram, we can apply the results derived in SP 1, §150 (within
the Landau theory) for the tricritical point in the PT-plane.
§ We use the results given in SP 1, §§146-149. The reader is assumed to be familiar with these; they are
mentioned afresh only to make the discussion more coherent. The appropriate references are given there.

To describe the behaviour of the thermodynamic quantities near a second-order phase
transition, use is made of critical indices, defined in a way which will be repeated here as it
applies to the Curie point in a ferromagnet. The index α describes the temperature
dependence of the specific heat on either side of the Curie point in the absence of a field:†

\[ C_p \propto |t|^{-\alpha} \quad\text{for } H = 0. \tag{47.3} \]


The index β describes the temperature dependence of the spontaneous magnetization
below the Curie point:

\[ M \propto (-t)^{\beta} \quad\text{for } t < 0,\ H = 0. \tag{47.4} \]

The index γ describes the dependence on t of the magnetic susceptibility in the
paramagnetic phase:

\[ \chi \propto t^{-\gamma} \quad\text{for } t > 0,\ H = 0; \tag{47.5} \]

see below concerning the behaviour of the susceptibility when t < 0. The field dependence
of the magnetization at the Curie point itself is written

\[ M \propto H^{1/\delta} \quad\text{for } t = 0. \tag{47.6} \]

The temperature dependence of the correlation radius of the magnetization fluctuations
is described by the index ν:

\[ r_c \propto |t|^{-\nu} \quad\text{for } H = 0. \tag{47.7} \]

At the Curie point itself, the correlation function is a decreasing power function of the
distance:‡

\[ G_{ik}(r) = \langle\delta M_i(0)\,\delta M_k(\mathbf{r})\rangle \propto r^{-(1+\zeta)} \quad\text{for } t = 0,\ H = 0. \tag{47.8} \]

The critical indices are connected by certain relations, some of which follow from the 
hypothesis of scale invariance. These relations are universal ones, and in particular do not 
depend on the number of components of the order parameter; they enable all the above- 
mentioned indices to be expressed in terms of any two of them. 

Table 2 gives the values of the critical indices for a ferromagnetic transition. For 
comparison, we give also the values for other degrees of degeneracy n (where n is the 
number of components of the order parameter M in the effective Hamiltonian in the form 
(47.2); n = 3 for a three-dimensional ferromagnet in the exchange approximation). § 

In the paramagnetic phase, the correlation function decreases exponentially at distances
$r \gg r_c$. In the ferromagnetic phase, we have to distinguish fluctuations with and without a
change in the magnitude of M. The degeneracy of the Curie point problem in the exchange
approximation makes its appearance here.

The difference does not arise when the change in M, including the change in its direction,
takes place over short distances $r \ll r_c$; the correlation function then has the same
behaviour (47.8) for all fluctuations. At distances $r \gg r_c$ (wave numbers such that $kr_c \ll 1$),
the fluctuations of the direction of M increase “anomalously” because of the decreased
amount of energy needed for this departure from equilibrium; for a rotation of the
magnetization uniform throughout the crystal, k → 0, and no energy at all is needed. The
correlation function of the fluctuations of the direction of M at such distances can be
derived thermodynamically.

Table 2

  n      α        β        γ        ν       δ      ζ
  1     0.110    0.325    1.240    0.630   4.8    0.031
  2    -0.007    0.346    1.315    0.669   4.8    0.033
  3    -0.115    0.364    1.387    0.705   4.8    0.033

† This refers to the “singular” part of the specific heat: when α < 0, the specific heat $C_p = C_{p0}$ +
constant × $|t|^{|\alpha|}$. The index α is not to be confused with the coefficient α in (47.1), (47.2) and below.
‡ We are everywhere considering only ordinary three-dimensional bodies.
§ The values of the small indices α and ζ are taken from numerical calculations (J. C. Le Guillou and J. Zinn-
Justin, Physical Review B 21, 3976, 1980). The others have been calculated from these two.
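
The statement that the remaining indices follow from α and ζ can be illustrated with the standard scaling relations for a space of dimension d. These are not written out in this section and are quoted here as an assumption: $\nu = (2-\alpha)/d$, $\beta = \tfrac12\nu(d-2+\zeta)$, $\gamma = \nu(2-\zeta)$, $\delta = (d+2-\zeta)/(d-2+\zeta)$. The sketch below applies them with d = 3 to the α and ζ values of Table 2; the function and variable names are illustrative.

```python
def indices_from_alpha_zeta(alpha, zeta, d=3):
    """Critical indices from alpha and zeta via the standard scaling relations."""
    nu = (2.0 - alpha) / d
    beta = 0.5 * nu * (d - 2 + zeta)
    gamma = nu * (2.0 - zeta)
    delta = (d + 2 - zeta) / (d - 2 + zeta)
    return {"nu": nu, "beta": beta, "gamma": gamma, "delta": delta}


# (alpha, zeta) as listed in Table 2 for n = 1, 2, 3.
table_input = {1: (0.110, 0.031), 2: (-0.007, 0.033), 3: (-0.115, 0.033)}

for n, (alpha, zeta) in table_input.items():
    print(n, indices_from_alpha_zeta(alpha, zeta))
```

The computed β, γ and ν agree with the tabulated values to within rounding, and δ comes out close to 4.8 for all three values of n.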

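Since M varies slowly, we can separate in the thermodynamic potential the terms related
to this variation:

\[ \Phi = \int\Bigl\{\Phi_0(T, M) + \tfrac12\alpha\frac{\partial\mathbf{M}}{\partial x_i}\cdot\frac{\partial\mathbf{M}}{\partial x_i}\Bigr\}\,dV, \tag{47.9} \]

where $\Phi_0$ applies to a uniformly magnetized body.† It must be emphasized that we are here
discussing the true thermodynamic potential, and the coefficient α, a function of
temperature, is not the same as $\alpha_0$ in the effective Hamiltonian.

In a small rotation of M (without change in magnitude), the term $\Phi_0(T, M)$ is unaltered.
If some particular direction of M is chosen as the z-axis, a small deviation from this
direction can be described by a small two-dimensional vector $\delta\mathbf{M}$ in the xy-plane.
The corresponding change in the thermodynamic potential is‡

\[ \delta\Phi = \tfrac12\alpha\int\frac{\partial\,\delta\mathbf{M}}{\partial x_i}\cdot\frac{\partial\,\delta\mathbf{M}}{\partial x_i}\,dV. \tag{47.10} \]

Substituting $\delta\mathbf{M}$ as a Fourier series

\[ \delta\mathbf{M} = \sum_{\mathbf{k}}\delta\mathbf{M}_{\mathbf{k}}\,e^{i\mathbf{k}\cdot\mathbf{r}}, \]

we obtain

\[ \delta\Phi = \tfrac12 V\alpha\sum_{\mathbf{k}} k^2|\delta\mathbf{M}_{\mathbf{k}}|^2 \]

and the mean square fluctuation

\[ \langle\delta M_{\alpha\mathbf{k}}\,\delta M^*_{\beta\mathbf{k}}\rangle = (T/V\alpha k^2)\,\delta_{\alpha\beta}, \tag{47.11} \]

where α and β are vector suffixes in the xy-plane. The corresponding coordinate correlation
function is§

\[ G_{\alpha\beta}(r) = (T/4\pi\alpha r)\,\delta_{\alpha\beta}. \tag{47.12} \]

Thus the correlation function of the fluctuations of the direction of M in the
ferromagnetic phase decreases according to a low power law (∼ 1/r) at distances $r \gg r_c$,
whereas the correlation function of the fluctuations of the magnitude M decreases
rapidly.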
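
The step from the Fourier components (47.11) to the coordinate form (47.12) is a single standard integral. The following is a sketch, assuming the convention $\delta\mathbf{M}(\mathbf{r}) = \sum_{\mathbf{k}}\delta\mathbf{M}_{\mathbf{k}}e^{i\mathbf{k}\cdot\mathbf{r}}$ and the replacement $\sum_{\mathbf{k}} \to V\int d^3k/(2\pi)^3$:

```latex
\begin{aligned}
G_{\alpha\beta}(r)
 &= \sum_{\mathbf{k}} \langle \delta M_{\alpha\mathbf{k}}\,\delta M^*_{\beta\mathbf{k}}\rangle\, e^{i\mathbf{k}\cdot\mathbf{r}}
  = \frac{T\delta_{\alpha\beta}}{\alpha}\int \frac{e^{i\mathbf{k}\cdot\mathbf{r}}}{k^2}\,\frac{d^3k}{(2\pi)^3} \\
 &= \frac{T\delta_{\alpha\beta}}{\alpha}\,\frac{1}{2\pi^2 r}\int_0^\infty \frac{\sin kr}{k}\,dk
  = \frac{T\delta_{\alpha\beta}}{\alpha}\,\frac{1}{2\pi^2 r}\cdot\frac{\pi}{2}
  = \frac{T}{4\pi\alpha r}\,\delta_{\alpha\beta}.
\end{aligned}
```

The angular integration gives the factor $\sin kr/kr$, and the remaining integral over k equals π/2 for any r > 0, so the 1/r dependence carries no characteristic length.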


† Strictly speaking, we should use here the thermodynamic potential $\tilde\Phi$, not $\Phi$; according to the theorem
of small increments, however, the derivative term concerned here has the same form in either
potential, and the potential will therefore be written as $\Phi$.
‡ The calculations below are exactly analogous to those in SP 1, §1…
§ Formula (47.12) can also be derived independently of closeness to the Curie point; far from the …
dimensions (see SP 2, §71, Problem 4). Neglec… on the range of validity of (47.12). For exam…
Using the hypothesis of scale invariance, we can now determine the temperature
dependence of the thermodynamic quantity α, expressing it in terms of the critical indices
defined above. As already mentioned, when $r \ll r_c$ the correlation functions of all
fluctuations of the magnetization, including those of its direction, have the same behaviour
(47.8). One feature of scale invariance is precisely that the characteristic distance at which
one limiting form changes to the other is $r_c$. That is, when $r \sim r_c$ the two forms should give
the same order of magnitude of G. When $r \sim r_c$, the temperature dependence of G from
(47.7) and (47.8) is

\[ G \propto r_c^{-(1+\zeta)} \propto (-t)^{\nu(1+\zeta)}. \]

According to (47.12),

\[ G \propto (\alpha r_c)^{-1} \propto \alpha^{-1}(-t)^{\nu}. \]

Comparison of these two expressions gives

\[ \alpha \propto (-t)^{-\nu\zeta} \tag{47.13} \]

(P. C. Hohenberg and P. C. Martin, 1965). Thus, as $T \to T_c$, α tends to infinity, though only
slowly (since ζ is small). In the Landau theory, it tends to a finite non-zero limit.†
When H ≠ 0, the thermodynamic potential (47.9) must contain a further term
$-\int HM_z\,dV$ (when a field is present, the mean direction of M, i.e. the z-axis, is of course the
same as the direction of H). In a fluctuation of the direction of M with $M^2$ = constant, we
have

\[ 2M\,\delta M_z + (\delta M_\alpha)^2 = 0. \]

The fluctuation therefore causes the expression (47.10) for the change in $\Phi$ to have an
additional term

\[ -\int H\,\delta M_z\,dV = (H/2M)\int(\delta M_\alpha)^2\,dV. \]

It is clear that this causes $\alpha k^2$ in (47.11) to be replaced by $\alpha k^2 + H/M$, so that

\[ \langle\delta M_{\alpha\mathbf{k}}\,\delta M^*_{\beta\mathbf{k}}\rangle = \frac{T\delta_{\alpha\beta}}{V(\alpha k^2 + H/M)}. \tag{47.14} \]


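These formulae yield the susceptibility of the ferromagnetic phase, i.e. the derivative

\[ \chi = \frac{\partial\langle M_z\rangle}{\partial H} = -\frac{1}{2M}\frac{\partial\langle(\delta M_\alpha)^2\rangle}{\partial H}. \tag{47.15} \]

Substituting $\langle(\delta M_\alpha)^2\rangle$ from (47.14) and changing from summation over k to integration,
we find

\[ \chi = \frac{T}{VM^2}\int\frac{1}{(\alpha k^2 + H/M)^2}\,\frac{V\,d^3k}{(2\pi)^3}, \]

and finally, after evaluating the integral,‡

\[ \chi = T/8\pi(\alpha M)^{3/2}H^{1/2}. \tag{47.16} \]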
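
The evaluation of the integral leading to (47.16) can be checked numerically. The sketch below is an illustration with hypothetical function names: after the angular integration the integral reduces to $(1/2\pi^2)\int_0^\infty k^2\,dk/(\alpha k^2 + H/M)^2$, which is mapped onto the unit interval by the substitution $k = s/(1-s)$ and evaluated with the midpoint rule.

```python
import math


def chi_numeric(T, M, H, alpha, n=100000):
    """Susceptibility from the k-integral, by the midpoint rule on s in (0, 1).

    With k = s/(1-s) the integrand k^2 dk / (alpha k^2 + h)^2 becomes
    s^2 ds / (alpha s^2 + h (1-s)^2)^2, where h = H/M.
    """
    h = H / M
    total = 0.0
    for i in range(n):
        s = (i + 0.5) / n
        total += s * s / (alpha * s * s + h * (1.0 - s) ** 2) ** 2
    integral = total / n
    return T / M ** 2 * integral / (2.0 * math.pi ** 2)


def chi_closed_form(T, M, H, alpha):
    """Closed form quoted as (47.16)."""
    return T / (8.0 * math.pi * (alpha * M) ** 1.5 * math.sqrt(H))


print(chi_numeric(1.0, 2.0, 0.05, 0.7), chi_closed_form(1.0, 2.0, 0.05, 0.7))
```

The two printed values agree, and quadrupling H halves the result, which is the $H^{-1/2}$ dependence.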


† It must be emphasized that the possibility of determining the function α(t) depends on the degeneracy of the
problem. For the same reason it is possible to determine the temperature dependence of the superfluid density $\rho_s$
in liquid helium near the λ-point; in that case, the degree of degeneracy n = 2 (see SP 2, §28). The quantities $\alpha M^2$
and $\rho_s$ play an analogous role in the two cases.
‡ The integral is governed by the range $k^2 \sim H/\alpha M$; for sufficiently small H, this is compatible with the
condition $kr_c \ll 1$ for (47.14) to be valid.
We see that, in the exchange approximation, the fluctuations of the direction of M deprive
the susceptibility χ in the ferromagnetic phase of its literal significance, as it increases
without limit as H → 0. Formula (47.16) is applicable not only near the Curie point.†

By M in the denominator of (47.16) is to be understood its value for H = 0. In the
fluctuation range (near the Curie point), taking the temperature dependences of M and α
from (47.4) and (47.13), we find that in the ferromagnetic phase

\[ \chi \propto (-t)^{-3(\beta - \nu\zeta)/2}\,H^{-1/2}, \qquad t < 0. \tag{47.17} \]

In the (temperature) range where the Landau theory is valid, there is a range of values of
H for which the ordinary term (39.7) predominates in the susceptibility. For, substituting
into (47.16) $M \sim (-at/B)^{1/2}$, we obtain as the condition for this predominance

\[ H \gg T_c^2\,(a|t|)^{1/2}B^{3/2}/\alpha^3. \]

In the Landau theory, H may be considered weak if $MH \ll |A|M^2$, i.e. $H \ll (a|t|)^{3/2}B^{-1/2}$.
The condition for these two inequalities to be compatible is the same as the condition (47.1)
for the Landau theory to be valid.
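
The compatibility statement can be followed step by step, omitting numerical factors. The sketch below assumes that the ordinary Landau susceptibility (39.7), which is not reproduced in this section, is of order $1/a|t|$ below the Curie point:

```latex
\begin{aligned}
&\frac{T_c}{(\alpha M)^{3/2}H^{1/2}} \ll \frac{1}{a|t|},
 \qquad M \sim \Bigl(\frac{a|t|}{B}\Bigr)^{1/2}
 \;\Longrightarrow\;
 H \gg \frac{T_c^2\,(a|t|)^{1/2}B^{3/2}}{\alpha^3}; \\
&H \ll \frac{(a|t|)^{3/2}}{B^{1/2}}
 \ \text{together with the above requires}\
 \frac{T_c^2\,(a|t|)^{1/2}B^{3/2}}{\alpha^3} \ll \frac{(a|t|)^{3/2}}{B^{1/2}}
 \;\Longleftrightarrow\;
 \frac{|t|}{T_c} \gg \frac{T_cB^2}{a\alpha^3}.
\end{aligned}
```

The last inequality is exactly (47.1).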

The exchange approximation becomes invalid in the immediate neighbourhood of the 
Curie point, where the anisotropy energy becomes considerable on account of the decrease 
in the exchange energy. The number of order parameter components then varies; for
instance, it decreases to one in a uniaxial ferromagnet of the easy-axis type ($M_z$ in place of
M; cf. the end of §40). In the Landau theory, this circumstance does not affect the form of
the temperature dependence of the spontaneous magnetization and the magnetic
susceptibility. In the fluctuation range, however, it is important: the fluctuations of $M_z$
increase without limit, but those of $M_x$ and $M_y$ remain finite. This alters the values of the
critical indices and causes a temperature dependence of the anisotropy coefficients. The
problem is further complicated by the fact that it may be necessary to take into account 
also the magnetostatic energy of the magnetic field fluctuations; we shall not pause here to 
discuss this complex situation. 


§48. Antiferromagnets near the Curie point 

Like ferromagnetism, an antiferromagnetic structure is established basically by
isotropic exchange interaction of electrons; the weaker relativistic interactions determine
the crystallographic orientation of the magnetizations of the sub-lattices.‡ The
antiferromagnets now known are extremely varied in structure and therefore in their particular
magnetic properties.

As an illustration, we shall consider only a simple but typical and important case, that of
a uniaxial antiferromagnet with two antiparallel magnetic sub-lattices. The atoms in these
sub-lattices occupy equivalent lattice sites (that is, the symmetry elements of the magnetic
space group include rotations or reflections which interchange the atoms of the different
sub-lattices); otherwise, symmetry would not require strict equality of the magnitudes of
the sub-lattice magnetic moments, and the crystal would be ferromagnetic.

† It can also be derived in spin-wave theory (see SP 2, §71, Problem 2). Neglecting the magnetic anisotropy
and the magnetostatic energy of the fluctuations, however, restricts its validity in accordance with the conditions
(for uniaxial crystals) $H \gg \beta M$ or $H \gg 4\pi M$.
‡ The idea that the exchange interaction may produce a state in which the sub-lattices have antiparallel
magnetic moments was first put forward by L. Néel (1932). Independently, a similar idea was suggested by L.
Landau (1933), who formulated the concept of the antiferromagnetic state as a thermodynamic phase different
from the paramagnetic one, and of the necessary existence of a point of transition between them. This will be
called the antiferromagnetic Curie point.

Let $\mathbf{M}_1$ and $\mathbf{M}_2$ be the magnetic moments of the two sub-lattices per unit volume. We
take as the order parameter, equal to zero in the paramagnetic phase but not in the
antiferromagnetic phase, the difference

\[ \mathbf{L} = \mathbf{M}_1 - \mathbf{M}_2, \tag{48.1} \]

called the antiferromagnetic vector. The magnetization, equal to zero in the absence of a
magnetic field, is the sum $\mathbf{M} = \mathbf{M}_1 + \mathbf{M}_2$.

In relation to L, the symmetry transformations fall into two classes: those which 
interchange atoms only within the same sub-lattice, and those which interchange atoms in 
different sub-lattices. Then L is transformed simply as an axial vector for the first class, but 
in addition changes sign for the second class. 

Near the Curie point, L is small. In the Landau theory, the thermodynamic potential $\tilde\Phi$
in this range is expanded in powers of L and M, as first discussed by Landau in 1933.
However, since the magnetization occurs only when the field H is present, it would be more
correct to expand in terms of L and H immediately. For a uniaxial crystal, such an
expansion is

\[ \tilde\Phi = \Phi_0 + AL^2 + BL^4 + D(\mathbf{H}\cdot\mathbf{L})^2 + D'H^2L^2 - \tfrac12\chi_pH^2
   - \tfrac12\gamma(H_x^2 + H_y^2) + \tfrac12\beta(L_x^2 + L_y^2) - H^2/8\pi. \tag{48.2} \]

The first five terms (after $\Phi_0$) are independent of the crystallographic orientation of the
vectors H and L; they are of exchange origin. The next two terms are due to the relativistic
interactions; the z-axis is, as usual, taken to be along the principal axis of symmetry of the
crystal. A term H · L linear in H is forbidden by the requirement of invariance under
transformations that change the sign of L.

If β > 0, the antiferromagnetic vector L is along the z-axis (an antiferromagnet of the
easy-axis type). If β < 0, L lies in the basal plane (an antiferromagnet of the easy-plane
type). In the former case, the antiferromagnetic Curie point corresponds to the vanishing of the
function A(T) (the coefficient of $L_z^2$); in the latter case, to the vanishing of $A + \tfrac12\beta$ (the
coefficient of $L_x^2 + L_y^2$).

We shall consider an easy-axis antiferromagnet (β > 0). Near the Curie point, we put as
usual $A = a(T - T_c)$, and take B to have its value for $T = T_c$; then B > 0 is the condition for
the state with L = 0 to be stable at $T = T_c$. In the paramagnetic phase, A > 0 and L = 0. In
the antiferromagnetic phase, A < 0, and minimizing the potential $\tilde\Phi$ for H = 0 gives the
usual Landau-theory temperature dependence of L:

\[ L = [a(T_c - T)/2B]^{1/2}. \tag{48.3} \]

Differentiating the potential (48.2), using the formula

\[ \partial\tilde\Phi/\partial\mathbf{H} = -\mathbf{B}/4\pi = -\mathbf{H}/4\pi - \mathbf{M}, \]

we find for L = 0, i.e. in the paramagnetic phase,†

\[ M_x = (\chi_p + \gamma)H_x, \qquad M_y = (\chi_p + \gamma)H_y, \qquad M_z = \chi_pH_z. \tag{48.4} \]

The constant γ makes the magnetic susceptibility anisotropic in this phase. Because of its
relativistic origin, we have $|\gamma| \ll \chi_p$. This constant will be neglected in what follows, so that
$\chi_p$ will be the isotropic susceptibility for $T > T_c$.

† The term $-H^2/8\pi$ is separated in (48.2) so that the differentiation of the remaining terms with respect to H
gives just M.

In the antiferromagnetic phase, when H → 0 (i.e. when the field dependence of the
equilibrium value of L is neglected), we have

\[ \mathbf{M} = \chi_p\mathbf{H} - 2D\,\mathbf{L}(\mathbf{L}\cdot\mathbf{H}) - 2D'L^2\mathbf{H}. \tag{48.5} \]

If the field is perpendicular to L, then

\[ \mathbf{M} = \chi_\perp\mathbf{H}, \qquad \chi_\perp = \chi_p - 2D'L^2 = \chi_p - (D'a/B)(T_c - T). \tag{48.6} \]

In a longitudinal field,

\[ \mathbf{M} = \chi_\parallel\mathbf{H}, \qquad \chi_\parallel = \chi_\perp - 2DL^2 = \chi_p - (D + D')a(T_c - T)/B. \tag{48.7} \]

The susceptibility is anisotropic and remains so even when the relativistic interactions are
neglected, i.e. the anisotropy is of exchange origin.
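
Formulae (48.5) to (48.7) can be checked by differentiating the potential numerically. The sketch below keeps only the exchange terms of (48.2), without $-H^2/8\pi$ and with the relativistic terms dropped, holds L fixed at its zero-field value $L^2 = -A/2B$, and obtains M as minus the derivative with respect to H by central differences. The function names and the numerical values of the coefficients are illustrative assumptions.

```python
import math


def potential(H, L, A, B, D, D_prime, chi_p):
    """Exchange part of the expansion (48.2), without the term -H^2/(8*pi)."""
    HL = sum(h * l for h, l in zip(H, L))
    H2 = sum(h * h for h in H)
    L2 = sum(l * l for l in L)
    return A * L2 + B * L2 ** 2 + D * HL ** 2 + D_prime * H2 * L2 - 0.5 * chi_p * H2


def magnetization(H, L, A, B, D, D_prime, chi_p, eps=1e-5):
    """M_i = -d(potential)/dH_i by central differences, with L held fixed."""
    M = []
    for i in range(3):
        up, down = list(H), list(H)
        up[i] += eps
        down[i] -= eps
        diff = (potential(up, L, A, B, D, D_prime, chi_p)
                - potential(down, L, A, B, D, D_prime, chi_p))
        M.append(-diff / (2 * eps))
    return M


A, B, D, D_prime, chi_p = -0.2, 1.5, 0.3, 0.1, 0.5
L0 = math.sqrt(-A / (2 * B))      # zero-field value of L
L = (0.0, 0.0, L0)                # easy-axis case: L along z

chi_perp = magnetization((0.4, 0.0, 0.0), L, A, B, D, D_prime, chi_p)[0] / 0.4
chi_par = magnetization((0.0, 0.0, 0.4), L, A, B, D, D_prime, chi_p)[2] / 0.4
print(chi_perp, chi_p - 2 * D_prime * L0 ** 2)
print(chi_par, chi_p - 2 * (D + D_prime) * L0 ** 2)
```

Because the potential is quadratic in H, the central difference is exact up to rounding, and the two pairs of printed numbers coincide.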

It must also be emphasized that at the Curie point itself the susceptibility remains finite 
and continuous, but its first derivatives are discontinuous. There is an important difference 
here from a ferromagnet, for which the susceptibility becomes infinite at the transition 
point. This difference between the ferromagnetic and antiferromagnetic Curie points is 
closely related to the difference in the way they are affected by the presence of a magnetic 
field. In a ferromagnet, any field, however weak, blurs the transition, since by magnetizing 
the paramagnetic phase it removes the difference in symmetry of the two phases. 
Antiferromagnetic order, however, cannot be created by a magnetic field; the difference in 
symmetry between the two phases is maintained even in the presence of a field, and the 
transition remains sharp. 

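With the accuracy allowed by the expansion (48.2), the coefficients D and D' in (48.6) and
(48.7) may be taken as having their values at $T = T_c$. The discontinuities in the derivatives of
the susceptibility are therefore‡

\[ \frac{\partial\chi_p}{\partial T} - \frac{\partial\chi_\perp}{\partial T} = -\frac{aD'}{B}, \qquad
   \frac{\partial\chi_p}{\partial T} - \frac{\partial\chi_\parallel}{\partial T} = -\frac{a(D + D')}{B}. \tag{48.8} \]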

† In all known cases, the substance becomes paramagnetic at the antiferromagnetic Curie point, i.e. $\chi_p > 0$. The
sign of $\chi_p$ cannot, however, be deduced from thermodynamic arguments alone.
‡ No general conclusion as to the sign of D and D' can be drawn. In
practice, the discontinuity of $\partial\chi_\parallel/\partial T$ is always negative; this means that D + D' > 0. That of $\partial\chi_\perp/\partial T$ is usually
negative also, and near the Curie point $\chi_\perp > \chi_\parallel$; this means that D > 0 and D' > 0.

Let us now return to the expression (48.2) for $\tilde\Phi$. The terms with the coefficients D and β
depend on the direction of L. We shall assume that D > 0 (i.e. $\chi_\perp > \chi_\parallel$), and as before that
β > 0 (an easy-axis antiferromagnet). If the magnetic field is perpendicular to the z-axis, the
form of these terms shows that the minimum of $\tilde\Phi$ corresponds to $L_x = L_y = 0$, i.e. L is
always in the z-direction. If, however, the field is also along the z-axis, we see that, when the
magnetic energy in the field (the first of the terms mentioned) becomes comparable in
magnitude with the anisotropy energy, there should be a change in the direction of L,
which rotates into the basal plane. This spin flop of the sub-lattices occurs abruptly at a
certain field value $H = H_f$ (L. Néel, 1936), a result proved as follows. The terms in question
in (48.2) may be written in the form

\[ H_z^2DL^2 + L^2(-DH_z^2 + \tfrac12\beta)\sin^2\theta, \]

where θ is the angle between L and the z-axis. It is evident that the minimum of $\tilde\Phi$ occurs for
θ = 0 if $H_z < H_f$, where

\[ H_f^2 = \beta/2D = \beta L^2/(\chi_\perp - \chi_\parallel). \tag{48.9} \]

If $H_z > H_f$, however, the value corresponding to equilibrium is θ = ½π, and L is
perpendicular to the z-axis. A similar situation continues to exist far from the Curie point.
The flop field $H_f$ depends on the temperature, and the above discussion shows that the
curve $H = H_f(T)$ in the TH-plane is a curve of first-order phase transitions.†

In sufficiently strong magnetic fields, the antiferromagnetic structure cannot be
thermodynamically stable, since the parallel orientation along the field becomes energetically
favourable for the magnetic moments of the two sub-lattices. The disappearance of
the antiferromagnetic structure involves a change in symmetry and takes place by a
second-order phase transition. In the TH-plane, therefore, the region in which the
antiferromagnetic phase exists is bounded by a curve $H = H_c(T)$. The loss of antiferromagnetism
must occur when the magnetic energy in the field becomes comparable with the exchange
energy. Far from the Curie point, the critical field can be estimated in order of magnitude as
$H_c \sim T_c/\mu$, where μ is the atomic magnetic moment. As the Curie point is approached, $H_c$
decreases, and becomes zero at that point. The function $H_c(T)$ in this range is easily
determined by means of the same expression (48.2) for the thermodynamic potential.

It has been shown above that, if the field H is perpendicular to the z-axis, L is always
parallel to that axis. The L-dependent terms in the thermodynamic potential are

$AL^2 + BL^4 + D'H^2L^2$. (48.10)

From this we see that the presence of the magnetic field replaces the coefficient A by
$A + D'H^2$, and the latter is zero at the new transition point. The critical field is therefore
given by

$H_c^2 = -A/D' = a(T_c - T)/D'$. (48.11)

If H is parallel to the z-axis, for $H < H_f$ the vector L is again along that axis, but the L-dependent
terms in (48.2) differ from (48.10) in that D' is replaced by D + D'. In this case,
therefore,

$H_c^2 = \dfrac{a}{D + D'}(T_c - T)$. (48.12)

Lastly, if $H > H_f$ the vector L is perpendicular to the z-axis; in this case, we similarly find
for the critical field‡

$H_c^2 = \dfrac{a}{D'}(T_c - T - \beta/2a)$. (48.13)




Figure 26 shows the $(T, H^2)$ phase diagram for an antiferromagnet near the Curie point
for both the field directions considered. The dashed lines are second-order transitions, and
the continuous line a first-order one; P denotes the paramagnetic phase, $AF_\parallel$ and $AF_\perp$ the
antiferromagnetic phases with L respectively parallel and perpendicular to the z-axis. In
the Landau theory (which is used for the whole treatment here) the diagrams consist of
straight lines. For a longitudinal field there is a bicritical point (b in Fig. 26b) where the
first-order phase transition line ends on a second-order one. This point is analogous to the
bicritical point in the PT-plane (SP 1, §150, Fig. 67). Its coordinates are

$T_b = T_c - \dfrac{\beta(D + D')}{2aD}$, $H_b^2 = \beta/2D$, (48.14)

where β, D, D' are the values of the coefficients for $T = T_b$.
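
The coordinates (48.14) follow from requiring that the flop line $H^2 = \beta/2D$ meets both second-order lines (48.12) and (48.13) at the same temperature. The short Python check below is a sketch under assumed, purely illustrative coefficient values; the function names are hypothetical and introduced only for this example.

```python
import math


def hc_sq_parallel(t, a, t_c, d, d_prime):
    """Second-order line (48.12): L parallel to z, longitudinal field."""
    return a * (t_c - t) / (d + d_prime)


def hc_sq_flopped(t, a, t_c, beta, d_prime):
    """Second-order line (48.13): L perpendicular to z, longitudinal field."""
    return (a / d_prime) * (t_c - t - beta / (2.0 * a))


def bicritical_point(a, t_c, beta, d, d_prime):
    """Coordinates (T_b, H_b^2) from (48.14)."""
    t_b = t_c - beta * (d + d_prime) / (2.0 * a * d)
    h_b_sq = beta / (2.0 * d)
    return t_b, h_b_sq


if __name__ == "__main__":
    # Hypothetical Landau coefficients, chosen only to exercise the formulas.
    a, t_c, beta, d, d_prime = 1.0, 10.0, 0.4, 0.5, 0.25
    t_b, h_b_sq = bicritical_point(a, t_c, beta, d, d_prime)
    print(t_b, h_b_sq)
    print(hc_sq_parallel(t_b, a, t_c, d, d_prime))
    print(hc_sq_flopped(t_b, a, t_c, beta, d_prime))
```

All three lines pass through the same point, as required for the straight-line diagram of the Landau theory.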


§49. The bicritical point for an antiferromagnet 

The Landau theory used in §48 becomes inapplicable, as usual, in the immediate 
neighbourhood of second-order transition curves, i.e. in the fluctuation range. Let us 
consider this range near a bicritical point in the phase diagram of a uniaxial antiferromagnet
(of the easy-axis type) in a longitudinal magnetic field.

The effective Hamiltonian for this problem is

$\ldots + BL^4 + DL^2(H_b^2 - H_z^2)\sin^2\theta + \ldots$ (49.1)

The integrand consists of the L-dependent terms in the expansion (48.2), with
$A = (T - T_c)a$, the notation $T_b$ and $H_b$ from (48.9) and (48.14), and the addition of a gradient
term. This Hamiltonian has the same form as for a uniaxial ferromagnet (not in a magnetic
field), differing only in the notation: L replaces M, the expression in the square brackets …
replaces the anisotropy constant β of the ferromagnet. When $H_z = H_b$, this constant is
zero, and (49.1) reduces to

+ (49.3) 

which is formally the same as the effective Hamiltonian of a ferromagnet in the exchange
approximation (47.2). These analogies enable us to elucidate a number of properties of an
antiferromagnet near a bicritical point, by using the known results for a ferromagnet
(M. E. Fisher and D. R. Nelson, 1974).

The equation $H_z = H_b$ corresponds to a first-order phase transition curve (spin flop).
The analogy leads to the conclusion that the thermodynamic properties of the antiferromagnet
on this curve near the bicritical point are the same (with appropriate
interpretation of the quantities) as those of a purely exchange ferromagnet near its Curie
point. In particular, as the bicritical point is approached along this curve, the antiferromagnetic
vector tends to zero in accordance with

$L \propto (T_b - T)^\beta$, (49.4)

where β is the same as in (47.4).†

Near the bicritical point but off the first-order transition curve, the parameter u is small
but not zero. However small the value of u, its importance increases as the second-order
transition curve is approached (see the comment at the end of §47): the number of
components of the order parameter decreases to one ($L_z$ in the $AF_\parallel$ phase) or two ($L_x$ and
$L_y$ in the $AF_\perp$ phase). As u becomes smaller, so does the region in which it is important. The
change from n = 3 to n = 1 or 2 takes place via an intermediate range. It is plausible to
assume scale invariance in that range: only the scale of measurement of u changes as the
bicritical point is approached. Accordingly, there is a new cross-over index φ: as the scale of
$t = T - T_b$ changes, that of u varies as $|t|^\phi$. The index φ must be positive, since any value of u,
however small, affects the nature of the transition.

In using this hypothesis, we must assume that the phase transition curves near the
bicritical point are given by constant values of $x = u/|t|^\phi$. The first-order transition curves
correspond to x = 0, the second-order ones between P and $AF_\parallel$ to some $x_1 > 0$, and those
between P and $AF_\perp$ to some $x_2 < 0$. By $H_b$ in the definition (49.2) we must now
understand, of course, the actual function that would give the exact solution of the
statistical problem with the effective Hamiltonian (49.1). Expanding it in powers of t and
omitting from u the constant factor $2H_b$, we find the variable u near the bicritical point as

$u = H_b - H_z + c(T - T_b)$, (49.5)

where c is a constant. The equation of the first-order transition curve is
$H_z - H_b = c(T - T_b)$,
and those of the two second-order transition curves are

$H_z - H_b = c(T - T_b) \mp c_{1,2}(T - T_b)^\phi$, (49.6)

where $c_1$ and $c_2$ are positive constants. When φ > 1 (numerical estimates give φ ≈ 1.25),
these curves have the form shown in Fig. 27 in the fluctuation range; all the curves have a
common tangent at b.

† The index γ in this case has no direct physical significance, since it is related to the field h, which would appear
as a term $-\mathbf{h}\cdot\mathbf{L}$ in the effective Hamiltonian, but has no real existence in an antiferromagnet.

Fig. 27

The hypothesis of scale invariance near the bicritical point also leads to a number of
conclusions as to the manner of variation of the magnitude of the antiferromagnetic vector
L as that point is approached from various directions in the TH-plane, but we shall not
pause to discuss these.†

§50. Weak ferromagnetism 

There are crystals in which the exchange interaction establishes an antiferromagnetic 
structure but the comparatively weak relativistic interactions cause a slight distortion of 
this structure, as a result of which a magnetization M appears which is “anomalously”
small in proportion to the ratio of relativistic to exchange interactions. This is called weak
ferromagnetism.‡

The exchange interaction itself allows any orientation of the antiferromagnetic vector L 
in the crystal. A particular crystallographic orientation of L is established only by the 
relativistic interactions described by terms anisotropic with respect to L in the expansion of 
the thermodynamic potential. It may happen that the symmetry of the resulting structure 
would in itself allow the existence of a ferromagnetic moment M also.§ It is in such cases 
that weak ferromagnetism occurs; among the relativistic terms in the expansion of the 
thermodynamic potential, there are some which give rise to the required distortion of the 
antiferromagnetic structure.‖ This will be shown by means of a characteristic example.

Let us consider rhombohedral crystals belonging to the space group $D_{3d}^6$. As we know
(see QM, §93), the crystal class $D_{3d}$ contains the following symmetry elements: a threefold
axis of symmetry $C_3$ (trigonal axis), three twofold axes perpendicular to it (denoted by $U_2$),
and a centre of inversion I; in consequence, there are three planes of symmetry ($\sigma_d$), each
passing through the $C_3$ axis and perpendicular to one of the $U_2$ axes (and thus bisecting the
angle between the other two). In the space group $D_{3d}^6$, the planes $\sigma_d$ become glide planes
with a translation of half a period along the trigonal axis. This gives an arrangement of the
axes and centre of inversion in each unit cell as shown in Fig. 28. The vertical segment is
one period along the trigonal axis (a spatial diagonal of the rhombohedral cell), whose
length is arbitrarily taken as unity. The twofold axes pass through the points $\tfrac14$ and $\tfrac34$.
The centre of inversion occurs at 0 and $\tfrac12$ (marked by crosses in Fig. 28). The vertical planes
$\sigma_d$ are not shown.

† See M. E. Fisher and D. R. Nelson, Physical Review Letters 32, 1350, 1974.

‡ With the terminology described at the end of §37, it would be more correctly named weak ferrimagnetism.

§ For this, it is always necessary that the magnetic unit cell should be the same as the crystallographic one; …

‖ … of weak ferromagnetism … I. E. Dzyaloshinskii (1957).

In the antiferromagnets FeCO$_3$ and MnCO$_3$, each unit cell contains two magnetic ions
(Fe$^{2+}$ or Mn$^{2+}$), which occupy positions at the equivalent points 0 and $\tfrac12$ on the trigonal
axis. The exchange interaction establishes a magnetic structure in which the moments of
these two ions are antiparallel. In FeCO$_3$ the moments of the Fe$^{2+}$ ions are along the
trigonal axis (Fig. 29). It is easy to see that such a structure is invariant under all
transformations of the class $D_{3d}$ and therefore does not allow ferromagnetism: the
existence of a vector M along the trigonal axis is excluded by the presence of the $U_2$ axes,
and of one in the basal plane by that of the $C_3$ axis.

In the antiferromagnet MnCO$_3$, the magnetic moments of the ions lie in the basal (xy)
plane, which is perpendicular to the trigonal (z) axis (Fig. 30). If the moments lie in one of
the $\sigma_d$ planes, which we then take as the xz-plane, the magnetic structure has the symmetry
elements (in addition to the unit element) $U_2^{(y)}$, $I$, $\sigma_d$, i.e. belongs to the magnetic class $C_{2h}$,
which coincides with the ordinary class $C_{2h}$; it allows the existence of a vector M in the
y-direction. If, on the other hand, the moments lie along one of the $U_2$ axes, which we then
take as the x-axis, the magnetic structure has the symmetry elements $U_2^{(x)}R$, $I$, $\sigma_dR$, i.e.
belongs to the magnetic class $C_{2h}(C_i)$; it too allows the existence of a vector M in the
y-direction. In either case, M arises from the turning of the moments of the two ions in each
unit cell to lie oppositely in the xy-plane (Fig. 31).

Going on to a quantitative theory, we again define vectors $\mathbf{M} = \mathbf{M}_1 + \mathbf{M}_2$ and
$\mathbf{L} = \mathbf{M}_1 - \mathbf{M}_2$, where the suffixes 1 and 2 refer to the two magnetic sub-lattices. The unit vector
in the direction of L is denoted by l.

Let us consider the expansion of the thermodynamic potential Φ (for H = 0) in powers
of M and l. The expansion in powers of M is permissible because this quantity is small in a
weak ferromagnet. The expansion of the anisotropy energy in powers of l is based, as
usual, on the relative smallness of the relativistic interactions. There is thus no assumption
of nearness to a second-order phase transition point (L small); the theory given here is
therefore not subject to the limitations inherent in the Landau theory.

Fig. 28    Fig. 29    Fig. 30

The expansion terms must be invariant under all transformations in the group $D_{3d}^6$. The
first terms in such an expansion are

$\Phi = \Phi_0(L) + BM^2 + D(\mathbf{l}\cdot\mathbf{M})^2 + \ldots + \zeta L(M_xl_y - M_yl_x)$, (50.1)

where $\Phi_0$ is a function isotropic with respect to L. The first two terms (after $\Phi_0$) are of
exchange origin; here B > 0 (otherwise, there would exist a spontaneous magnetization
independent of the antiferromagnetism, i.e. the body would be an ordinary exchange
ferromagnet). The next three terms are of the first order ($\sim v^2/c^2$) in the relativistic
interactions.† The last of these may be put in the form $L\boldsymbol{\zeta}\cdot\mathbf{M}\times\mathbf{l}$, where ζ is a vector
parallel to the z-axis.‡

It is obvious that all the terms in (50.1) except the last are invariant. To test the
invariance of the last term, it is sufficient to do so with respect to the $C_3$ axis, one of the $U_2$
axes, and the inversion I. The invariance under rotations about the trigonal (z) axis is
evident from the form which shows the z-component of $\mathbf{M}\times\mathbf{l}$; here it is important that the
rotations do not interchange atoms between different sub-lattices, so that M and l are
transformed in the same manner. The invariance under inversion follows from that of M
and l separately; for M this follows from its being an axial vector, for l we need also to note
that in the structure considered the inversion interchanges atoms only within each sub-lattice.
The transformation $U_2^{(x)}$ interchanges atoms with oppositely directed moments;
under such a rotation, therefore, $M_x, M_y \to M_x, -M_y$; $l_x, l_y \to -l_x, l_y$, and the invariance
of $M_xl_y - M_yl_x$ is obvious.

We shall suppose that γ < 0. Then the vector l is in the basal plane ($l_z = 0$). Taking as the
xz-plane the plane containing l, and minimizing Φ with respect to M for a given L, we find
for the ferromagnetic moment

$M_x = 0$, $M_y = (\zeta/2B)L$, $M_z = 0$. (50.2)

Since $|\zeta| \ll B$, M is in fact small. We see that the occurrence of weak ferromagnetism is due
to the last term in (50.1), which is bilinear in M and l. The close relation between the
direction of M and the antiferromagnetic structure is characteristic of weak ferromagnetism;
in the present case, M is in the same basal plane and is perpendicular to L.§

† The factors $L^2$ and $L$ in the last two terms are written only in order to make all the coefficients in (50.1)
dimensionless, and do not signify an expansion in powers of L.

‡ The microscopic origin of such a term is due to the spin-antisymmetric interaction which occurs as a second-order
effect in perturbation theory, the perturbation being the mixed terms bilinear in the exchange and
relativistic spin-orbit interactions. See T. Moriya, in Magnetism (ed. by G. T. Rado and H. Suhl), Vol. 1,
Academic Press, New York, 1963, p. 85.

§ In the approximation corresponding to the thermodynamic potential (50.1), there is no magnetic anisotropy
in the basal plane, and the direction of L in that plane remains arbitrary. Anisotropy in the basal plane, and
therefore a particular orientation of L, occur only when higher-order terms (up to the sixth order) are taken into
account. Terms then also come in which are mixed as regards the z and the x or y components of the vectors, so
that the magnetic moments deviate from the basal plane by a small angle.

When a field is present, the dependence of the magnetization on H is given by the
conditions for a minimum of the thermodynamic potential $\tilde\Phi = \Phi - \mathbf{M}\cdot\mathbf{H} - H^2/8\pi$. The
minimization is to be effected with respect to the orientation of the structure in the basal
plane and with respect to the components of M. It is evident that, when the magnetic
anisotropy in the basal plane is neglected, the magnetization rotates to a position such that
its component in the basal plane is parallel to the field $\mathbf{H}_\perp$, and L is accordingly
perpendicular to $\mathbf{H}_\perp$.† The minimization of $\tilde\Phi$ with respect to $M_z$ and $M_\perp$ then gives


$M_z = \chi_\parallel H_z$, $M_\perp = \chi_\perp(H_\perp + H_D)$, (50.3)

where the susceptibilities are

$\chi_\parallel = 1/(2B - \beta)$, $\chi_\perp = 1/2B$ (50.4)

and

$H_D = |\zeta|L$ (50.5)

is the Dzyaloshinskii field, i.e. the “effective field” which determines the spontaneous
magnetization of a weak ferromagnet. Since $|\beta| \ll B$, we have $\chi_\parallel \approx \chi_\perp$.

Another property of these substances is one which appears when a field H is applied, and
is found near the point of transition to the paramagnetic phase. In the Landau theory, we
expand the function $\Phi_0(L)$ in this region in powers of L:

$\Phi_0(L) = \Phi_0(0) + AL^2 + CL^4$.

Let the vector L be in the positive x-direction. We take the particular case where ζ > 0‡;
then M is in the positive y-direction. The field H is assumed to be in that direction also. The
thermodynamic potential is

$\tilde\Phi = \Phi_0(0) + AL^2 + CL^4 + BM^2 - \zeta LM - HM - H^2/8\pi$; (50.6)

near the Curie point, the expansion of the anisotropy energy in powers of the unit vector l
becomes one in powers of L itself. In the absence of a field, $M = \zeta L/2B$, and the expansion
(50.6) becomes

$\Phi = \Phi_0 + (A - \zeta^2/4B)L^2 + CL^4$.

The Curie point is given by the vanishing of the coefficient of $L^2$; near this point, therefore,
$A - \zeta^2/4B = a(T - T_c)$
(a > 0). The remaining coefficients are taken to have their values at $T = T_c$ (with C > 0).

When the field is present, elimination of M from the equations $\partial\tilde\Phi/\partial L = 0$, $\partial\tilde\Phi/\partial M = 0$
gives the following equation for L:

$2CL^3 + a(T - T_c)L - \zeta H/4B = 0$. (50.7)

This shows that the magnetic field blurs the phase transition in a weak ferromagnet, as in
an ordinary one.§ The antiferromagnetic vector also is non-zero on both sides of $T_c$: the
magnetic field creates antiferromagnetic order in the paramagnetic phase, thus removing
the difference between the symmetry properties of the two phases (A. S. Borovik-Romanov
and V. I. Ozhogin, 1960). When $T > T_c$, L decreases beyond a certain distance
from $T_c$ according to

$L = \zeta H/4aB(T - T_c)$.

† It is assumed here that D > 0; when D < 0, the presence of the term $-|D|(\mathbf{l}\cdot\mathbf{M})^2$ with a large coefficient |D|
may have the result that l and $\mathbf{H}_\perp$ are no longer at right angles.

The neglect of the anisotropy in the basal plane imposes certain lower limits on the fields concerned.

‡ The sign of ζ is arbitrary in the sense that it depends on the definition of $\mathbf{L} = \mathbf{M}_1 - \mathbf{M}_2$ according to which
magnetic atoms in the crystal are taken as the sub-lattices 1 and 2. But when the choice has been made in a given
crystal the sign of ζ is definite, and determines the direction of M with respect to that of L and the crystallographic
axes.

§ Equation (50.7) has the same form as (19.4) for a ferroelectric in an electric field and the corresponding
equation for an ordinary ferromagnet in a magnetic field.
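
The blurring of the transition described by (50.7) can be made concrete by solving the cubic numerically. The Python sketch below is illustrative only: it finds the positive root of $2CL^3 + a(T - T_c)L - \zeta H/4B = 0$ by bisection, assuming ζH > 0, and compares it with the limiting form $L = \zeta H/4aB(T - T_c)$ far above $T_c$. The function names and the default coefficient values are assumptions chosen for the example, not values from the text.

```python
def antiferro_vector(t_minus_tc, field, a=1.0, c4=1.0, zeta=0.1, b=1.0):
    """Positive root L of 2*C*L^3 + a*(T - T_c)*L - zeta*H/(4*B) = 0, eq. (50.7).

    Assumes zeta*field > 0, a > 0, c4 > 0 and b > 0, so that exactly one
    positive root exists on either side of T_c.
    """
    h = zeta * field / (4.0 * b)
    if h <= 0.0:
        raise ValueError("this sketch assumes zeta * field > 0")

    def f(length):
        return 2.0 * c4 * length**3 + a * t_minus_tc * length - h

    lo, hi = 0.0, 1.0
    while f(hi) <= 0.0:  # enlarge the bracket until the sign changes
        hi *= 2.0
    for _ in range(200):  # plain bisection
        mid = 0.5 * (lo + hi)
        if f(mid) > 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


def high_temperature_limit(t_minus_tc, field, a=1.0, zeta=0.1, b=1.0):
    """Limiting form L = zeta*H / (4*a*B*(T - T_c)), valid well above T_c."""
    return zeta * field / (4.0 * a * b * t_minus_tc)


if __name__ == "__main__":
    for dt in (-2.0, -0.1, 0.0, 0.1, 2.0, 50.0):
        print(dt, antiferro_vector(dt, field=0.4))
```

The root varies smoothly through $T = T_c$ and stays non-zero on both sides, in line with the statement that the field removes the sharp transition.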


§51. Piezomagnetism and the magnetoelectric effect 

The phenomena of piezomagnetism and the magnetoelectric effect in antiferromagnets 
are closely related to the magnetic symmetry. 

Piezomagnetism is the occurrence of magnetization when elastic stresses are applied to 
the crystal; it is analogous to piezoelectricity, and is represented by the presence in the 
thermodynamic potential of the crystal of a term linear both in the field and in the elastic 
stress tensor: 

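$\tilde\Phi_{pm} = -\lambda_{i,kl}H_i\sigma_{kl}$, (51.1)

where $\lambda_{i,kl}$ is a tensor symmetrical in the suffixes k and l; cf. (17.7). This term causes the
induction $B_i = -4\pi\,\partial\tilde\Phi/\partial H_i$ to contain a further term $4\pi\lambda_{i,kl}\sigma_{kl}$. Thus, when H = 0,
there is a magnetization linear in the stress:

$M_i = \lambda_{i,kl}\sigma_{kl}$. (51.2)

Another manifestation of the same property is linear magnetostriction, the
occurrence of a strain linear in the field applied to the crystal:

$u_{kl} = \lambda_{i,kl}H_i$. (51.3)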

Time reversal changes the sign of the field H (and of the magnetization M) while leaving
the tensor $\sigma_{kl}$ unchanged; the thermodynamic potential must also remain unchanged, of
course. The piezomagnetic tensor $\lambda_{i,kl}$ therefore changes sign under time reversal. In turn
it follows that piezomagnetism is possible only in bodies having a magnetic structure;
without this, their properties are invariant under the transformation R, and therefore
$\lambda_{i,kl} = -\lambda_{i,kl} = 0$. Piezomagnetism is possible in antiferromagnets belonging to certain
magnetic symmetry classes which contain R only in combination with rotations or
reflections, or else do not contain R at all (B. A. Tavger and V. M. Zaitsev, 1956).

The magnetoelectric effect consists in a linear relation between the magnetic and electric
fields in matter; it causes, for example, a magnetization proportional to the electric field
present (L. D. Landau and E. M. Lifshitz, 1956). This effect is described by a term in the
thermodynamic potential that is linear in both the magnetic field and the electric one:

$\tilde\Phi_{me} = -\alpha_{ik}E_iH_k$, (51.4)

where $\alpha_{ik}$ is an unsymmetrical tensor. When H = 0, the electric field generates in the body a
magnetization

$M_k = \alpha_{ik}E_i$; (51.5)

when E = 0 the magnetic field generates an electric polarization

$P_i = \alpha_{ik}H_k$. (51.6)

Like piezomagnetism, the magnetoelectric effect is possible only for certain magnetic
symmetry classes; the magnetoelectric tensor is odd under time reversal and zero in
bodies without a magnetic structure. Since E is a polar vector and H an axial one, $\alpha_{ik}$ is
always zero if the crystal symmetry contains the inversion I; for the magnetoelectric effect
to exist, the inversion can occur only in the combination IR.


PROBLEMS†

Problem 1. Find the non-zero components of the piezomagnetic tensor in the antiferromagnet FeCO$_3$
(with the structure shown in Fig. 29).

Solution. As already mentioned in §50, the magnetic class of this structure is the same as the crystal class
$D_{3d}$ and does not contain the element R at all. The transformations of this class leave invariant the expression

$\tilde\Phi_{pm} = -\lambda_1[(\sigma_{xx} - \sigma_{yy})H_x - 2\sigma_{xy}H_y] - \lambda_2(\sigma_{xz}H_y - \sigma_{yz}H_x)$,

the z-axis being along the threefold symmetry axis, and the x-axis along one of the horizontal twofold axes. (This
expression for the class $D_3$ in the piezoelectric case has been found in §17, Problem 1; it remains valid in the
piezomagnetic case for the class $D_{3d} = D_3 \times C_i$ also, since the spatial inversion I leaves unchanged both the
tensor $\sigma_{ik}$ of rank two and the axial vector H.) The magnetization is therefore

$M_x = \lambda_1(\sigma_{xx} - \sigma_{yy}) - \lambda_2\sigma_{yz}$, $M_y = -2\lambda_1\sigma_{xy} + \lambda_2\sigma_{xz}$.

Problem 2. The same as Problem 1, but for the magnetic class $D_{4h}(D_{2h})$.‡

Solution. This class has the elements (besides the unit element) $C_2$, $2C_4R$, $2U_2$, $2U_2'R$, $I$, $\sigma_h$, $2\sigma_v$, $2\sigma_v'R$, $2S_4R$; the
notation is everywhere as in QM, §95. These transformations leave invariant the expression

$\tilde\Phi_{pm} = -\lambda_1(\sigma_{xz}H_y + \sigma_{yz}H_x) - \lambda_2\sigma_{xy}H_z$,

the z-axis being along the fourfold symmetry axis, the x and y axes along two horizontal twofold axes. The
magnetization is therefore

$M_x = \lambda_1\sigma_{yz}$, $M_y = \lambda_1\sigma_{xz}$, $M_z = \lambda_2\sigma_{xy}$.

Problem 3. Find the non-zero components of the magnetoelectric tensor for the antiferromagnet
chromium sesquioxide (Cr$_2$O$_3$). This crystal belongs to the space group $D_{3d}^6$ (see §50) and contains in each unit
cell four chromium atoms at the equivalent points $u$, $\tfrac12 - u$, $\tfrac12 + u$, $1 - u$ ($u < \tfrac14$) on the trigonal axis; their magnetic
moments are along this axis (Fig. 32).

Solution. The antiferromagnet in question belongs to the magnetic class $D_{3d}(D_3)$, which contains the
elements $2C_3$, $3U_2$, $IR$, $2S_6R$, $3\sigma_dR$. These transformations leave invariant the expression

$\tilde\Phi_{me} = -\alpha_\parallel E_zH_z - \alpha_\perp(E_xH_x + E_yH_y)$,

the z-axis being along the trigonal axis; the non-zero components are therefore $\alpha_{xx} = \alpha_{yy} = \alpha_\perp$, $\alpha_{zz} = \alpha_\parallel$.

Fig. 32

† The examples given below are due to I. E. Dzyaloshinskii (1957, 1959).

‡ The antiferromagnets manganese difluoride (MnF$_2$) and cobalt difluoride (CoF$_2$) are of this type.

§52. Helicoidal magnetic structures 

A special category of magnetic structures consists of those in which the periods of the 
“magnetic lattice” are incommensurate with the periods of the original crystal lattice. 
Various mechanisms are possible for the formation of such structures; here we shall 
consider one mechanism noted by I. E. Dzyaloshinskii (1964), which allows a simple
formulation in macroscopic terms. This will be done for a specific example, that of a crystal 
of the cubic class T, in which the exchange interaction alone would give a purely 
ferromagnetic order of the magnetic moments (V. G. Bar’yakhtar and E. P. Stefanovskii, 
1969).t 

For the structure in fact to exist, it must be stable under small perturbations which
remove the macroscopic spatial uniformity of the crystal. This condition has been tacitly
assumed to be satisfied throughout the discussion up to now. The additional non-uniformity
energy resulting from the perturbation was given by (43.1); since this is positive
definite, the necessary stability does exist.

The expression (43.2) was derived in §43 as the first non-vanishing term in the expansion
of the non-uniformity energy in a cubic crystal in powers of the derivatives of the
magnetization M. There, however, we were concerned only with the energy of exchange
origin. It was also noted that the crystal symmetry may allow the existence of non-exchange
(relativistic) terms containing products of the derivatives $\partial M_i/\partial x_k$ and the
components $M_l$; for this to occur, the symmetry must not include a centre of inversion. The
crystal class T is such a class; it allows the existence of a term of the form $\mathbf{M}\cdot\operatorname{curl}\mathbf{M}$ that is
invariant under the symmetry operations in it. The non-uniformity energy thus has the
form

$U_{\text{non-u}} = \gamma\,\mathbf{M}\cdot\operatorname{curl}\mathbf{M} + \tfrac12\alpha[(\operatorname{grad}M_x)^2 + (\operatorname{grad}M_y)^2 + (\operatorname{grad}M_z)^2]$. (52.1)

The condition α > 0 does not here guarantee the stability of the uniform state.

We must take into account, however, the fact that the first-order term in the derivatives
in (52.1) contains an extra small factor ($\sim v^2/c^2$) in comparison with the second-order
term, because of its relativistic origin‡; this means that $\gamma \ll \alpha/a$, where a is the lattice
constant. It is thus possible to find a new stable state while remaining within the range of
validity of the expansion (52.1): the simple ferromagnetic structure is distorted in a non-uniform
manner, but only over distances much greater than a, so that the derivatives
remain small.

The value of M is determined by the basic exchange interactions (which are not related
to the non-uniformity). The “large-scale” structure is determined by the minimization of
the energy (52.1). For a given M, this structure consists in a slow change in the direction
of M.†

† Manganese silicide (MnSi) and iron germanide (FeGe) crystals are of this kind, with magnetic ions Mn and
Fe which belong to the space group $T^4$ with a simple cubic Bravais lattice.

‡ The first term in (52.1) is not invariant under a simultaneous equal rotation of the vectors M at every point in
space; it does not satisfy the condition stated in the second footnote to §43.

The relativistic terms may also occur as quadratic ones; for example, cubic symmetry allows a term of the form

$\tfrac12\alpha'[(\partial M_x/\partial x)^2 + (\partial M_y/\partial y)^2 + (\partial M_z/\partial z)^2]$.

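We seek M(r) as a periodic function

$\mathbf{M}(\mathbf{r}) = \dfrac{M}{\sqrt2}\left(\mathbf{m}e^{i\mathbf{k}\cdot\mathbf{r}} + \mathbf{m}^*e^{-i\mathbf{k}\cdot\mathbf{r}}\right)$; (52.2)

if M is constant, the square of the complex unit vector m ($\mathbf{m}\cdot\mathbf{m}^* = 1$) must be
zero: $\mathbf{m}^2 = 0$. Such a vector may be expressed as $\mathbf{m} = (\mathbf{m}_1 + i\mathbf{m}_2)/\sqrt2$, where $\mathbf{m}_1$ and $\mathbf{m}_2$
are two mutually perpendicular real unit vectors. Then

$\mathbf{M}(\mathbf{r}) = M(\mathbf{m}_1\cos\mathbf{k}\cdot\mathbf{r} - \mathbf{m}_2\sin\mathbf{k}\cdot\mathbf{r})$. (52.3)

Substitution of (52.2) in (52.1) gives

$U_{\text{non-u}} = i\gamma M^2\,\mathbf{k}\cdot\mathbf{m}\times\mathbf{m}^* + \tfrac12\alpha M^2k^2$
$= M^2\gamma\,\mathbf{k}\cdot\mathbf{m}_1\times\mathbf{m}_2 + \tfrac12\alpha M^2k^2$.

This expression has a minimum as a function of k if the vectors k and $\mathbf{m}_1\times\mathbf{m}_2$ are collinear
(parallel if γ < 0 and antiparallel if γ > 0) and

$k = |\gamma|/\alpha \ll 1/a$. (52.4)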

Thus the presence of a small additional term in $U_{\text{non-u}}$, linear in the derivatives, causes
the occurrence of a helicoidal magnetic structure superimposed on the basic ferromagnetic
structure; the magnetic moments of the atoms lie in planes perpendicular to the direction
of k, and the directions of the moments rotate slowly in successive layers of atoms. The
ends of the atomic moment vectors on a straight line parallel to k form a helix. The pitch
$2\pi/k$ of the helix is the period of the superlattice; it is large in comparison with the
crystallographic periods, and is in general incommensurate with them.‡ Phases having
structures of this type are said to be incommensurate.
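
The two steps behind (52.4) can be verified numerically: that $\mathbf{M}\cdot\operatorname{curl}\mathbf{M} = M^2k$ for the helix (52.3) with k along $\mathbf{m}_1\times\mathbf{m}_2$, and that the energy $\gamma M^2k + \tfrac12\alpha M^2k^2$ is minimized at $|k| = |\gamma|/\alpha$. The Python sketch below is only an illustration; it takes $\mathbf{m}_1$, $\mathbf{m}_2$ along x, y and k along z, and the function names and numerical values of γ and α are assumptions made for the example.

```python
import math


def helix_energy(k_signed, gamma, alpha, m_mag=1.0):
    """Energy density gamma*M^2*k + (1/2)*alpha*M^2*k^2.

    k_signed is the projection of k on m1 x m2 (positive when parallel).
    """
    return m_mag**2 * (gamma * k_signed + 0.5 * alpha * k_signed**2)


def equilibrium_wave_number(gamma, alpha):
    """Signed minimizer of helix_energy; its magnitude is |gamma|/alpha, eq. (52.4)."""
    return -gamma / alpha


def helix_pitch(gamma, alpha):
    """Pitch 2*pi/k of the helix at equilibrium."""
    return 2.0 * math.pi / abs(equilibrium_wave_number(gamma, alpha))


def m_dot_curl_m(z, k, m_mag=1.0, step=1e-5):
    """M . curl M for M = M (cos kz, -sin kz, 0), using central differences in z."""
    def mx(s):
        return m_mag * math.cos(k * s)

    def my(s):
        return -m_mag * math.sin(k * s)

    dmx = (mx(z + step) - mx(z - step)) / (2.0 * step)
    dmy = (my(z + step) - my(z - step)) / (2.0 * step)
    curl_x, curl_y = -dmy, dmx  # only z-derivatives are non-zero here
    return mx(z) * curl_x + my(z) * curl_y


if __name__ == "__main__":
    gamma, alpha = -0.02, 1.0  # hypothetical values with |gamma| << alpha
    print(equilibrium_wave_number(gamma, alpha), helix_pitch(gamma, alpha))
    print(m_dot_curl_m(0.3, 2.0, m_mag=1.5), 1.5**2 * 2.0)
```

With γ < 0 the signed minimizer is positive, i.e. k is parallel to $\mathbf{m}_1\times\mathbf{m}_2$, in agreement with the rule stated before (52.4).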

In the approximation considered here, the direction of k relative to the crystallographic 
axes remains indeterminate. It is determined by the minimization of the sum of the 
anisotropy energy (40.7) and the relativistic part of the non-uniformity energy. We shall 
not pause to discuss this topic. 


† To avoid complications having no fundamental significance, we assume that $U_{\text{non-u}}$ is much less than
… (so that M is determined only by the latter), but is also much greater than the relativistic anisotropy energy
(so that the latter may be neglected in determining the change in the direction of M). These conditions may be
satisfied in the neighbourhood of the Curie point, neither too close to it to satisfy the first inequality nor too far
from it to satisfy the second. We shall not pause to set out the explicit limits of this range.

‡ The situation discussed here is exactly analogous to that in cholesteric liquid crystals (see SP 1, §140). In
these, the presence of a term $\mathbf{n}\cdot\operatorname{curl}\mathbf{n}$ in the energy (where n is the “director” unit vector of the crystal) again leads
to the occurrence of a helicoidal structure.

SUPERCONDUCTIVITY


§53. The magnetic properties of superconductors 

At temperatures close to absolute zero many metals enter a peculiar state whose 
most striking property, discovered by Kamerlingh Onnes in 1911, is what is called 
superconductivity, i.e. the complete absence of electric resistance to a steady current. 
Superconductivity first occurs at a definite temperature for each metal, called the 
superconductivity transition point, which is a second-order phase transition point. 

From the standpoint of phenomenological theory, however, the change in the magnetic, 
not the electrical, properties is more fundamental as regards the transition to the 
superconducting state; as we shall see, the change in its electric properties is an inevitable 
consequence of its magnetic properties. 

The magnetic properties of a superconducting metal can be described as follows. The 
magnetic field does not penetrate into the superconductor; since the mean magnetic field in 
the medium is, by definition, the magnetic induction B, we can say that throughout a 
superconductor 

B = 0 (53.1) 

(W. Meissner and R. Ochsenfeld, 1933). This property holds whatever the conditions under 
which the transition to the superconducting state occurs. For example, if the metal is 
cooled in a magnetic field, then at the transition point the lines of magnetic force cease to 
enter the body. 

However, it should be mentioned that the equation B = 0 is not valid in a thin surface 
layer. The magnetic field penetrates into a superconductor to a depth large compared with 
the interatomic distances and usually of the order of $10^{-5}$ cm, but depending on the metal
concerned and on the temperature. For the same reason, the equation B = 0 does not hold 
at all in thin films of metal or small particles whose thickness or dimension is of the order of 
the penetration depth. 

In what follows, we shall consider only thick superconductors, and neglect the 
penetration of the magnetic field into a thin surface layer.f 

As we know, the normal component of the induction must be continuous at any
boundary between two media; this condition follows from the equation div B = 0, which is
universally valid. Since B = 0 in a superconductor, the normal component of the external
field must be zero on the surface, i.e. the field outside a superconductor must be everywhere
tangential to its surface, the lines of magnetic force having the surface as their envelope.

† We shall not give here the theory of phenomena related to the depth of penetration of the magnetic field into
the superconductor (the Londons’ theory and the Ginzburg-Landau theory). Although these are macroscopic
theories, the quantities which appear in them have a clear significance only in relation to the microscopic theory.
These theories are dealt with in SP 2.

It must also be emphasized that the present chapter is concerned with “type I” superconductors, which include
the metallic elements and stoichiometric compounds. In a “type II” superconductor (including the superconducting
alloys) the Meissner effect is fully seen only in fairly weak fields. A sufficiently strong field penetrates into a
type II superconductor without completely destroying its superconducting properties (see SP 2, Chapter 5).

Using this result, we can easily find the forces acting on a superconductor in a magnetic
field. As in §5 for an ordinary conductor in an electric field, we calculate the force per unit
surface area as $\sigma_{ik}n_k$, where $\sigma_{ik} = (H_iH_k - \tfrac12H^2\delta_{ik})/4\pi$ is the Maxwell stress tensor for a
magnetic field in a vacuum. Since in the present case $\mathbf{n}\cdot\mathbf{H}_e = 0$, where $\mathbf{H}_e$ is the field just
outside the body, we find

$\mathbf{F}_s = -H_e^2\mathbf{n}/8\pi$, (53.2)


i.e. the surface is subject to a compression, of magnitude equal to the field energy density.

According to equation (29.4)

$\operatorname{curl}\mathbf{B} = 4\pi\overline{\rho\mathbf{v}}/c$, (53.3)

and from the equation B = 0 it follows that the mean current density is also zero
everywhere inside the superconductor. That is, no macroscopic volume currents can flow in
a superconductor. It should be emphasized that in a superconductor the conduction
current cannot meaningfully be isolated from $\overline{\rho\mathbf{v}}$ as it can in an ordinary conductor. For the
same reason the magnetization M, and therefore H, have no physical significance here.

Thus any electric current which flows in a superconductor must be a surface current. The 
surface current density g is given, according to (29.16), by the discontinuity in the tangential 
component of the induction at the boundary of the body. Since B = 0 inside the
superconductor, and B = H outside it, we have

$\mathbf{g} = c\,\mathbf{n} \times \mathbf{H}_e/4\pi$.   (53.4)

The presence of surface currents is not peculiar to superconductors. Similar currents can
occur in any magnetized body, and their density is $\mathbf{g} = c\,\mathbf{n} \times (\mathbf{H}_e - \mathbf{B})/4\pi$. Since the
tangential component of $\mathbf{H} = \mathbf{B}/\mu$ is continuous on the surface of a normal (not
superconducting) body, we have $\mathbf{n} \times \mathbf{H}_e = \mathbf{n} \times \mathbf{B}/\mu$, and so the expression for g can be
written

$\mathbf{g} = \frac{c}{4\pi}\,\frac{1 - \mu}{\mu}\,\mathbf{n} \times \mathbf{B}$.   (53.5)

A fundamental difference between superconductors and other bodies, however, appears
when we consider the total current through a cross-section of the body. In a
non-superconductor the surface currents always balance, and the total current is zero. This is
seen from the condition (53.5) which relates the current density g to the magnetic induction
in the body, and so to the current g at every point on the surface. In superconductors,
however, the condition (53.5) has no meaning. For the transition from the ordinary state
(with magnetic permeability μ) to the superconducting state corresponds formally to the
limit B → 0 and μ → 0 simultaneously. The right-hand side of (53.5) then becomes
indeterminate, and there is no condition which restricts the possible values of the current.

Thus we have the result that the currents flowing on the surface of a superconductor may
amount to a non-zero total current. Of course, this can occur only in a multiply-connected
body (a ring, for example), or in a simply-connected superconductor forming part of a
closed circuit which includes also a source of the electromotive force needed to maintain
currents in the parts of the circuit which are not superconducting.
It is very important to note that a steady flow of current on a superconductor is possible
even if no electric field is present. This means that no dissipation of energy occurs, whose 
replacement would involve work being done by an external field. This property of a 
superconductor may also be described by saying that it has no electric resistance, a result 
which is thus a necessary consequence of its magnetic properties. 


§54. The superconductivity current 

Let us consider in more detail some properties of superconductors which depend on 
their shape. 

If a superconductor is a simply-connected body, then no steady distribution of surface 
currents on it can exist in the absence of an external magnetic field. This can be seen as 
follows. The surface currents would produce in the surrounding space a static magnetic 
field vanishing at infinity. Like any static magnetic field in a vacuum, this field would have a 
potential $\phi$, and by the boundary conditions on the superconductor the normal derivative
$\partial\phi/\partial n$ would vanish at the surface. We know from potential theory, however, that if
$\partial\phi/\partial n = 0$ on the surface of a simply-connected body and at infinity, then $\phi$ is a constant in
all space outside the body. Thus a magnetic field of this kind cannot exist, and therefore 
neither can the assumed surface currents. 

An external magnetic field, on the other hand, causes currents to flow on the surface of a 
simply-connected superconductor, and these currents can be observed through the 
appearance of a magnetic moment of the whole body. This “magnetization” is easily 
calculated for an ellipsoidal superconductor.†

Let $\boldsymbol{\mathfrak{H}}$ be the external field, parallel to one of the principal axes of the ellipsoid. The
relation $(1 - n)\mathbf{H} + n\mathbf{B} = \boldsymbol{\mathfrak{H}}$ holds for the magnetic field H inside a non-superconducting
ellipsoid, n being the demagnetizing factor for the axis in question (see (29.14)). In a
superconductor there is no physical "field" H, as we have already shown, and so the
magnetization $\mathbf{M} = (\mathbf{B} - \mathbf{H})/4\pi$ also lacks its usual significance. Nevertheless, it is here
convenient to introduce H and M as formal auxiliary quantities in the calculation of the
total magnetic moment $\boldsymbol{\mathscr{M}} = \mathbf{M}V$ (V being the volume of the ellipsoid), which retains its
usual physical meaning. Putting B = 0 in the superconducting ellipsoid, we find

$\mathbf{H} = \boldsymbol{\mathfrak{H}}/(1 - n)$,   (54.1)

and

$\boldsymbol{\mathscr{M}} = -V\mathbf{H}/4\pi = -V\boldsymbol{\mathfrak{H}}/4\pi(1 - n)$.   (54.2)

In particular, for a long cylinder in a longitudinal field n = 0, so that $\mathbf{H} = \boldsymbol{\mathfrak{H}}$ and
$\boldsymbol{\mathscr{M}} = -V\boldsymbol{\mathfrak{H}}/4\pi$.‡ These values of $\boldsymbol{\mathscr{M}}$ are the same as would be found if the body had a
diamagnetic volume susceptibility of $-1/4\pi$.
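The relations (54.1) and (54.2) can be sketched numerically as follows. This is a minimal illustration in Gaussian units; the function name and the sample values (a unit sphere in a unit field) are assumptions made here, not part of the text.

```python
import math


def superconducting_ellipsoid(h_ext: float, n: float, volume: float):
    """Return the formal field H and the total moment from (54.1), (54.2).

    h_ext  : external field along a principal axis
    n      : demagnetizing factor for that axis, 0 <= n < 1
    volume : volume of the ellipsoid
    """
    if not 0.0 <= n < 1.0:
        raise ValueError("demagnetizing factor must satisfy 0 <= n < 1")
    h_formal = h_ext / (1.0 - n)                    # (54.1)
    moment = -volume * h_formal / (4.0 * math.pi)   # (54.2)
    return h_formal, moment


# Sphere of unit radius (n = 1/3) in a unit external field.
h_sphere, m_sphere = superconducting_ellipsoid(1.0, 1.0 / 3.0, 4.0 * math.pi / 3.0)

# Long cylinder in a longitudinal field (n = 0) with unit volume.
h_cyl, m_cyl = superconducting_ellipsoid(1.0, 0.0, 1.0)
```

For the sphere this gives a formal field of 3/2 of the external field and a moment equal to minus one half of the external field times the cube of the radius; for the cylinder the moment is $-V\mathfrak{H}/4\pi$, as stated above.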

The magnetic field $H_e$ just outside the ellipsoid is everywhere tangential to it, and so its
magnitude can be determined at once from the condition that the tangential component of
H be continuous. Within the ellipsoid $\mathbf{H} = \boldsymbol{\mathfrak{H}}/(1 - n)$; taking the tangential component, we
have

$H_e = \frac{\mathfrak{H}}{1 - n}\,\sin\theta$,   (54.3)

where $\theta$ is the angle between the direction of the external field $\boldsymbol{\mathfrak{H}}$ and the normal to the
surface at the point considered. The greatest value of $H_e$ occurs on the equator of the
ellipsoid, and is $\mathfrak{H}/(1 - n)$.

† In the present section we always assume that the magnetic field does not exceed the value at which
superconductivity ceases (see §55).

‡ These relations for a cylinder follow immediately from the continuity condition on H, and are therefore valid
for a cylinder of any cross-section, circular or otherwise.

It may be pointed out once more that there is no fundamental difference between the 
currents which cause the “magnetization” of a superconductor and those which produce 
the total current in it: their physical nature is the same. This important fact makes possible, 
in particular, an immediate determination of the gyromagnetic coefficients for any 
superconductor. The momentum density of the electrons which form the “magnetizing” 
currents differs from the current density only by a factor m/e, e and m being the charge and
mass of the electron. From the definition of the gyromagnetic coefficients (see (36.3)) it 
follows at once that for a superconductor 

Let us now consider multiply-connected superconductors. Their properties are very 
different from those of simply-connected ones, mainly because it is no longer true that a 
steady distribution of surface currents is impossible in the absence of an external magnetic 
field. Moreover, the surface currents need not balance out, and may result in a steady total 
superconductivity current on the body, even if no external e.m.f. is applied. 

Let us consider a doubly-connected body (i.e. a ring), with no external magnetic field. We 
shall show that the state of such a body is entirely determined if the total current J on it is 
given. The problem of determining the field of the ring can again be solved as a problem of 
potential theory, but the potential $\phi$ is now a many-valued function, which changes by
$4\pi J/c$ when we go round any closed path interlinked with the ring (cf. §30). In order to
state the problem in mathematically precise terms, we must draw some open surface which
spans the ring. Then the problem is to solve Laplace's equation with the boundary
conditions $\partial\phi/\partial n = 0$ on the surface of the ring, $\phi = 0$ at infinity, and
$\phi_2 - \phi_1 = 4\pi J/c$ on the chosen surface, where $\phi_1$ and $\phi_2$ are the values of the potential on
the two sides of that surface. Such a problem is known from potential theory to have a 
unique solution, which does not depend on the form of the chosen surface. From the field 
near the surface of the ring, we can in turn uniquely determine the surface current 
distribution. 

The self-inductance of a superconducting ring is entirely determinate together with the 
current distribution. Here there is a marked difference from ordinary conductors, where the 
current distribution, and therefore the precise value of the self-inductance, depend on the 
manner of excitation of the current (§34).†

In §33 we introduced the concept of the magnetic flux $\Phi$ through a linear conductor
circuit, and showed that $\Phi = LJ/c$, where L is the self-inductance of the conducting circuit.
For a superconducting ring, the magnetic flux is meaningful for any thickness, not 
necessarily small, of the ring. For, since the magnetic field is tangential, the magnetic flux 
through any part of the surface of the ring is zero; the magnetic flux through every surface
spanning the ring is therefore the same. Moreover, the formula

$\Phi = LJ/c$   (54.4)

remains valid, the self-inductance L being again defined in terms of the total energy of the
magnetic field of the current. The total energy of the magnetic field of the superconductor
is given by the integral $\int (H^2/8\pi)\,dV$, taken over all space outside the body. Again spanning
the ring by a surface C, we can use the field potential $\phi$ and write

$\int H^2\,dV/8\pi = -\int \mathbf{H}\cdot\mathrm{grad}\,\phi\,dV/8\pi = \int \phi\,\mathrm{div}\,\mathbf{H}\,dV/8\pi - \oint H_n\phi\,df/8\pi$.

The first term is zero, because div H = 0. The surface integral is taken over an infinitely
remote surface, the surface of the ring, and the two sides of the surface C. The first two of
these give zero, so that

$\int H^2\,dV/8\pi = \int H_n(\phi_2 - \phi_1)\,df/8\pi = (J/2c)\int H_n\,df = J\Phi/2c$,

where $\Phi$ is the magnetic flux through the surface C. Comparing this with the definition of
the self-inductance, we have $J\Phi/2c = LJ^2/2c^2$, which gives (54.4).

† The self-inductance of a superconducting ring with radius b, made of wire whose cross-section is a circle with
small radius a, is the same as the external part of the inductance of a non-superconducting ring, namely
$L = 4\pi b\,[\log(8b/a) - 2]$; see §34, Problem 2. The exact solution of the problem of a current in a superconducting
circular ring was first given by V. A. Fok, Physikalische Zeitschrift der Sowjetunion 1, 215, 1932.

If the ring is in an external magnetic field, the total magnetic flux $\Phi$ is composed of the
flux LJ/c and the flux $\Phi_e$ of the external field. A very important property of a
superconducting ring is that, even if the external field and the current vary, the magnetic
flux through the ring remains constant:

$LJ/c + \Phi_e = \Phi_0$, a constant.   (54.5)

This follows immediately from the integral form of Maxwell's equation in the space
outside the body:

$\frac{1}{c}\,\frac{d}{dt}\int \mathbf{H}\cdot d\mathbf{f} = -\oint \mathbf{E}\cdot d\mathbf{l}$.

If the integration on the left-hand side is taken over a surface C which spans the ring, the
contour of integration on the right-hand side is a line on the surface of the ring. On the
surface of a superconductor, the tangential component of E is zero (since E = 0 inside the
superconductor and $\mathbf{E}_t$ is continuous on the surface). Hence the right-hand side is zero, and
therefore $d\Phi/dt = 0$.

The relation (54.5) gives the variation of the current in the ring when the external field
changes. For example, if the ring is made superconducting in an external field with flux $\Phi_0$,
which is then removed, a steady current $J = c\Phi_0/L$ flows in the ring.
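The trapped-flux example can be sketched numerically. The snippet below is an illustration only: it takes the thin-ring self-inductance $L = 4\pi b[\log(8b/a) - 2]$ quoted in the footnote of this section, assumes that the logarithm is natural, and uses hypothetical ring dimensions and flux. All quantities are in Gaussian units.

```python
import math

C_LIGHT = 2.99792458e10  # speed of light in cm/s


def ring_self_inductance(b: float, a: float) -> float:
    """Self-inductance (cm) of a thin superconducting ring, valid for a << b.

    b is the ring radius and a the wire radius, both in cm.
    """
    if not 0.0 < a < b:
        raise ValueError("need 0 < a < b")
    return 4.0 * math.pi * b * (math.log(8.0 * b / a) - 2.0)


def trapped_current(flux0: float, inductance: float) -> float:
    """Steady current J = c * Phi_0 / L left after the external field is removed.

    flux0 is in G cm^2 and the current is returned in statamperes.
    """
    return C_LIGHT * flux0 / inductance


# Hypothetical ring: radius 1 cm, wire radius 0.01 cm, initial flux 10 G cm^2.
L_ring = ring_self_inductance(b=1.0, a=0.01)
J_ring = trapped_current(flux0=10.0, inductance=L_ring)
```

By construction the flux LJ/c carried by the induced current equals the flux that was present when the ring became superconducting, which is the content of (54.5) with the external flux set to zero.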

The constancy of the magnetic flux through a superconducting ring holds not only when
the external field changes but also when the shape of the ring or its position in space is
altered.† An intuitive statement of this result is that the lines of force can never intersect the
surface of the superconductor, and so cannot escape from the aperture of the ring.

The above results can be immediately generalized to the case of multiply-connected
superconducting bodies, including sets of rings. The state of an n-ply connected system in
the absence of an external field is completely determined by the n − 1 total currents $J_a$. The
relation (54.5) becomes the system of equations

$\frac{1}{c}\sum_b L_{ab}J_b + \Phi_{ea} = \Phi_{a0}$.   (54.6)

These equations hold, not only for any external field, but also for any change in shape or 
relative position. 


PROBLEM 

Determine the magnetic moment of a superconducting disc in an external magnetic field perpendicular to its
plane.‡

Solution. The problem of a superconductor in a static magnetic field is identical with the electrostatic
problem of a dielectric with permittivity ε = 0. Regarding the disc as the limit of a spheroid as c → 0 (cf. §4,
Problem 4), and using (8.10), we find with appropriate change of notation (the field $\boldsymbol{\mathfrak{H}}$ being along the z-axis)
$\mathscr{M} = -2a^3\mathfrak{H}/3\pi$.
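The disc result can be checked numerically against (54.2) by taking a very flat oblate spheroid. The sketch below assumes the standard demagnetizing factor of an oblate spheroid along its axis of symmetry, $n = (1 + e^2)(e - \arctan e)/e^3$ with $e = \sqrt{a^2/c^2 - 1}$; that expression is not quoted in this section, and the function names are illustrative.

```python
import math


def oblate_demag_factor(a: float, c: float) -> float:
    """Demagnetizing factor along the symmetry axis of an oblate spheroid.

    a is the equatorial semi-axis and c < a the polar semi-axis.
    """
    e = math.sqrt((a / c) ** 2 - 1.0)
    return (1.0 + e * e) * (e - math.atan(e)) / e ** 3


def spheroid_moment(h_ext: float, a: float, c: float) -> float:
    """Total moment of a superconducting oblate spheroid from (54.2)."""
    n = oblate_demag_factor(a, c)
    volume = 4.0 * math.pi * a * a * c / 3.0
    return -volume * h_ext / (4.0 * math.pi * (1.0 - n))


def disc_moment(h_ext: float, a: float) -> float:
    """Limiting value for a disc of radius a, as found in the Problem."""
    return -2.0 * a ** 3 * h_ext / (3.0 * math.pi)


# Ratio for a spheroid with c/a = 1e-3; it tends to 1 as c/a tends to zero.
ratio = spheroid_moment(1.0, 1.0, 1e-3) / disc_moment(1.0, 1.0)
```

The moment stays finite in the limit because the volume vanishes in proportion to c while 1 − n vanishes in the same proportion.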

§55. The critical field 

A cylindrical superconductor in a longitudinal magnetic field has an additional magnetic
energy $-\tfrac{1}{2}\mathscr{M}\mathfrak{H} = V\mathfrak{H}^2/8\pi$. For a non-superconducting cylinder, on the other hand, the
total energy would be almost unchanged when the external field was applied (we shall
neglect the slight diamagnetism or paramagnetism of a non-superconducting metal, i.e.
take μ = 1). Thus it is clear that, in sufficiently strong magnetic fields, the superconducting
state must be thermodynamically less favourable than the normal state, and so the
superconductivity must be destroyed.

The value of the longitudinal magnetic field at which the superconductivity of a
cylindrical body is destroyed depends on the metal concerned and on the temperature (and
pressure). This value is called the critical field $H_{cr}$, and is one of the most important
characteristics of a superconductor.§

When the critical field is reached, the superconductivity is destroyed throughout the 
cylinder, because of the uniformity of the field over the surface. In bodies of other shapes, 
however, the destruction of superconductivity is a more complex process, in which the 
volume occupied by matter in the normal state gradually extends as $\mathfrak{H}$ increases over some
range (§56). 


† This statement follows at once from the relation between the induced e.m.f. and the change in the magnetic
flux through the circuit when it is moved (§63).

‡ We consider this problem principally with a view to using the result elsewhere (see §95, Problem 2). For a
superconducting disc the magnetic fields must in reality be very weak, since its superconductivity is very easily
destroyed (see §55).

§ There is a sharp transition between the superconducting and normal states only in type I superconductors
(see the footnote to §53), which are the only ones considered here. In type II superconductors, the destruction of 
superconductivity and the penetration of the magnetic field occur gradually over a fairly wide range of fields, so 
that there is no critical field in the sense here defined. 
Thus, at any temperature below the transition point, the metal can exist in either the
superconducting or the normal state, denoted by the suffixes s and n respectively. We
denote by $\mathscr{F}_{s0}(V, T)$ and $\mathscr{F}_{n0}(V, T)$ the total free energies of the superconducting and
normal body in the absence of an external magnetic field; these quantities depend on the
substance concerned and on the volume, but not on the shape, of the body. The free energy
in the normal state does not change when the external field is applied, and so we omit the
suffix 0 in $\mathscr{F}_n$. In the superconducting state, however, the magnetic field considerably
affects the free energy.

For a superconducting cylinder, with given V and T, the free energy in a longitudinal
external field $\mathfrak{H}$ is

$\mathscr{F}_s = \mathscr{F}_{s0}(V, T) + V\mathfrak{H}^2/8\pi$.   (55.1)

From this we can derive all the other thermodynamic quantities. Differentiating (55.1) with
respect to the volume, we find the pressure on the body:

$P = P_0(V, T) - \mathfrak{H}^2/8\pi$,   (55.2)

where $P_0(V, T)$ is the pressure (for given V and T) in the absence of the field. The equation
(55.2) gives the relation between P, V and T, i.e. it is the equation of state for a
superconducting cylinder in an external magnetic field. We see that the volume V(P, T) in
the presence of the magnetic field is the same as the volume with no magnetic field but a
pressure $P + \mathfrak{H}^2/8\pi$. This result accords, of course, with formula (53.2) for the force on the
surface of a superconductor in a magnetic field.

The thermodynamic potential‡ of the superconducting cylinder is
$\varPhi_s = \mathscr{F}_s + PV = \mathscr{F}_{s0}(V, T) + P_0V$, the volume V being expressed in terms of P and T by (55.2). Hence we
can write

$\varPhi_s(P, T) = \varPhi_{s0}(P + \mathfrak{H}^2/8\pi,\ T)$,   (55.3)


where $\varPhi_{s0}(P, T)$ is the thermodynamic potential in the absence of the field. Differentiating
this equation with respect to T and to P, we obtain analogous relations for the entropy and
the volume:

$\mathscr{S}_s(P, T) = \mathscr{S}_{s0}(P + \mathfrak{H}^2/8\pi,\ T)$,   (55.4)

$V_s(P, T) = V_{s0}(P + \mathfrak{H}^2/8\pi,\ T)$.   (55.5)

We can now write down the condition which determines the critical field. The transition
of the cylinder from the superconducting to the normal state occurs when $\varPhi_n$ becomes less
than $\varPhi_s$ (for given P and T). At the transition point we have $\varPhi_s = \varPhi_n$, i.e.

$\varPhi_{s0}(P + H_{cr}^2/8\pi,\ T) = \varPhi_n(P, T)$.   (55.6)


This is an exact thermodynamic relation.§ The change in the thermodynamic potential in
the magnetic field is usually a small correction to $\varPhi_{s0}(P, T)$. We can then expand the
left-hand side of equation (55.6) in series, taking the first two terms:

$\varPhi_{s0}(P, T) + \frac{H_{cr}^2}{8\pi}\,V_{s0}(P, T) = \varPhi_n(P, T)$,   (55.7)

where $V_{s0}(P, T) = \partial\varPhi_{s0}(P, T)/\partial P$ is the volume of the superconducting cylinder in the
absence of the field. Thus, in this approximation, we can say that the thermodynamic
potential per unit volume is greater by $H_{cr}^2/8\pi$ in the normal state than its value in the
superconducting state.

† By definition, the "total" quantities $\mathscr{F}$ and $\varPhi$ do not include the energy of the magnetic field that would exist
in the absence of the body.

‡ Here defined as in §12.

§ We give here calculations more accurate than is usually necessary, so as to exhibit more clearly the
interrelation between the various thermodynamic quantities.

We denote by $T_{cr} = T_{cr}(P)$ the transition temperature in the absence of the magnetic
field. The transition concerned is a second-order phase transition. Hence, in particular,
$H_{cr}(T)$ must tend continuously to zero at $T = T_{cr}$. We know from the general theory of
second-order phase transitions† that the change in the thermodynamic potential near the
transition point is proportional to the square of $T - T_{cr}$. We can therefore deduce from
(55.7) that the critical field in this temperature range varies as the temperature difference
$T - T_{cr}$:

$H_{cr} = \mathrm{constant} \times (T_{cr} - T)$.   (55.8)

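Differentiating both sides of equation (55.6) with respect to temperature (for given
pressure), remembering that $H_{cr}$ is a function of T, and using (55.4), (55.5), we have

$\mathscr{S}_s - \mathscr{S}_n = \frac{V_s H_{cr}}{4\pi}\left(\frac{\partial H_{cr}}{\partial T}\right)_P$,   (55.9)

where all the quantities are for the point of transition between the two states of
the body (i.e. for $\mathfrak{H} = H_{cr}$). Multiplying by T, we obtain the heat of the transition:

$Q = T(\mathscr{S}_n - \mathscr{S}_s) = -\frac{V_s H_{cr} T}{4\pi}\left(\frac{\partial H_{cr}}{\partial T}\right)_P$   (55.10)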

(W. H. Keesom, 1924). When the transition occurs at $T = T_{cr}$ (i.e. in the absence of the
magnetic field), the quantity Q vanishes with $H_{cr}$, in accordance with the fact that we have a
second-order phase transition. A transition at $T < T_{cr}$ (in a magnetic field) involves
absorption or evolution of heat, i.e. it is a first-order phase transition. In practice, $H_{cr}$
increases monotonically with decreasing temperature throughout the range from $T_{cr}$ to
0. Hence the derivative $\partial H_{cr}/\partial T$ is always negative, and we see from (55.10) that Q > 0,
i.e. heat is absorbed in the (isothermal) transition from the superconducting to the normal
state.

For T → 0 the entropy of the whole body must vanish, by Nernst's theorem. Hence
it follows from (55.9) that $\partial H_{cr}/\partial T = 0$ for T = 0, i.e. the curve of $H_{cr}(T)$ intersects the
H-axis at right angles.

We may differentiate the difference $\mathscr{S}_s - \mathscr{S}_n$ (55.9) again with respect to temperature,
and again use equations (55.4), (55.5). Since also $(\partial\mathscr{S}/\partial P)_T = -(\partial V/\partial T)_P$, the result is

8T or 


† For the superconductivity transition, the Landau theory … up to the transition point itself; see SP2, §45.

Multiplying both sides of this equation by T, we obtain the difference of the specific heats
(at constant pressure) of the two phases. The terms involving the thermal-expansion
coefficient and the compressibility are usually very small in comparison with the remaining
terms; neglecting them, we have

$\mathscr{C}_s - \mathscr{C}_n = \frac{V_s T H_{cr}}{4\pi}\,\frac{\partial^2 H_{cr}}{\partial T^2} + \frac{V_s T}{4\pi}\left(\frac{\partial H_{cr}}{\partial T}\right)^2$.   (55.12)


This formula could also be obtained by direct differentiation of the approximate relation 
(55.7). In this approximation the difference between and and between and 
may be neglected. 

For $T = T_{cr}$, the first term in (55.12) is zero, and we obtain the following formula, which
relates the change in specific heat in the second-order phase transition (in the absence of an
external magnetic field) to the temperature dependence of $H_{cr}$:

$\mathscr{C}_s - \mathscr{C}_n = \frac{V_s T_{cr}}{4\pi}\left(\frac{\partial H_{cr}}{\partial T}\right)^2$   (55.13)

(A. J. Rutgers, 1933). Hence we see that in this case $\mathscr{C}_s > \mathscr{C}_n$. As the temperature falls, i.e.
when the superconductivity is destroyed by the magnetic field, the difference $\mathscr{C}_s - \mathscr{C}_n$
changes sign, because the difference $\mathscr{S}_n - \mathscr{S}_s$ is zero for T = 0 and for $T = T_{cr}$, and must
have a maximum in between.
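These statements can be illustrated with a simple model. The sketch below assumes the parabolic law $H_{cr}(T) = H_0[1 - (T/T_{cr})^2]$, which is an empirical approximation introduced here for illustration and is not taken from the text, and evaluates the heat of transition $Q = -(V_s H_{cr} T/4\pi)\,\partial H_{cr}/\partial T$ of (55.10) together with the specific-heat difference $(V_s T/4\pi)[H_{cr}\,\partial^2 H_{cr}/\partial T^2 + (\partial H_{cr}/\partial T)^2]$ of (55.12). The volume is treated as a constant, and all names are hypothetical.

```python
import math


def h_cr(t, h0, t_cr):
    """Assumed model H_cr(T) = H0 * (1 - (T/T_cr)**2); not from the text."""
    return h0 * (1.0 - (t / t_cr) ** 2)


def dh_cr_dt(t, h0, t_cr):
    """First temperature derivative of the assumed H_cr(T)."""
    return -2.0 * h0 * t / t_cr ** 2


def d2h_cr_dt2(t, h0, t_cr):
    """Second temperature derivative of the assumed H_cr(T)."""
    return -2.0 * h0 / t_cr ** 2


def transition_heat(t, h0, t_cr, volume):
    """Heat of transition Q as in (55.10)."""
    return -volume * t * h_cr(t, h0, t_cr) * dh_cr_dt(t, h0, t_cr) / (4.0 * math.pi)


def specific_heat_difference(t, h0, t_cr, volume):
    """C_s - C_n as in (55.12), expansion and compressibility terms neglected."""
    h = h_cr(t, h0, t_cr)
    dh = dh_cr_dt(t, h0, t_cr)
    d2h = d2h_cr_dt2(t, h0, t_cr)
    return volume * t * (h * d2h + dh * dh) / (4.0 * math.pi)


# Arbitrary units: H0 = T_cr = V = 1.
q_mid = transition_heat(0.5, 1.0, 1.0, 1.0)
jump_at_t_cr = specific_heat_difference(1.0, 1.0, 1.0, 1.0)
```

In this model Q vanishes at both T = 0 and $T = T_{cr}$ and is positive in between, the jump at $T_{cr}$ reduces to the Rutgers value, and the specific-heat difference changes sign at $T = T_{cr}/\sqrt{3}$, which is also where the entropy difference is greatest.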

We can similarly discuss effects related to the change in volume in the transition. To do
so, we differentiate equation (55.6) with respect to pressure (for given temperature), $H_{cr}$
being a function of P. This gives

$V_s - V_n = -\frac{V_s H_{cr}}{4\pi}\left(\frac{\partial H_{cr}}{\partial P}\right)_T$,   (55.14)

which determines the change in volume at the transition point.† For $T = T_{cr}$ this difference
is zero, like the entropy difference. The transition at temperatures $T < T_{cr}$, however, is
accompanied by a change in volume, which may be of either sign, depending on the sign of
the derivative $(\partial H_{cr}/\partial P)_T$. For $T = T_{cr}$ there is no change in volume, but the compressibility
is discontinuous; the discontinuity is easily found by differentiating equation (55.14). It
may be noted that, if we substitute in (55.14)

$\left(\frac{\partial H_{cr}}{\partial P}\right)_T = -\left(\frac{\partial H_{cr}}{\partial T}\right)_P\left(\frac{\partial T}{\partial P}\right)_{H_{cr}}$

(obtained by differentiating the equation $H_{cr}(P, T)$ = constant), we obtain the
Clapeyron-Clausius equation

$\left(\frac{\partial P}{\partial T}\right)_{H_{cr}} = \frac{Q}{T(V_n - V_s)}$,

where the derivative $(\partial P/\partial T)_{H_{cr}}$ defines the change in pressure needed to keep the applied
external field critical when the temperature changes.

The physical significance of the critical field $H_{cr}$ is much wider than would appear from
its definition in terms of the behaviour of a superconducting cylinder. The equation
$H = H_{cr}$ is a condition of equilibrium which must be fulfilled at every point of a surface
separating normal and superconducting phases in the same body. This is evident from the
following simple arguments. If a cylinder is in a longitudinal magnetic field $H_{cr}$, then both
the boundary conditions on the magnetic field and the conditions of thermodynamic
stability are satisfied for all states in which an interior cylindrical part is in the
superconducting state and the rest of the body is in the normal state, and the field at the
boundary between these parts is $H_{cr}$. Thus the surface of separation, on which $H = H_{cr}$, is
in neutral equilibrium with respect to its location. This is a characteristic property of phase
equilibrium.

In a variable magnetic field, the boundary between the superconducting and normal 
phases changes its position. The kinetics of this process is very complex, and its discussion 
requires a simultaneous solution of the equations of electrodynamics and of thermal 
conduction, taking into account the heat evolved in the phase transition. We shall not 
pause to carry out this investigation,† but merely give the boundary condition which must
be satisfied at the moving boundary between the normal and superconducting phases.

To derive this condition we take a coordinate system K' moving with the velocity v
of the boundary between the phases. By the formulae for transformation of fields, the
electric field E' in the system K' is related to the fields E and B in a fixed system K by
$\mathbf{E}' = \mathbf{E} + \mathbf{v} \times \mathbf{B}/c$; see (63.1). Since the boundary is at rest in the system K', the usual
condition of continuity of the tangential component of E' holds, i.e. $\mathbf{n} \times \mathbf{E}' = \mathbf{n} \times \mathbf{E} - v\mathbf{B}/c$
must be continuous, where n is a unit vector normal to the surface, in the direction of the
velocity v. In the superconducting phase E = B = 0, and in the normal phase $B = H_{cr}$ at
the boundary. We therefore find that a tangential electric field appears on the moving
boundary, its direction being perpendicular to that of the magnetic field and its magnitude
being

$E = vH_{cr}/c$.   (55.16)
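The step from the field transformation to (55.16) can be written out explicitly. The lines below are a worked sketch, assuming (as in the text) that v = vn is directed along the normal and that the magnetic field at the boundary is tangential, so that n · B = 0.

```latex
\begin{align*}
\mathbf{n}\times(\mathbf{v}\times\mathbf{B})
  &= \mathbf{v}\,(\mathbf{n}\cdot\mathbf{B}) - \mathbf{B}\,(\mathbf{n}\cdot\mathbf{v})
   = -v\,\mathbf{B}
   \qquad (\mathbf{v} = v\,\mathbf{n},\quad \mathbf{n}\cdot\mathbf{B} = 0), \\
\mathbf{n}\times\mathbf{E}'
  &= \mathbf{n}\times\mathbf{E} - v\,\mathbf{B}/c .
\end{align*}
% Superconducting side: E = 0 and B = 0, so n x E' = 0.
% Normal side: continuity of n x E' gives n x E = v B / c with |B| = H_cr,
% hence the tangential field has magnitude v H_cr / c and is perpendicular to B.
```

Since n × E' vanishes on the superconducting side, the tangential field on the normal side is fixed entirely by the velocity of the boundary and the critical field.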


§56. The intermediate state

If a superconducting body of any shape is in an external magnetic field $\mathfrak{H}$ which is
gradually increased, a stage is finally reached where the field at some point on the surface of
the body becomes equal to the critical field $H_{cr}$, but $\mathfrak{H}$ itself is still less than $H_{cr}$. For
example, on the surface of an ellipsoid placed in a field $\mathfrak{H}$ parallel to one of its axes, the
greatest value of the field occurs on the equator (see (54.3)), and is equal to $H_{cr}$ when

$\mathfrak{H} = H_{cr}(1 - n)$.   (56.1)

When $\mathfrak{H}$ increases further, the body cannot remain entirely in the superconducting state,
nor can it pass entirely into the normal state, because then the field would be $\mathfrak{H} < H_{cr}$
everywhere. Hence the superconductivity must be lost only in part.

At first sight one might imagine that this process occurs as follows. As $\mathfrak{H}$ increases the
superconductivity is lost in a gradually increasing part of the body, while a gradually
decreasing part remains superconducting, and the whole body becomes normal when
$\mathfrak{H} = H_{cr}$. It is easy to see, however, that such states of the body are thermodynamically
unstable. On the surface separating the superconducting and normal phases the magnetic
field is, as we know, tangential to the surface, and its magnitude is $H_{cr}$. That is, the lines of
force are on the surface. If the boundary is convex to the normal phase, the equipotential
surfaces of the field, being at right angles to the lines of force, will diverge into the normal
region, as shown by dashed lines in Fig. 33a. The field decreases, however, in the direction
in which the equipotential surfaces diverge, so that we should have $H < H_{cr}$ in the shaded
region, contrary to the supposition that this region is in the normal state. If, on the other
hand, the boundary of the superconducting phase is concave, then the lines of force on that
boundary must have a bend on the free surface of the superconducting region, to which the
field is tangential (at the point O in Fig. 33b). At a bend in a line of force, however, the field
becomes infinite, which again contradicts the boundary conditions at the surface of the
superconductor.

† See I. M. Lifshitz, … 90, 363, 1953.



Fig. 33 


The above arguments represent essentially another form of the situation which leads to
the domain structure of ferroelectrics and ferromagnets. Here also the conditions of
thermodynamic stability have the result that, if the magnetic field reaches the value $H_{cr}$ at
even one point on the surface, the body is divided into numerous parallel alternating
layers of normal and superconducting matter (L. D. Landau, 1937). This state of a
superconductor is called the intermediate state. As $\mathfrak{H}$ increases, the total volume of the
normal layers increases, and when $\mathfrak{H} = H_{cr}$ the body becomes entirely normal.

In general, a body of arbitrary shape need not be entirely in the intermediate state. There
may also be regions in the purely superconducting and purely normal states; these are
separated by the region which is in the intermediate state. A simpler case is the ellipsoid
already considered. In a field parallel to the axis the intermediate state occurs in the range

$(1 - n)H_{cr} < \mathfrak{H} < H_{cr}$,

and the whole volume of the ellipsoid is in this state. For a sphere n = 1/3, and
the intermediate state exists in the range $\tfrac{2}{3}H_{cr} < \mathfrak{H} < H_{cr}$; for a cylinder in a transverse
field n = 1/2 and the corresponding range is $\tfrac{1}{2}H_{cr} < \mathfrak{H} < H_{cr}$. For a cylinder in a
longitudinal field, n = 0; there is no intermediate state, and the superconductivity is totally
destroyed at $\mathfrak{H} = H_{cr}$. Finally, for a flat plate in a transverse field n = 1, and it is in the
intermediate state for any field $\mathfrak{H} < H_{cr}$.
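The shape dependence just described can be summarized in a few lines. The sketch below is an illustration with hypothetical function names; it returns the field interval of the intermediate state for a given demagnetizing factor and classifies the state of the ellipsoid in the averaged picture.

```python
def intermediate_range(n: float, h_cr: float):
    """Field interval (lower, upper) in which the ellipsoid is in the intermediate state."""
    if not 0.0 <= n <= 1.0:
        raise ValueError("demagnetizing factor must satisfy 0 <= n <= 1")
    return (1.0 - n) * h_cr, h_cr


def state(h_ext: float, n: float, h_cr: float) -> str:
    """Classify the ellipsoid for an external field h_ext >= 0 along a principal axis."""
    lower, upper = intermediate_range(n, h_cr)
    if h_ext >= upper:
        return "normal"
    if h_ext > lower:
        return "intermediate"
    return "superconducting"


# Sphere (n = 1/3), transverse cylinder (n = 1/2), flat plate (n = 1).
sphere_range = intermediate_range(1.0 / 3.0, 1.0)
cylinder_range = intermediate_range(0.5, 1.0)
plate_range = intermediate_range(1.0, 1.0)
```

For n = 0 the interval shrinks to a point, so the longitudinal cylinder passes directly from the superconducting to the normal state, whereas for n = 1 the lower limit is zero.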
The intermediate state can also be described in an averaged manner if the thickness of
the regions under consideration is large compared with the layer thickness (R. E. Peierls,
and F. London, 1936). In this description it is assumed that there is inside the body a
magnetic induction B which varies from zero in the purely superconducting state to $H_{cr}$ in
the purely normal state. If we ascribe a non-zero induction to the matter in the intermediate
state, we must also ascribe to it a definite magnetic "field" H. To determine the relation
between these quantities, we must consider the true structure of the intermediate state.

The magnetic field in a normal layer at its boundary with a superconducting layer is $H_{cr}$,
and by virtue of the assumed smallness of the layer thickness we can suppose that the field
has this value everywhere in the normal layers. In the superconducting layers B = 0. Hence,
averaging the magnetic field over a volume large compared with the layer thickness, we find
that the mean induction $B = x_n H_{cr}$, where $x_n$ is the fraction of the volume that is in the
normal state. Next, we determine the thermodynamic potential per unit volume of the
body, taking as zero the value for the purely superconducting state. In the absence of a
magnetic field, unit volume of the normal phase has an excess thermodynamic potential
$H_{cr}^2/8\pi$. When a magnetic field is present, a further $H_{cr}^2/8\pi$ is added as magnetic energy,
giving altogether $H_{cr}^2/4\pi$. The mean thermodynamic potential per unit volume in the
intermediate state is therefore

$\Phi = x_n H_{cr}^2/4\pi = H_{cr}B/4\pi$.   (56.2)

The relation between B and H is obtained from the general thermodynamic relation
$\mathbf{H} = 4\pi\,\partial\Phi/\partial\mathbf{B}$. In the present case we find that H is parallel to B and its magnitude is

$H = H_{cr}$,   (56.3)

i.e. it is independent of the induction.

If the relation between B and H is shown graphically (Fig. 34), then the segment OA of
the axis of abscissae corresponds to the superconducting state, and the line BC (B = H) to
the normal state. The vertical line AB ($H = H_{cr}$) corresponds to the intermediate state.

Let n be a unit vector in the direction of the lines of force of the averaged magnetic field.
Putting $\mathbf{H} = H_{cr}\mathbf{n}$ and substituting in the equation curl H = 0 (which holds in the absence
of a volume current), we find that curl n = 0. Since $\mathbf{n}^2 = 1$, we have

$\mathrm{grad}\,\mathbf{n}^2 = 2(\mathbf{n}\cdot\mathrm{grad})\mathbf{n} + 2\mathbf{n} \times \mathrm{curl}\,\mathbf{n} = 0$,

and therefore $(\mathbf{n}\cdot\mathrm{grad})\mathbf{n} = 0$. This means that the direction of the vector n is constant.
Thus the lines of force of the mean field are straight lines.

Fig. 34

† Here we neglect all magnetostriction effects. Instead of the change in the thermodynamic potential we could
therefore speak of the (equal) change in the free energy.

Let us apply these results to an ellipsoid in the intermediate state. For a uniform field
inside the ellipsoid, the relation $(1 - n)H + nB = \mathfrak{H}$ holds, whatever the relation between B
and H. Putting $H = H_{cr}$, we have

$B = [\mathfrak{H} - (1 - n)H_{cr}]/n$.   (56.4)

Thus the mean induction in the ellipsoid varies linearly with the external field, from zero
when $\mathfrak{H} = (1 - n)H_{cr}$ to $H_{cr}$ when $\mathfrak{H} = H_{cr}$.
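The linear variation of the mean induction can be tabulated with a short sketch. The function names are illustrative; the normal-state branch assumes μ = 1, so that the induction equals the external field, and the normal fraction uses the relation between the mean induction and the fraction of normal volume stated above.

```python
def mean_induction(h_ext: float, n: float, h_cr: float) -> float:
    """Mean induction B in the ellipsoid for an external field h_ext >= 0.

    Superconducting state: B = 0.  Intermediate state: (56.4).
    Normal state (mu = 1 assumed): B equals the external field.
    """
    if not 0.0 < n <= 1.0:
        raise ValueError("need 0 < n <= 1 for an intermediate state to exist")
    lower = (1.0 - n) * h_cr
    if h_ext <= lower:
        return 0.0
    if h_ext >= h_cr:
        return h_ext
    return (h_ext - lower) / n


def normal_fraction(h_ext: float, n: float, h_cr: float) -> float:
    """Fraction x_n = B / H_cr of the volume in the normal state, at most 1."""
    return min(mean_induction(h_ext, n, h_cr) / h_cr, 1.0)


# Sphere (n = 1/3) with H_cr = 1, halfway through the intermediate range.
b_half = mean_induction(5.0 / 6.0, 1.0 / 3.0, 1.0)
x_half = normal_fraction(5.0 / 6.0, 1.0 / 3.0, 1.0)
```

Halfway through the intermediate range of the sphere, half of the volume is in the normal state.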

We may also write down an expression for the total thermodynamic potential $\tilde{\varPhi}_t$ of an
ellipsoid in the intermediate state. To do so, we start from the general formula

(cf. (32.7)), which is also valid whatever the relation between B and H. Substituting $\Phi$, H
and B from (56.2)-(56.4), we obtain

= (56.5) 

V being the volume of the ellipsoid; $\tilde{\varPhi}_t$ is taken to be zero in the purely superconducting
state of the ellipsoid, in the absence of a magnetic field. For a superconducting ellipsoid in
an external field $\mathfrak{H}$ we have

$\tilde{\varPhi}_s = -\tfrac{1}{2}\mathscr{M}\mathfrak{H} = V\mathfrak{H}^2/8\pi(1 - n)$,   (56.6)

in accordance with (32.6) and (54.2). For $\mathfrak{H} = H_{cr}(1 - n)$, the thermodynamic potential and
its first derivative with respect to the temperature are continuous; in this sense, the
transition from the superconducting to the intermediate state is analogous to a second-order
phase transition.†
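A consistency check on the averaged description can be sketched as follows. It assumes that the thermodynamic potential changes with the external field according to $\partial\tilde{\varPhi}/\partial\mathfrak{H} = -\mathscr{M}$ (taken here by analogy with the corresponding relation for the free energy, not quoted from the text), and that the total moment in the intermediate state is $V(B - H)/4\pi = V(\mathfrak{H} - H_{cr})/4\pi n$, which follows from (56.3) and (56.4). The names are hypothetical.

```python
import math


def total_moment(h_ext: float, n: float, h_cr: float, volume: float) -> float:
    """Total magnetic moment of the ellipsoid in the averaged description."""
    if not 0.0 <= n < 1.0:
        raise ValueError("need 0 <= n < 1")
    lower = (1.0 - n) * h_cr
    if h_ext <= lower:
        # Superconducting state, (54.2).
        return -volume * h_ext / (4.0 * math.pi * (1.0 - n))
    if h_ext < h_cr:
        # Intermediate state: V (B - H) / 4 pi with (56.3) and (56.4).
        return volume * (h_ext - h_cr) / (4.0 * math.pi * n)
    # Normal state with mu = 1.
    return 0.0


def potential_gain(n: float, h_cr: float, volume: float) -> float:
    """Minus the integral of the moment over the field from 0 to H_cr.

    The moment is linear on each of the two segments, so one trapezoid
    per segment integrates it exactly.
    """
    lower = (1.0 - n) * h_cr
    m_lower = total_moment(lower, n, h_cr, volume)
    seg_superconducting = 0.5 * (0.0 + m_lower) * lower
    seg_intermediate = 0.5 * (m_lower + 0.0) * (h_cr - lower)
    return -(seg_superconducting + seg_intermediate)


gain_sphere = potential_gain(1.0 / 3.0, 2.0, 3.0)
expected = 3.0 * 2.0 ** 2 / (8.0 * math.pi)
```

Under these assumptions the result is $VH_{cr}^2/8\pi$ for every n, i.e. the excess of the normal state over the superconducting state given by (55.7), and the moment is continuous at $\mathfrak{H} = (1 - n)H_{cr}$, consistent with the continuity of the first derivative noted above.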

It should be emphasized that the averaged description of the intermediate state given
here is in reality not very accurate, because of the comparatively large thickness of the
layers. For the same reason, this description fails to reproduce certain phenomena due
to the properties of the layer structure. These include the fact that the transition from the
superconducting to the intermediate state actually occurs only when $\mathfrak{H}$ slightly exceeds
$(1-n)H_{cr}$. The reason for this "delay" is as follows. The passage into the intermediate state
occurs when that state becomes thermodynamically stable, i.e. when $\tilde{\mathscr{G}}_t < \tilde{\mathscr{G}}_s$. The layer
structure, however, has not only the "volume" energy (56.5) allotted to it in the averaged
description but also additional energy resulting from the existence of the boundaries
between the layers and their change in shape near the surface of the body. This results in
some displacement of the transition point towards stronger fields.

† It is therefore not surprising that $\tilde{\mathscr{G}}_t(\mathfrak{H}) < \tilde{\mathscr{G}}_s(\mathfrak{H})$ on both sides of the point $\mathfrak{H} = H_{cr}(1-n)$. In a second-
order phase transition, neither phase exists beyond the transition point, and hence it is meaningless to compare
the thermodynamic potentials of the two phases.

As mentioned in the footnote to §53, this chapter deals with type I superconductors.
Nevertheless, a minor comment may be made here concerning the thermodynamics of the
"magnetization curve" for a cylindrical type II superconductor. Such bodies are
characterized by a gradual penetration of the magnetic field into them. For example, with a
long cylindrical superconductor in a longitudinal magnetic field $\mathfrak{H}$, the penetration begins
when the field reaches some value $H_{cr,1}(T)$, and the superconductor changes continuously
into the normal state only at a field $H_{cr,2} > H_{cr,1}$.†

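We start from the relation $\partial\tilde{\mathscr{F}}/\partial\mathfrak{H} = -\mathscr{M}$ (32.4). Integrating both sides over $\mathfrak{H}$ from
0 to $H_{cr,2}$, we find

$$-\int_0^{H_{cr,2}}\mathscr{M}\,d\mathfrak{H} = \mathscr{F}_n - \mathscr{F}_{s0},$$

where $\mathscr{F}_{s0}$ relates to the superconductor in the absence of the field, and $\mathscr{F}_n$ is independent
of the external field (so that the tilde can be omitted from both). In the field range
$0 \le \mathfrak{H} \le H_{cr,1}$ the field does not penetrate into the cylinder, whose magnetic moment is
therefore $\mathscr{M} = -V\mathfrak{H}/4\pi$. Separating this part of the integral gives

$$\int_{H_{cr,1}}^{H_{cr,2}}\mathscr{M}\,d\mathfrak{H} = -(\mathscr{F}_n - \mathscr{F}_{s0}) + \frac{VH_{cr,1}^2}{8\pi}.$$

If $H_{cr}$ is defined as before, but purely formally, for a type II superconductor‡,
the relation obtained may be written in the final form

$$\int_{H_{cr,1}}^{H_{cr,2}}\mathscr{M}\,d\mathfrak{H} = -\frac{V}{8\pi}\left(H_{cr}^2 - H_{cr,1}^2\right). \tag{56.7}$$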


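PROBLEM

Determine the heat capacity of an ellipsoid in the intermediate state.

Solution. The entropy, and thence the heat capacity, are found by differentiating the thermodynamic
potential (56.5) with respect to temperature. Neglecting the terms containing the thermal-expansion coefficient,
we obtain

$$\mathscr{C}_t = \mathscr{C}_s + \frac{VT}{4\pi n}\left[(1-n)\left(H_{cr}'^2 + H_{cr}H_{cr}''\right) - \mathfrak{H}H_{cr}''\right],$$

the prime denoting differentiation with respect to T; $\mathscr{C}_s$ is the heat capacity of the body in the superconducting
state, whose dependence on $\mathfrak{H}$ we here neglect. Hence it follows that, as $\mathfrak{H}$ varies (at constant temperature),
the heat capacity changes discontinuously at the point $\mathfrak{H} = (1-n)H_{cr}$ from $\mathscr{C}_s$ to

$$\mathscr{C}_s + \frac{VT}{4\pi n}(1-n)H_{cr}'^2,$$

and thereafter varies linearly with $\mathfrak{H}$, reaching the value

$$\mathscr{C}_s + \frac{VT}{4\pi n}\left[(1-n)H_{cr}'^2 - nH_{cr}H_{cr}''\right]$$

at $\mathfrak{H} = H_{cr}$, whence it falls discontinuously to $\mathscr{C}_n$.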


† The state of a type II superconductor in the field range between $H_{cr,1}$ and $H_{cr,2}$ is called the
mixed state. It should be emphasized that this is not the same as the intermediate state of a type I superconductor. In
the mixed state, the magnetic field penetrates into the sample in the form of what are called vortex filaments.

‡ The value of $H_{cr}$ lies between those of $H_{cr,1}$ and $H_{cr,2}$. It has, however, no distinctive property in a type II
superconductor.

§57. Structure of the intermediate state

The shape and size of the normal and superconducting layers in the intermediate state 
are governed by the conditions of thermodynamic equilibrium for the body as a whole, in 
the same manner as was the shape of the domains in a ferromagnet (§44). As there, the 
resulting thickness of the layers is determined by two oppositely acting factors. The 
surface tension at the normal and superconducting phase boundaries tends to reduce the 
number of layers, i.e. to increase their thickness. The energy of emergence of the layers at 
the free surface of the body has the opposite tendency. The layer thickness increases with 
the size of the body, and consequently (for the same reasons as in the case of ferromagnetic 
domains) the layers must ultimately undergo branching near the surface.†

The problem of determining the shape and size of unbranched layers in the intermediate
state in a flat plate can be solved exactly. We shall do this on the assumption that the
external field $\mathfrak{H}$ is perpendicular to the plate (L. D. Landau, 1937).

The layers are parallel to the field, except near the surface of the plate. The lines of
magnetic force (shown dashed in Fig. 35, p. 196) pass only through the normal layers, and
the boundaries of the superconducting layers are also lines of force, since $B_n = 0$ there.
Since also $H = H_{cr}$ on the boundary between the normal and superconducting phases, the
conditions at the boundaries of a superconducting layer are

$$\begin{aligned} &\text{on } BC: && H_x = 0, \\ &\text{on } BA \text{ and } CD: && H^2 \equiv H_x^2 + H_y^2 = H_{cr}^2, \end{aligned} \tag{57.1}$$

the coordinate axes being taken as shown in Fig. 35. Far from the plate, the field H must be
the same as the external field $\mathfrak{H}$, i.e.

$$\text{for } x \to -\infty: \qquad H_x = \mathfrak{H}, \quad H_y = 0. \tag{57.2}$$

We use the scalar and vector potentials: $H_x = -\partial\phi/\partial x = \partial A/\partial y$, $H_y = -\partial\phi/\partial y =
-\partial A/\partial x$, and the complex potential $w = \phi - iA$ (cf. §3).

On a line of force A = constant. We put A = 0 on the line of force which reaches O and
then branches into OCD and OBA, forming the boundary of one superconducting layer.
The difference between the values of A at the boundaries of two successive superconducting
layers is equal to the magnetic flux across the segment $a = a_s + a_n$, namely $\mathfrak{H}a$. Hence
the value of A at the boundary of any superconducting layer is an integral multiple of $\mathfrak{H}a$.
Using also the "complex field" $\eta = H_x - iH_y = -dw/dz$, $z = x + iy$, we can write
the conditions (57.1) as

$$\begin{aligned} &\text{on } BC: && \operatorname{re}\eta = 0, \\ &\text{on } BA \text{ and } CD: && |\eta| = H_{cr}. \end{aligned} \tag{57.3}$$


We introduce a new variable

$$\zeta = \exp(-2\pi w/\mathfrak{H}a) - 1 \tag{57.4}$$

and regard $\eta$ as a function of $\zeta$. $\zeta$ is real on all boundary lines of force and on their
continuations beyond the plate: $\zeta = \exp(-2\pi\phi/\mathfrak{H}a) - 1$.
Since $\phi$ is determined apart from a constant, its value at any one point can be chosen
arbitrarily. Let $\phi = 0$ at O. Then $\zeta = 0$ there also. On the limiting line of force considered,
far from the plate, $\zeta = -1$ (since for $x \to -\infty$ we have $\phi \to +\infty$). The value of $\zeta$ at
B or C, where the line of force enters the plate, is $\zeta_0$, say. On CD and BA, $\zeta$ varies from $\zeta_0$
to $\infty$. Then the conditions (57.2) and (57.3) can be written

$$\text{for } \zeta = -1: \qquad \eta = \mathfrak{H}, \tag{57.5}$$

$$\begin{aligned} &\text{for } 0 < \zeta < \zeta_0: && \operatorname{re}\eta = 0, \\ &\text{for } \zeta_0 < \zeta: && |\eta| = H_{cr}. \end{aligned} \tag{57.6}$$

The function $\eta(\zeta)$ must, furthermore, be everywhere finite.

† In certain conditions, when h is near zero or unity, a filamentary structure may be
thermodynamically more favourable than the layer structure; see E. R. Andrew, Proceedings of the Royal Society ...

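The conditions (57.6) are satisfied by the function

$$\eta = H_{cr}\left[\sqrt{1 - \frac{\zeta_0}{\zeta}} - \sqrt{-\frac{\zeta_0}{\zeta}}\right]. \tag{57.7}$$

For real negative $\zeta$ the two roots are real, and are taken with the signs shown. For
$0 < \zeta < \zeta_0$ both are imaginary, and we take

$$\eta = \mp iH_{cr}\left[\sqrt{\frac{\zeta_0}{\zeta}} - \sqrt{\frac{\zeta_0}{\zeta} - 1}\right]$$

with the minus and plus signs on OC and OB respectively. For $\zeta > \zeta_0$

$$\eta = H_{cr}\left[\sqrt{1 - \frac{\zeta_0}{\zeta}} \mp i\sqrt{\frac{\zeta_0}{\zeta}}\right]$$

with the minus and plus signs on CD and BA respectively. The value of $\zeta_0$ is found from the
condition (57.5), and is

$$\zeta_0 = (1 - h^2)^2/4h^2, \tag{57.8}$$

where $h = \mathfrak{H}/H_{cr}$.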
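
The three requirements placed on the complex field (the value far from the plate, a vanishing real part on the plate surface, and a modulus equal to the critical field on the layer boundaries) can be checked numerically. The sketch below is an illustration under stated assumptions: it takes the complex field in the form $\eta/H_{cr} = \sqrt{1 - \zeta_0/\zeta} - \sqrt{-\zeta_0/\zeta}$ with $\zeta_0 = (1-h^2)^2/4h^2$, uses the principal branch of the square root (which selects one of the two sign choices), and the function names are hypothetical.

```python
import cmath


def zeta0(h: float) -> float:
    """Value of zeta at the corner points B and C; h = external field / H_cr."""
    return (1.0 - h * h) ** 2 / (4.0 * h * h)


def eta_over_hcr(zeta: float, h: float) -> complex:
    """Complex field eta / H_cr on the real zeta axis (principal square roots)."""
    z0 = zeta0(h)
    return cmath.sqrt(1.0 - z0 / zeta) - cmath.sqrt(-z0 / zeta)


if __name__ == "__main__":
    h = 0.5
    z0 = zeta0(h)
    # Far from the plate (zeta = -1) the field should equal the external field h.
    print("zeta = -1        :", eta_over_hcr(-1.0, h))
    # On the plate surface (0 < zeta < zeta0) the real part should vanish.
    print("0 < zeta < zeta0 :", eta_over_hcr(0.5 * z0, h))
    # On the layer boundary (zeta > zeta0) the modulus should be 1.
    print("zeta > zeta0     :", abs(eta_over_hcr(3.0 * z0, h)))
```

The same function also stays finite as $\zeta \to 0$, which is the reason for the relative minus sign between the two roots.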

The shape of the layer, i.e. the equation of the limiting line of force, is obtained by
integrating the relation $dz = -dw/\eta$ over real $\zeta$:

$$z = \frac{\mathfrak{H}a}{2\pi}\int\frac{d\zeta}{\eta(\zeta + 1)}. \tag{57.9}$$

Substituting $\eta(\zeta)$, taking real and imaginary parts, and choosing appropriately the
constants of integration, we obtain the following parametric equations of the line CD:

$$x = \frac{ah}{\pi}\left[\cosh^{-1}\sqrt{\frac{\zeta}{\zeta_0}} - \sqrt{1+\zeta_0}\,\tanh^{-1}\sqrt{\frac{\zeta - \zeta_0}{\zeta(1+\zeta_0)}}\right], \qquad Y - y = \frac{ah}{\pi}\sqrt{\zeta_0}\left(\tfrac{1}{2}\pi - \tan^{-1}\sqrt{\zeta}\right), \tag{57.10}$$

where $Y = \tfrac{1}{2}a_s$ is the value of y for $x \to \infty$; see Fig. 35.

The period a of the layer structure is related to the thicknesses $a_s$ and $a_n$ of the
superconducting and normal layers by $a = a_s + a_n$, $\mathfrak{H}a = a_nH_{cr}$. The latter equation
follows from the continuity of the magnetic flux, which passes entirely through normal
layers. Hence $a_s = a(1-h)$, $a_n = ah$.

The period a is determined by the condition that the total thermodynamic potential of
the plate be a minimum. The existence of surface tension at the boundaries between the
normal and superconducting phases gives a term

$$\frac{2lH_{cr}^2\Delta}{8\pi a}$$

in the thermodynamic potential per unit area of the surface of the plate. Here l is the
thickness of the plate, and the surface-tension coefficient is written as $H_{cr}^2\Delta/8\pi$, where $\Delta$
has the dimensions of length. In calculating this part of the energy we can, of course, neglect
the curvature of the layers near the surface of the plate.

The energy of emergence of the layers at the surface of the plate can be written as the sum
of two parts. First, the increase in the volume of the normal layers as compared with the
volume they would occupy if they were everywhere plane-parallel gives an additional
energy

$$\mathscr{G}_1 = \frac{4}{a}\,\frac{H_{cr}^2}{8\pi}\int_0^\infty (Y - y)\,dx, \tag{57.11}$$

where the factor 4 takes into account the presence of four angles (such as B and C in Fig. 35)
on the two sides of each of the 1/a superconducting layers.

Second, the emergence of the layers at the surface of the plate changes the energy of the
system in the external field, i.e. the energy $-\mathscr{M}\cdot\mathfrak{H}$. The magnetic moment of the plate is
due to currents on the surfaces of the superconducting layers. When the tangential
component of the induction changes discontinuously from H to zero, the surface current
density is $g = \pm cH/4\pi$. Hence the magnetic moment per unit length in the z-direction and
per boundary surface of the superconducting layer is

$$-\frac{1}{4\pi}\int_{OCD}Hy\,ds, \qquad ds = \sqrt{dx^2 + dy^2}.$$

If the layer did not emerge at the surface, there would be no segment OC, and on CD we
should have y = Y. Hence the excess magnetic moment for each of the four angles is

$$-\frac{1}{4\pi}\left[\int_{OCD}Hy\,ds - \int_0^\infty H_{cr}Y\,dx\right].$$

Accordingly, the excess energy is

$$\mathscr{G}_2 = \frac{\mathfrak{H}}{\pi a}\left[-\int_0^\infty H_{cr}Y\,dx + \int_{OCD}Hy\,ds\right] = \frac{\mathfrak{H}}{\pi a}\left[H_{cr}\int_{CD}(-Y\,dx + y\,ds) + \int_{OC}Hy\,dy\right]. \tag{57.12}$$

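The coordinates x and y, expressed in terms of $\zeta$, are proportional to a. Hence all the
integrals in $\mathscr{G}_1 + \mathscr{G}_2$ are proportional to $a^2$, and this part of the thermodynamic potential
is therefore proportional to a. The sum of $\mathscr{G}_1$, $\mathscr{G}_2$ and the surface-tension term is therefore of the form

$$\mathscr{G} = \frac{H_{cr}^2}{4\pi}\left[\frac{l\Delta}{a} + af(h)\right]. \tag{57.13}$$

The condition that this be a minimum gives

$$a = \sqrt{l\Delta/f(h)}. \tag{57.14}$$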

The integrals in (57.11) and (57.12) can be calculated†, and the result obtained for f(h) is

$$f(h) = \frac{1}{4\pi}\left\{(1+h)^4\log(1+h) + (1-h)^4\log(1-h) - (1+h^2)^2\log(1+h^2) - 4h^2\log 8h\right\}. \tag{57.15}$$

† See A. Fortini and E. Paumier, Physical Review B 5, 1850, 1972.

Fig. 36

The limiting forms of this are

$$f(h) = \frac{h^2}{\pi}\log\frac{0.56}{h} \quad \text{for } h \ll 1; \qquad f(h) = \frac{\log 2}{\pi}(1-h)^2 \quad \text{for } 1 - h \ll 1. \tag{57.16}$$

Figure 36 shows a graph of f(h).
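
The function f(h) and the resulting period of the layer structure are easy to evaluate numerically. The sketch below is an illustration, not part of the original derivation: it assumes the closed form $f(h) = (1/4\pi)\{(1+h)^4\log(1+h) + (1-h)^4\log(1-h) - (1+h^2)^2\log(1+h^2) - 4h^2\log 8h\}$ and the relation $a = \sqrt{l\Delta/f(h)}$; the function names and the sample values of the plate thickness and of the length $\Delta$ are hypothetical.

```python
import math


def f_landau(h: float) -> float:
    """Dimensionless function f(h) for 0 < h < 1, h = external field / H_cr."""
    if not 0.0 < h < 1.0:
        raise ValueError("h must lie strictly between 0 and 1")
    total = ((1.0 + h) ** 4 * math.log1p(h)
             + (1.0 - h) ** 4 * math.log1p(-h)
             - (1.0 + h * h) ** 2 * math.log1p(h * h)
             - 4.0 * h * h * math.log(8.0 * h))
    return total / (4.0 * math.pi)


def layer_period(plate_thickness: float, delta: float, h: float) -> float:
    """Period a = sqrt(l * Delta / f(h)); lengths in the same (arbitrary) unit."""
    return math.sqrt(plate_thickness * delta / f_landau(h))


if __name__ == "__main__":
    l_cm, delta_cm = 1.0, 1.0e-5          # illustrative values only
    for h in (0.1, 0.5, 0.9):
        a = layer_period(l_cm, delta_cm, h)
        # Thicknesses of superconducting and normal layers: a(1 - h) and a*h.
        print(f"h = {h:.1f}: f = {f_landau(h):.5f}, a = {a:.4e} cm, "
              f"a_s = {a * (1 - h):.4e} cm, a_n = {a * h:.4e} cm")
```

Comparing `f_landau` with the two limiting expressions for small h and for h close to 1 is a quick way to confirm that the closed form and its asymptotes agree.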

It should be noted that, in normal layers near the surface of the plate, the magnetic field
may be considerably less than $H_{cr}$, i.e. there is a situation corresponding to that shown in
Fig. 33a.† In this case the unfavourable thermodynamic state is made possible by the
surface-tension energy, which prevents further reduction in the layer thickness.

As already mentioned, as the plate thickness increases, branching of the layers must
occur. This in turn affects the dependence of the period a of the structure on l; in the limit of
multiple branching, $a \propto l^{2/3}$. Numerical calculations show, however, that branching
should begin only at a comparatively late stage.‡


† For instance, when $h = \tfrac{1}{2}$ the field on the surface in the centre of the normal layer is only $0.73H_{cr}$, and as
$h \to 0$ it tends to $0.65H_{cr}$.

‡ The calculation for a model with multiply branched layers is given by L. D. Landau, Journal of Physics 7,
1943 (Collected Papers, Pergamon Press, Oxford, 1965, p. 365).

QUASI-STATIC ELECTROMAGNETIC FIELD


§58. Equations of the quasi-static field 

So far we have discussed only static electric and magnetic fields, and have used Maxwell's
equation

$$\operatorname{curl}\mathbf{E} = -\frac{1}{c}\frac{\partial\mathbf{B}}{\partial t} \tag{58.1}$$

only as a step in deriving the expression for the energy of a magnetic field (§31).

The nature of the variable electromagnetic fields in matter depends greatly on the kind of 
matter concerned and on the order of magnitude of the frequency of the field. In the 
present section we shall consider the phenomena which occur in extended conductors 
placed in a variable external magnetic field. We shall assume that the rate of change of the 
field is not too large, and therefore satisfies various conditions which will be derived below. 
Electromagnetic fields and currents which satisfy these conditions are said to be quasi-static.

We shall first of all suppose that the wavelength $\lambda \sim c/\omega$ which corresponds (in the
vacuum or dielectric surrounding the conductor) to the field frequency $\omega$ is large compared
with the dimension l of the body: $\omega \ll c/l$. Then the magnetic field distribution outside the
conductor at any instant can be described by the equations of a static field:

$$\operatorname{div}\mathbf{B} = 0, \qquad \operatorname{curl}\mathbf{H} = 0, \tag{58.2}$$

all effects due to the finite velocity of propagation of electromagnetic disturbances being
neglected. Of course, this neglect is permissible only at distances from the body which are
small compared with $\lambda$; these are the only distances which need be considered in
determining the field inside the body.

The complete system of field equations inside the conductor consists of (58.1) together
with

$$\operatorname{div}\mathbf{B} = 0, \tag{58.3}$$

$$\operatorname{curl}\mathbf{H} = 4\pi\mathbf{j}/c, \qquad \mathbf{j} = \sigma\mathbf{E}; \tag{58.4}$$

in an electrically anisotropic (non-cubic) crystal, we must write $j_i = \sigma_{ik}E_k$. The second
of these equations has been derived, strictly speaking, only for steady currents
and static magnetic fields. It is therefore necessary to specify conditions under which
this equation can reliably be used for variable fields. In equation (58.4) the current has been
written in terms of the electric field in accordance with the relation $\mathbf{j} = \sigma\mathbf{E}$ with constant $\sigma$,
which holds for a steady state. This relation remains valid if the period of the field is large
compared with the characteristic times of microscopic conduction. That is, the field
frequency must be small compared with the reciprocal mean free time of the electrons in
the conductor. For typical metals at room temperature, the limiting frequencies given by
this condition lie in the infra-red region of the spectrum.†

There is another condition which restricts the applicability of the equations in this case.
Equation (58.4) assumes that the relation between the current and the field is a local one, i.e.
that the current density at a point in the conductor depends on the field at that point only.
This in turn presupposes that the electron mean free path is small compared with the
distances over which the field changes appreciably. We shall return to this condition in §59.

In equations (58.1) and (58.4), E is the induced electric field resulting from the variation
of the magnetic field. When H is known, the field E can be immediately determined by
equation (58.4). The equation for H is obtained by eliminating E from (58.1) and (58.4):


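$$\operatorname{curl}\left(\frac{1}{\sigma}\operatorname{curl}\mathbf{H}\right) = -\frac{4\pi}{c^2}\frac{\partial\mathbf{B}}{\partial t}. \tag{58.5}$$

In a homogeneous medium with constant conductivity $\sigma$ and constant magnetic
permeability $\mu$, the factor $1/\sigma$ can be taken in front of the curl operator, and by (58.3) we
have $\operatorname{div}\mathbf{B} = \mu\operatorname{div}\mathbf{H} = 0$. Hence $\operatorname{curl}\operatorname{curl}\mathbf{H} = -\Delta\mathbf{H}$, and we obtain the equation

$$\Delta\mathbf{H} = \frac{4\pi\mu\sigma}{c^2}\frac{\partial\mathbf{H}}{\partial t}. \tag{58.6}$$

With the equation div H = 0 this suffices to determine the magnetic field. It may be noted
that equation (58.6) is a heat-conduction equation, the thermometric conductivity $\chi$ being
represented by $c^2/4\pi\mu\sigma$.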

The boundary conditions on the magnetic field at the surface of a conductor are evident
from the form of the equations, and are as before

$$B_{n1} = B_{n2}, \qquad \mathbf{H}_{t1} = \mathbf{H}_{t2}. \tag{58.7}$$

The expression on the right-hand side of equation (58.4), being bounded, does not affect
the second of (58.7). For $\mu = 1$ we can put simply‡

$$\mathbf{H}_1 = \mathbf{H}_2. \tag{58.8}$$

From (58.4) we have div j = 0; the boundary condition on this equation is that $j_n = 0$ at
the surface of the conductor. In an electrically isotropic conductor, it follows (since
$\mathbf{j} = \sigma\mathbf{E}$) that $E_n^{(i)} = 0$ at the boundary, the index (i) denoting the field inside the conductor;
in the general case of an anisotropic conductor, the normal component of the field in the
conductor at its surface is in general not zero.

The boundary condition (58.8) is insufficient for a complete formulation of the problem
if the conductor is composite and its parts have different conductivities. At the interfaces
between the parts we must use both the continuity of H and that of $\mathbf{E}_t$; the latter implies the
condition

$$(\operatorname{curl}\mathbf{H})_{t1}/\sigma_1 = (\operatorname{curl}\mathbf{H})_{t2}/\sigma_2 \tag{58.9}$$

on the magnetic field.

† For poor conductors (e.g. semiconductors), equation (58.4) is valid only if a further condition, which may be
more stringent, is satisfied. For such bodies it may be possible to define both a conductivity and a permittivity $\varepsilon$.
Then a term $(\varepsilon/c)\,\partial\mathbf{E}/\partial t$ is added to the right-hand side of (58.4), and the condition for this term to be small in
comparison with $4\pi\sigma\mathbf{E}/c$ is $\sigma/\omega \gg \varepsilon$. In good conductors (e.g. metals), on the other hand, $\sigma/\omega \gg 1$ throughout the
frequency range in which the conductivity can be regarded as constant (see also the second footnote to ...).

‡ For ordinary diamagnetic and paramagnetic bodies, $\mu$ is very nearly 1, and the inclusion of $\mu$ in the
formulae would be a pointless refinement. Values of $\mu$ differing considerably from 1 occur in ferromagnetic
metals, whose magnetic properties (in sufficiently weak fields) can be described in terms of a large constant
permeability. For quite moderate frequencies, however, such substances exhibit a dispersion of $\mu$ (a
dependence of $\mu$ on the frequency $\omega$), together with a decrease of $\mu$ almost to 1. We shall therefore put $\mu = 1$ in the
present chapter.

Suppose that a conductor is placed in an external magnetic field which is suddenly
removed. The field in and around the conductor does not vanish immediately; the manner
of its decay with time is given by equation (58.6). To solve a problem of this kind, we use the
following procedure, a general technique in mathematical physics. We seek solutions of
equation (58.6) which have the form $\mathbf{H} = \mathbf{H}_m(x, y, z)e^{-\gamma_mt}$, where $\gamma_m$ is a constant. The
equation for the function $\mathbf{H}_m(x, y, z)$ is then

$$(c^2/4\pi\sigma)\Delta\mathbf{H}_m = -\gamma_m\mathbf{H}_m. \tag{58.10}$$

For a conductor of given shape, this equation has non-zero solutions (satisfying the
necessary boundary conditions) only for certain $\gamma_m$, the eigenvalues of (58.10), all of which
are real and positive.† The corresponding functions $\mathbf{H}_m(x, y, z)$ form a complete set of
orthogonal vector functions. Let the field distribution at the initial instant be $\mathbf{H}_0(x, y, z)$.
On expanding this in terms of the functions $\mathbf{H}_m$:

$$\mathbf{H}_0(x, y, z) = \sum_m c_m\mathbf{H}_m(x, y, z),$$

we obtain the solution of the problem;

$$\mathbf{H}(x, y, z, t) = \sum_m c_me^{-\gamma_mt}\mathbf{H}_m(x, y, z) \tag{58.11}$$

gives the manner of decay of the field with time.

The rate of decay is determined principally by the term in the sum for which $\gamma_m$ is least; let
this be $\gamma_1$. The decay time of the field may be defined as $\tau = 1/\gamma_1$. The order of magnitude
of $\tau$ is evident from equation (58.10). Since $\Delta\mathbf{H} \sim \mathbf{H}/l^2$, where l is the dimension of the
conductor, we have

$$\tau \sim 4\pi\sigma l^2/c^2. \tag{58.12}$$
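
As a rough numerical illustration of the estimate (58.12), the sketch below evaluates the decay time for a body of given size. Everything specific in it is an assumption introduced here rather than taken from the text: the function name, the room-temperature conductivity of copper (about $5.96 \times 10^7$ S/m), and the conversion of an SI conductivity to Gaussian units by the factor $1/4\pi\varepsilon_0$. The result is an order-of-magnitude figure only, since (58.12) omits a shape-dependent numerical factor.

```python
import math

C_CM_PER_S = 2.99792458e10                 # speed of light, cm/s
SI_TO_GAUSSIAN_SIGMA = 8.9875517873681764e9  # 1/(4*pi*eps0): S/m -> 1/s


def decay_time_estimate(sigma_gauss: float, size_cm: float) -> float:
    """Order-of-magnitude decay time tau ~ 4*pi*sigma*l^2/c^2 (Gaussian units)."""
    return 4.0 * math.pi * sigma_gauss * size_cm ** 2 / C_CM_PER_S ** 2


if __name__ == "__main__":
    sigma_copper_si = 5.96e7               # S/m, assumed room-temperature value
    sigma_copper = sigma_copper_si * SI_TO_GAUSSIAN_SIGMA
    for size in (0.1, 1.0, 10.0):          # body dimension in cm
        tau = decay_time_estimate(sigma_copper, size)
        print(f"l = {size:5.1f} cm  ->  tau ~ {tau:.2e} s")
```

In SI units the same estimate reads $\tau \sim \mu_0\sigma l^2$, which gives a convenient cross-check of the unit conversion.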

§59. Depth of penetration of a magnetic field into a conductor 

Let us consider a conductor in an external magnetic field which varies with frequency $\omega$.
The magnetic field penetrates into the conductor and induces in it a variable electric field,
which in turn causes currents to appear; these are called eddy currents.‡ A general idea of
the way in which the field penetrates into the conductor can be obtained from the analogy
already mentioned between equation (58.6) and the equation of thermal conduction. It is
known from the theory of thermal conduction that a quantity which satisfies such an
equation is propagated through a distance $\sim\sqrt{\chi t}$ in time t. We can therefore
immediately conclude that the magnetic field penetrates into the conductor to a distance $\delta$
given in order of magnitude by $\delta \sim \sqrt{c^2/\sigma\omega}$. The same is true, of course, of the induced
electric field and currents.

† This is easily seen as follows. So as to avoid having to take account of the boundary conditions at the surface
of the body, we start from equation (58.5) and suppose $\sigma$ to vanish continuously outside the body. Multiplying
both sides of the equation

$$-4\pi\gamma_m\mathbf{H}_m/c^2 = -\operatorname{curl}[(1/\sigma)\operatorname{curl}\mathbf{H}_m]$$

by $\mathbf{H}_m^*$ and integrating over all space, we have

$$\frac{4\pi\gamma_m}{c^2}\int|\mathbf{H}_m|^2\,dV = \int\mathbf{H}_m^*\cdot\operatorname{curl}\left(\frac{1}{\sigma}\operatorname{curl}\mathbf{H}_m\right)dV = \int\frac{1}{\sigma}|\operatorname{curl}\mathbf{H}_m|^2\,dV,$$

from which it follows that $\gamma_m$ is real and positive.

In a variable field with frequency $\omega$, all quantities depend on the time through a factor
$e^{-i\omega t}$. Equation (58.6) then becomes

$$\Delta\mathbf{H} = -4\pi i\sigma\omega\mathbf{H}/c^2. \tag{59.1}$$

Let us consider two limiting cases. If the penetration depth $\delta$ is large compared with the
dimension of the body (low frequencies), we can put the right-hand side of (59.1) equal to
zero as a first approximation. Then the magnetic field distribution at any instant will be the
same as it would be in a steady state with the same external field far from the body. Let this
solution be $\mathbf{H}_{st}$; it is independent of the frequency (or rather involves the frequency only in
the time factor $e^{-i\omega t}$). The induced electric field appears only in the next approximation,
being absent in the steady state. This corresponds to the fact that $\operatorname{curl}\mathbf{H}_{st} = 0$, and so the
value of E obtained from (58.4) is zero. To calculate E, therefore, we must use equation
(58.1), according to which

$$\operatorname{curl}\mathbf{E} = i\omega\mathbf{H}_{st}/c. \tag{59.2}$$

This equation, together with div E = 0 (which follows from (58.4) when $\sigma$ is constant in the
body), entirely determines the electric field distribution. It is seen to be proportional to the
frequency $\omega$.

The opposite limiting case is that where $\delta \ll l$ (high frequencies). The condition for the
field equations to be local (§58) requires that $\delta$ should still be large compared with the
mean free path of the conduction electrons.†

When $\delta \ll l$ the magnetic field penetrates only into a thin surface layer of the
conductor. In calculating the field outside the conductor we can neglect the thickness of
this layer, i.e. assume that the magnetic field does not penetrate into the conductor at all. In
this sense a conductor in a high-frequency magnetic field behaves like a superconductor in
a constant field, and the field outside it must be calculated by solving the corresponding
steady-state problem for a superconductor of the same shape.

The true field distribution in the surface layer of the conductor can be investigated in a
general manner by regarding small regions of the surface as plane. It is necessary to solve
equation (59.1) for a conducting medium bounded by a plane surface, outside which the
field has a given value $\mathbf{H}_0e^{-i\omega t}$, say. This vector is obtained as shown above, by solving the
problem for a semi-infinite medium, and is parallel to the surface of the conductor. The
boundary condition (58.8) shows that the magnetic field in the conductor is also $\mathbf{H}_0e^{-i\omega t}$ at
the surface.

We take the surface of the conductor as the xy-plane, the conducting medium being in
z > 0. Since the conditions of the problem are independent of x and y, the required field
depends only on the z coordinate (and on the time). We therefore have $\operatorname{div}\mathbf{H} = \partial H_z/\partial z = 0$,
and since $H_z = 0$ at the boundary it must be zero everywhere. By (59.1), the equation
for H is $\partial^2\mathbf{H}/\partial z^2 + k^2\mathbf{H} = 0$, where $k = \sqrt{4\pi i\sigma\omega}/c = (1+i)\sqrt{2\pi\sigma\omega}/c$. The solution
of this equation which vanishes far from the surface ($z \to \infty$) is proportional to $e^{ikz}$. Using
the boundary condition at z = 0, we obtain

$$\mathbf{H} = \mathbf{H}_0e^{-z/\delta}e^{i(z/\delta - \omega t)}, \tag{59.3}$$

where the penetration depth $\delta$ is

$$\delta = c/\sqrt{2\pi\sigma\omega} \quad \text{and} \quad k = (1+i)/\delta. \tag{59.4}$$

† This condition is, in fact, the first to be violated in metals as the frequency increases. The condition $\omega \ll 1/\tau$,
where $\tau$ is the mean free time, may, however, be the more stringent for semiconductors of low conductivity.
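
The penetration depth $\delta = c/\sqrt{2\pi\sigma\omega}$ is simple to evaluate. The sketch below is illustrative only: the function names, the copper conductivity (about $5.96 \times 10^7$ S/m) and the SI-to-Gaussian conversion factor $1/4\pi\varepsilon_0$ are assumptions introduced here. It also compares the Gaussian expression with the equivalent SI form $\sqrt{2/\mu_0\sigma\omega}$ as a consistency check on the units.

```python
import math

C_CM_PER_S = 2.99792458e10                   # speed of light, cm/s
SI_TO_GAUSSIAN_SIGMA = 8.9875517873681764e9  # 1/(4*pi*eps0): S/m -> 1/s
MU0_SI = 4.0e-7 * math.pi                    # vacuum permeability, H/m (approx.)


def skin_depth_cm(sigma_gauss: float, omega: float) -> float:
    """Penetration depth delta = c / sqrt(2*pi*sigma*omega), Gaussian units (cm)."""
    return C_CM_PER_S / math.sqrt(2.0 * math.pi * sigma_gauss * omega)


def skin_depth_si_m(sigma_si: float, omega: float) -> float:
    """Equivalent SI expression sqrt(2 / (mu0*sigma*omega)), in metres."""
    return math.sqrt(2.0 / (MU0_SI * sigma_si * omega))


if __name__ == "__main__":
    sigma_si = 5.96e7                        # copper, assumed value in S/m
    for freq_hz in (50.0, 1.0e3, 1.0e6):
        omega = 2.0 * math.pi * freq_hz
        d_cm = skin_depth_cm(sigma_si * SI_TO_GAUSSIAN_SIGMA, omega)
        d_m = skin_depth_si_m(sigma_si, omega)
        print(f"f = {freq_hz:9.1f} Hz: delta = {d_cm:.4e} cm (SI form: {d_m * 100:.4e} cm)")
```

Since $\delta \propto \omega^{-1/2}$, raising the frequency by a factor of 100 reduces the depth tenfold.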


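The electric field is now determined by means of equation (58.4). If n is a unit vector in
the z-direction, we have

$$\mathbf{E} = \sqrt{\omega/8\pi\sigma}\,(1-i)\,\mathbf{H}\times\mathbf{n}. \tag{59.5}$$

Thus $E \sim H\delta/\lambda$.

If the field $\mathbf{H}_0e^{-i\omega t}$ is linearly polarized, then $\mathbf{H}_0$ can be made real by a suitable choice of
the origin of time. We then take the direction of $\mathbf{H}_0$ as the y-axis. Taking the real part in
(59.3) and (59.5), we obtain

$$H_y = H_0e^{-z/\delta}\cos\left(\frac{z}{\delta} - \omega t\right), \qquad E_x = H_0\sqrt{\frac{\omega}{4\pi\sigma}}\,e^{-z/\delta}\cos\left(\frac{z}{\delta} - \omega t - \frac{\pi}{4}\right). \tag{59.6}$$

The eddy current density $\mathbf{j} = \sigma\mathbf{E}$ has the same distribution as E.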
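
The passage from the complex fields to their real parts can be verified numerically. The sketch below is a check under stated assumptions, not part of the original text: it takes $H = H_0e^{-z/\delta}e^{i(z/\delta - \omega t)}$ along y and $E = \sqrt{\omega/8\pi\sigma}\,(1-i)H$ along x, with real $H_0$, and compares the real parts with the explicit cosine expressions, including the phase lag $\pi/4$ of the electric field. Function names and sample parameters are hypothetical.

```python
import cmath
import math

C_CM_PER_S = 2.99792458e10  # speed of light, cm/s


def fields_from_complex(z, t, h0, sigma, omega):
    """Real parts of the complex H_y and E_x at depth z and time t (Gaussian units)."""
    delta = C_CM_PER_S / math.sqrt(2.0 * math.pi * sigma * omega)
    h = h0 * math.exp(-z / delta) * cmath.exp(1j * (z / delta - omega * t))
    e = math.sqrt(omega / (8.0 * math.pi * sigma)) * (1.0 - 1.0j) * h
    return h.real, e.real


def fields_real_form(z, t, h0, sigma, omega):
    """Explicit real expressions: damped cosines, E lagging H in phase by pi/4."""
    delta = C_CM_PER_S / math.sqrt(2.0 * math.pi * sigma * omega)
    damping = math.exp(-z / delta)
    phase = z / delta - omega * t
    h_y = h0 * damping * math.cos(phase)
    e_x = h0 * math.sqrt(omega / (4.0 * math.pi * sigma)) * damping * math.cos(phase - math.pi / 4.0)
    return h_y, e_x


if __name__ == "__main__":
    sigma, omega = 5.0e17, 2.0 * math.pi * 1.0e3   # illustrative values
    for z in (0.0, 0.1, 0.5):
        print(z, fields_from_complex(z, 1.0e-4, 1.0, sigma, omega),
              fields_real_form(z, 1.0e-4, 1.0, sigma, omega))
```

The two functions agree to rounding error, which confirms that the factor $(1-i) = \sqrt{2}\,e^{-i\pi/4}$ is what produces both the amplitude $\sqrt{\omega/4\pi\sigma}$ and the phase lag.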

The relation (59.5) is valid in the present case for the field throughout the half-space
z > 0. In more general cases, a relation

$$\mathbf{E}_t = \zeta\,\mathbf{H}_t\times\mathbf{n} \tag{59.7}$$

holds at the surface of the conductor for the field components tangential to it. Since these
components are continuous, (59.7) applies to the field on either side of the surface. The
coefficient $\zeta$ is called the surface impedance of the conductor; we shall return in §87 to
more general aspects of this concept.† In the present case,

$$\zeta = \sqrt{\omega/8\pi\sigma}\,(1-i). \tag{59.8}$$

† In electrically anisotropic media, the surface impedance is a two-dimensional tensor: $E_\alpha = \zeta_{\alpha\beta}(\mathbf{H}_t\times\mathbf{n})_\beta$.

The existence of eddy currents implies a dissipation of the field energy, which appears as
Joule heat. The time average energy Q dissipated in the conductor per unit time is

$$Q = \int\overline{\mathbf{j}\cdot\mathbf{E}}\,dV = \int\sigma\overline{E^2}\,dV. \tag{59.9}$$

It can also be calculated as the mean field energy entering the conductor per unit time:

$$Q = \oint\bar{\mathbf{S}}\cdot d\mathbf{f} = (c/4\pi)\oint\overline{\mathbf{E}\times\mathbf{H}}\cdot d\mathbf{f},$$

the integral being taken over the surface of the conductor.†

We have already seen that, in the limiting case $\delta \gg l$, the amplitude of the magnetic field
inside the conductor is independent of the frequency, while that of the electric field is
proportional to $\omega$. The energy dissipation Q at low frequencies is therefore proportional to
$\omega^2$. When $\delta \ll l$, on the other hand, the magnetic and electric fields on the surface of the
conductor are given by formulae (59.3) and (59.5) with z = 0. The Poynting vector is
normal to the surface, and its mean value is $\bar{S} = (c/16\pi)\sqrt{\omega/2\pi\sigma}\,|\mathbf{H}_0|^2$, the variation of
$\mathbf{H}_0$ over the surface being given by the solution of the problem of the field outside a
superconductor of the same shape (cf. above). The energy dissipation is

$$Q = \frac{c}{16\pi}\sqrt{\frac{\omega}{2\pi\sigma}}\oint|\mathbf{H}_0|^2\,df. \tag{59.10}$$

Thus at high frequencies it is proportional to $\sqrt{\omega}$.

The energy dissipation can also be expressed in terms of the total magnetic moment $\mathscr{M}$
acquired by the conductor in the magnetic field. In a periodic field, the magnetic moment is
likewise a periodic function of time, with the same frequency. According to formula (32.4),
the rate of variation of the free energy is given by $-\mathscr{M}\cdot d\mathfrak{H}/dt$, where $\mathfrak{H}$ is a uniform
external field in which the conductor is placed. This expression does not immediately give
the required energy dissipation, because the energy of the body changes not only on
account of dissipation but also by the periodic movement of energy between the body and
the surrounding field. If we average over time, however, the latter contribution vanishes,
and the mean dissipation of energy per unit time is

$$Q = -\overline{\mathscr{M}\cdot d\mathfrak{H}/dt}. \tag{59.11}$$

If $\mathscr{M}$ and $\mathfrak{H}$ are written in complex form, then $d\mathfrak{H}/dt = -i\omega\mathfrak{H}$, and Q can be calculated as

$$Q = -\tfrac{1}{2}\operatorname{re}(i\omega\mathscr{M}\cdot\mathfrak{H}^*) = \tfrac{1}{2}\omega\operatorname{im}(\mathscr{M}\cdot\mathfrak{H}^*). \tag{59.12}$$

The components of the magnetic moment $\mathscr{M}$ are linear functions of the external field:

$$\mathscr{M}_i = V\alpha_{ik}\mathfrak{H}_k, \tag{59.13}$$

where the dimensionless coefficients $\alpha_{ik}(\omega)$ depend on the shape of the body and on its
orientation in the external field, but not on its volume V. In this formula we assume that $\mathscr{M}$
and $\mathfrak{H}$ are written in complex form, so that the $\alpha_{ik}$ are also in general complex. The tensor
$V\alpha_{ik}$ may be called the magnetic polarizability tensor for the body as a whole. This tensor is a
generalized susceptibility, and has the properties common to all such quantities. In
particular, it is symmetrical (see SP 1, §125):

$$\alpha_{ik} = \alpha_{ki}. \tag{59.14}$$

We can therefore write

$$\mathscr{M}_i\mathfrak{H}_i^* = V\alpha_{ik}\mathfrak{H}_i^*\mathfrak{H}_k = \tfrac{1}{2}V\alpha_{ik}(\mathfrak{H}_i^*\mathfrak{H}_k + \mathfrak{H}_i\mathfrak{H}_k^*) = V\alpha_{ik}\operatorname{re}(\mathfrak{H}_i\mathfrak{H}_k^*).$$

If we also write the complex quantities $\alpha_{ik}$ as $\alpha_{ik}' + i\alpha_{ik}''$, the energy dissipation (59.12)
becomes

$$Q = \tfrac{1}{2}V\omega\alpha_{ik}''\operatorname{re}(\mathfrak{H}_i\mathfrak{H}_k^*). \tag{59.15}$$

† If any two quantities a(t) and b(t) are written in complex form (proportional to $e^{-i\omega t}$), the real parts must of
course be taken before calculating their product. If, however, we are interested only in the time average value of
the product, it may be calculated as $\tfrac{1}{2}\operatorname{re}(ab^*)$. The terms containing $e^{\pm 2i\omega t}$ give zero on averaging, and so
$\overline{\operatorname{re}a\cdot\operatorname{re}b} = \tfrac{1}{4}\overline{(a + a^*)(b + b^*)} = \tfrac{1}{4}(ab^* + a^*b)$. In particular, $\bar{\mathbf{S}}$ can be calculated as the real part of the "complex Poynting
vector" $(c/8\pi)\,\mathbf{E}\times\mathbf{H}^*$.
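
The time-averaging rule on which the dissipation formulae rest, namely that the mean product of two real harmonic quantities equals $\tfrac{1}{2}\operatorname{re}(ab^*)$ in terms of their complex amplitudes, can be checked directly. The sketch below is an illustration with hypothetical function names; it averages the product of the real parts over one full period by uniform sampling and compares the result with the complex shortcut.

```python
import cmath
import math


def average_by_sampling(a0: complex, b0: complex, samples: int = 360) -> float:
    """Mean of Re(a) * Re(b) over one period, with a = a0*e^{-i w t}, b = b0*e^{-i w t}."""
    total = 0.0
    for i in range(samples):
        factor = cmath.exp(-2j * math.pi * i / samples)  # e^{-i w t} at equally spaced t
        total += (a0 * factor).real * (b0 * factor).real
    return total / samples


def average_by_complex_rule(a0: complex, b0: complex) -> float:
    """Shortcut: the time average equals (1/2) Re(a b*)."""
    return 0.5 * (a0 * b0.conjugate()).real


if __name__ == "__main__":
    a0, b0 = complex(1.5, -0.5), complex(-0.3, 2.0)   # arbitrary complex amplitudes
    print(average_by_sampling(a0, b0), average_by_complex_rule(a0, b0))
```

Uniform sampling over a whole period removes the terms oscillating at twice the frequency, so the two numbers agree to rounding error.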

Thus the energy dissipation is determined by the imaginary part of the magnetic
polarizability. We have already seen that Q is proportional to $\omega^2$ for low frequencies, and
to $\sqrt{\omega}$ for high frequencies. We can therefore conclude that the quantities $\alpha_{ik}''$ in these two
limiting cases are proportional to $\omega$ and to $1/\sqrt{\omega}$ respectively. Since they decrease both as
$\omega \to 0$ and $\omega \to \infty$, they must have a maximum in between.

The magnetic moment of a conductor in a variable magnetic field is due mainly to the
conduction currents set up in the body; it is not zero even if $\mu = 1$, when the moment in a
static field vanishes. The latter can be obtained from $\mathscr{M}(\omega)$ by taking the limit as
$\omega \to 0$. Hence it follows that the real part $\alpha_{ik}'$ of the polarizability tends to a constant limit
as $\omega \to 0$ (the limit being zero for $\mu = 1$), corresponding to magnetization in a static
field. In the limit $\omega \to \infty$, when the magnetic field does not penetrate into the body, $\alpha_{ik}'$
tends to a different constant limit, corresponding to the static magnetization of a
superconductor of the same shape.


PROBLEMS 


Problem 1. Determine the magnetic polarizability of an isotropic conducting sphere with radius a in a 
uniform periodic external field. 

Solution. The field H"' inside the sphere satisfies the equations A H'*'+ = 0, divH“’ = 0, where 

/( = (1 + i)/5. We write this field in the form H*‘' = curl A. where A satisfies the equation A A + fc^A = 0; since H 
is an axial vector, A is a polar vector. By symmetry, the only constant vector on which the required solution can 
depend is the external field §. ,We denote by / the spherically symmetrical solution, finite for r = 0, of the scalar 
equation A/ + kf = 0, namely / = (1/r) sin kr. Then the polar vector A, which satisfies the vector equation A A 
+ Ea = 0 and depends linearly on the constant axial vector can be written as A = p curl(/ §), where /? is a 
constant. Thus we have 

H“' = P curl curl(/S) 

= p(j + + 


where n is a unit vector in the direction of r; the second derivative /" has been eliminated by means of the 
equation A/+ kf = 0. 

The field H*"” outside the sphere satisfies the equations curl H*®’ = 0, div H*"’ = 0. We put H*'’ = — grad <l> 
+ §;()) satisfies the equation A(j> = 0 and vanishes at infinity. Since (p depends linearly on the constant vector §, 
we have <l> = grad (1/r), where V = 4rM^/3. Thus 

H**’= Fa grad [(§ • grad)(l/r)] + § 

= ^[3(u-S)u-S] + S. 


It is evident that Va^ is the magnetic moment of the sphere, so that Va is its magnetic polarizability (by 
symmetry, the tensor reduces to a scalar 

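On the surface of the sphere ($r = a$), all the components of $\mathbf{H}$ must be continuous. Equating separately the
components parallel and perpendicular to $\mathbf{n}$, we obtain two equations to determine $\alpha$ and $\beta$. The polarizability per
unit volume is found to be $\alpha = \alpha' + i\alpha''$, with

$$\alpha' = -\frac{3}{8\pi}\left[1 - \frac{3\delta}{2a}\,\frac{\sinh(2a/\delta) - \sin(2a/\delta)}{\cosh(2a/\delta) - \cos(2a/\delta)}\right],$$

$$\alpha'' = -\frac{9\delta^2}{16\pi a^2}\left[1 - \frac{a}{\delta}\,\frac{\sinh(2a/\delta) + \sin(2a/\delta)}{\cosh(2a/\delta) - \cos(2a/\delta)}\right].$$

In the limit of low frequencies ($\delta \gg a$),

$$\alpha' = -\frac{1}{105\pi}\left(\frac{a}{\delta}\right)^4 = -\frac{4\pi a^4\sigma^2\omega^2}{105c^4}, \qquad
\alpha'' = \frac{1}{20\pi}\left(\frac{a}{\delta}\right)^2 = \frac{a^2\sigma\omega}{10c^2}.$$

For high frequencies ($\delta \ll a$),

$$\alpha' = -\frac{3}{8\pi}\left[1 - \frac{3\delta}{2a}\right] = -\frac{3}{8\pi}\left[1 - \frac{3c}{2a\sqrt{2\pi\sigma\omega}}\right], \qquad
\alpha'' = \frac{9\delta}{16\pi a} = \frac{9c}{16\pi a\sqrt{2\pi\sigma\omega}}.$$

The limiting value $V\alpha' = -\tfrac{1}{2}a^3$ corresponds to the magnetic moment of a superconducting sphere; the
corresponding value of $\alpha''$ could be found from formula (59.10), using the expression (54.3) for the field at the
surface of a superconducting sphere.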
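The limiting forms above can be checked numerically. The sketch below is only an illustration: the compact complex expression $\alpha = -(3/8\pi)[1 - 3/(ka)^2 + (3/ka)\cot ka]$ is not quoted in the text; it is what eliminating $\beta$ from the two boundary conditions appears to give, and is treated here as an assumption to be compared with the explicit $\alpha'$, $\alpha''$. The function names and the variable `x` $= a/\delta$ are likewise introduced only for this example.

```python
import cmath
import math

def alpha_closed_form(x):
    """Polarizability per unit volume from the (assumed) compact complex form; x = a/delta."""
    ka = (1 + 1j) * x                      # k a, with k = (1 + i)/delta
    cot = cmath.cos(ka) / cmath.sin(ka)
    return -3 / (8 * math.pi) * (1 - 3 / ka**2 + 3 * cot / ka)

def alpha_real_imag(x):
    """The same quantity from the explicit real and imaginary parts given in the text."""
    d = math.cosh(2 * x) - math.cos(2 * x)
    re = -3 / (8 * math.pi) * (1 - 3 / (2 * x) * (math.sinh(2 * x) - math.sin(2 * x)) / d)
    im = -9 / (16 * math.pi * x**2) * (1 - x * (math.sinh(2 * x) + math.sin(2 * x)) / d)
    return complex(re, im)

if __name__ == "__main__":
    for x in (0.1, 1.0, 50.0):             # weak, intermediate and strong skin effect
        print(x, alpha_closed_form(x), alpha_real_imag(x))
```

For small `x` the output should approach $-(a/\delta)^4/105\pi + i(a/\delta)^2/20\pi$, and for large `x` the real part should approach $-(3/8\pi)(1 - 3\delta/2a)$.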

† The external field is assumed to be written in the complex form $\mathfrak{H} = \mathfrak{H}_0 e^{-i\omega t}$ with an arbitrary constant
complex vector $\mathfrak{H}_0$. The analysis therefore includes both the “linearly polarized” variable field whose direction is
constant, and the elliptically or circularly polarized fields which rotate in some plane.

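PROBLEM 2. The same as Problem 1, but for a conducting cylinder (with radius $a$) in a uniform periodic
magnetic field perpendicular to its axis.

SOLUTION. This problem is the “two-dimensional analogue” of Problem 1. In what follows all vector
operations are two-dimensional operations in a plane perpendicular to the axis of the cylinder, and $\mathbf{r}$ is the
position vector in that plane. The field inside the cylinder is of the form

$$\mathbf{H}^{(i)} = \beta\operatorname{curl}\operatorname{curl}(f\mathfrak{H})
= \beta\left(\frac{f'}{r} + k^2 f\right)\mathfrak{H} - \beta\left(\frac{2f'}{r} + k^2 f\right)(\mathbf{n}\cdot\mathfrak{H})\,\mathbf{n},$$

where $f = J_0(kr)$ is the symmetrical solution of the two-dimensional equation $\triangle f + k^2 f = 0$ which is finite for
$r = 0$. The field outside the cylinder is

$$\mathbf{H}^{(e)} = -2V\alpha\operatorname{grad}[(\mathfrak{H}\cdot\operatorname{grad})\log r] + \mathfrak{H}
= \frac{2V\alpha}{r^2}\,[2(\mathbf{n}\cdot\mathfrak{H})\mathbf{n} - \mathfrak{H}] + \mathfrak{H},$$

where $V = \pi a^2$. The magnetic moment per unit length of the cylinder is $V\alpha\mathfrak{H}$ (see §3, Problem 2). From the
condition $\mathbf{H}^{(i)} = \mathbf{H}^{(e)}$ for $r = a$, as in Problem 1, we obtain

$$\alpha = -\frac{1}{2\pi}\left[1 - \frac{2}{ka}\,\frac{J_1(ka)}{J_0(ka)}\right],$$

using the relation $J_0'(kr) = -kJ_1(kr)$.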

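For $\delta \gg a$, expanding the Bessel functions in powers of $ka$, we have

$$\alpha' = -\frac{1}{24\pi}\left(\frac{a}{\delta}\right)^4 = -\frac{\pi a^4\sigma^2\omega^2}{6c^4}, \qquad
\alpha'' = \frac{1}{8\pi}\left(\frac{a}{\delta}\right)^2 = \frac{a^2\sigma\omega}{4c^2}.$$

For $\delta \ll a$ we use the asymptotic expressions for the Bessel functions, obtaining

$$\alpha' = -\frac{1}{2\pi}\left(1 - \frac{\delta}{a}\right) = -\frac{1}{2\pi}\left(1 - \frac{c}{a\sqrt{2\pi\sigma\omega}}\right), \qquad
\alpha'' = \frac{1}{2\pi}\,\frac{\delta}{a} = \frac{c}{2\pi a\sqrt{2\pi\sigma\omega}}.$$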
PROBLEM 3. The same as Problem 2, but for a magnetic field parallel to the axis of the cylinder.

SOLUTION. The magnetic field is everywhere parallel to the axis of the cylinder. Outside the cylinder
$\mathbf{H}^{(e)} = \mathfrak{H}$, and inside it $\mathbf{H}^{(i)} = f\mathfrak{H}$, where $f$ is the symmetrical solution of the two-dimensional equation
$\triangle f + k^2 f = 0$ which is 1 for $r = a$ and finite for $r = 0$: $f = J_0(kr)/J_0(ka)$. The eddy currents in the cylinder are
azimuthal (i.e. the only non-zero component is $j_\phi$), and are given in terms of the field $H = H_z$ by $4\pi j_\phi/c = -\partial H/\partial r$.
The magnetic moment generated per unit length of the cylinder by the conduction currents is

$$\mathscr{M} = \pi a^2\alpha\mathfrak{H} = \frac{1}{2c}\int j_\phi r\,dV = -\frac{1}{4}\int_0^a \frac{\partial H}{\partial r}\,r^2\,dr;$$

it is parallel to the axis. Evaluating the integral, we have

$$\alpha = -\frac{1}{4\pi}\left[1 - \frac{2J_1(ka)}{kaJ_0(ka)}\right].$$

Thus the longitudinal polarizability of the cylinder is half the transverse polarizability derived in Problem 2.
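The cylinder results of Problems 2 and 3 can be evaluated for any $a/\delta$ once the Bessel functions of complex argument are available. The following is a minimal sketch: `bessel_j` is a hypothetical helper that sums the standard power series for $J_n$ (adequate only for the moderate arguments used here), and `x` $= a/\delta$ is a name introduced for illustration.

```python
import math

def bessel_j(n, z, terms=80):
    """Power series for J_n(z), integer n >= 0; fine for moderate |z| (say |z| < 15)."""
    term = (z / 2) ** n / math.factorial(n)
    total = term
    for m in range(1, terms):
        term *= -(z / 2) ** 2 / (m * (m + n))   # ratio of successive series terms
        total += term
    return total

def alpha_transverse(x):
    """Problem 2: field perpendicular to the axis; x = a/delta."""
    ka = (1 + 1j) * x
    return -1 / (2 * math.pi) * (1 - 2 * bessel_j(1, ka) / (ka * bessel_j(0, ka)))

def alpha_longitudinal(x):
    """Problem 3: field parallel to the axis, half the transverse value."""
    return 0.5 * alpha_transverse(x)

if __name__ == "__main__":
    for x in (0.1, 1.0, 8.0):
        print(x, alpha_transverse(x), alpha_longitudinal(x))
```

At `x = 0.1` the transverse value should be close to $-(a/\delta)^4/24\pi + i(a/\delta)^2/8\pi$; at `x = 8` the real part should be close to $-(1/2\pi)(1 - \delta/a)$, while the imaginary part still differs from its leading asymptotic value by a few per cent.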

PROBLEM 4. Determine the least decay coefficient for the magnetic field in a conducting sphere.

SOLUTION. The solutions of equations (58.10) for a sphere include functions with various symmetries. The most
symmetrical solution would be one defined by an arbitrary constant scalar. This solution is inapplicable,
however, for the following reason: it would be spherically symmetrical ($\mathbf{H} = H(r)\mathbf{n}$) and would have to be
$H = \text{constant}/r^2$ in order to satisfy the equation $\operatorname{div}\mathbf{H} = (1/r^2)\,\partial(r^2H)/\partial r = 0$, which is valid both outside and inside
the sphere; but this function is not finite at the centre of the sphere.

The least value of $\gamma$ corresponds to one of the solutions defined by an arbitrary constant vector. The form of
these solutions is evidently the same as has been found in Problem 1, the only difference being that the constant
term $\mathfrak{H}$ in $\mathbf{H}^{(e)}$ must be omitted so as to have $\mathbf{H} = 0$ at infinity. The quantity $k$ is now real ($= \sqrt{4\pi\sigma\gamma/c^2}$),
and the vector $\mathfrak{H}$ is an arbitrary constant vector. From the boundary condition $\mathbf{H}^{(i)} = \mathbf{H}^{(e)}$ at $r = a$ we obtain two
equations, and on eliminating $\alpha$ and $\beta$ we find $\sin ka = 0$. The smallest non-zero root of this equation is $ka = \pi$,
and so the smallest value of $\gamma$ is $\gamma_1 = \pi c^2/4\sigma a^2$.

PROBLEM 5. The plane surface of a uniaxial metal crystal is cut so that the normal is at an angle $\theta$ to the
principal axis of the crystal. Determine the surface impedance, taking account of the thermoelectric
effect (M. I. Kaganov and V. M. Tsukernik, 1958).

SOLUTION. We take the crystal surface as the $xy$-plane, and the $z$-axis along the inward normal to the surface;
let the principal axis of the crystal be in the $xz$-plane at an angle $\theta$ to the $z$-axis. Let the magnetic field
be in the $y$-direction. Then it is in this direction everywhere within the metal also.
Since all quantities depend only on $z$ (and on the time as $e^{-i\omega t}$), we find that Maxwell’s equations (58.1), (58.4)
become

$$H_y' = -4\pi j_x/c, \qquad j_y = j_z = 0, \qquad E_x' = i\omega H_y/c, \tag{1}$$

where the prime denotes differentiation with respect to $z$. To take account of the
thermoelectric effect we must include also the equation of thermal conduction, $C\,\partial T/\partial t + \operatorname{div}\mathbf{q} = 0$, or

$$-i\omega C\tau + q_z' = 0, \tag{2}$$

where $\tau$ is the variable part of the temperature, $T = T_0 + \tau$, and $C$ is the specific heat of the metal per unit
volume; $\mathbf{j}$ and $\mathbf{q}$ are related to the field $\mathbf{E}$ and the temperature gradient by (26.12).

O:. n .. an “’•‘■symmetrical. Weshallsupposethesymmetryofthecrystaltobesuchthata.n 

■ nEn.™ nf .E. ^ gjjjJ 2 


'“tensorspij = Liandic,-. 

Metrical also. With the above choice of the x, 


1^6-Epj^c 


$$\rho_{xz} = (\rho_\parallel - \rho_\perp)\sin\theta\cos\theta,$$

S vafid 


Ti (26.12), 

^x = Pxxix + Qx = Ta^xJx - 


'mination of from equations (l)-(3) gives 


EJ' + k^{E^ + Ej) = 0, 
n + a) Er" + k^[(b-a) Er~aE^] = 0, 

t' and the parameters 

k^ = 47tico/c^p„, a = ra„Vp„h-„, b = c^Cp^J4nK^ 
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The solution of these two equations for the half-space z > 0 occupied by the metal is 

Er= - (1 - - (1 - ki^lk^)Be''^^^, 


ri + b±v'{(l-fc)'-4ufc}t 

1 2n+a) J’ 


the imaginary parts of Ic, and k^ must be positive. r *u * ~ 

The relation between the coefficients A and B is found from the boundary conditions for the temperature, 
- ' • ■ -^ required to be continuous at the 


formula (4) gives the field E, in the metal; the normal component of E 
surface of the conductor. According to the definition (59.7a). the surface impedance is 


c k^A + k2B’ 

c,. = o. 

The final expressions for the impedance when $a \ll 1$ (as is in fact the case for ordinary metals) are as follows in two
cases. With an isothermal boundary ($\tau = 0$),

$$\zeta_{xx}^{(\mathrm{is})} = \zeta_0\,[1 + a/2(1 + \sqrt{b})^2];$$

with an adiabatic boundary ($q_z = 0$),

$$\zeta_{xx}^{(\mathrm{ad})} = \zeta_0\,[1 + a(1 + 2\sqrt{b})/2(1 + \sqrt{b})^2],$$

where $\zeta_0 = (\omega\rho_{xx}/8\pi)^{1/2}(1 - i)$ is the impedance when the thermoelectric effect is neglected. The parameter $a$, and
therefore the correction to the impedance, are zero when $\theta = \tfrac{1}{2}\pi$ or $\theta = 0$, i.e. when the principal axis is either in
the plane of the surface or perpendicular to it.

If the field $\mathbf{H}$ is in the $x$-direction, then $E_x = 0$, $j_x = j_z = 0$, and there is no temperature gradient, so that
$\zeta_{yy} = \zeta_0$.


§60. The skin effect 

Let us consider the distribution of current density over the cross-section of a conductor 
in which a non-zero and variable total current is flowing. From the results of §59 we should 
expect that, as the frequency increases, the current will tend to be concentrated near the 
surface of the conductor. This phenomenon is called the skin effect.†

The exact solution of the problem of the skin effect depends, in general, not only on the 
shape of the conductor but also on the manner of excitation of the current in it, i.e. the
nature of the variable external magnetic field which induces the current. An important 
particular case, however, is that where the current flows in a wire of thickness small 
compared with its length; here the current distribution is independent of the manner of 
excitation. 

In calculating the current distribution over the cross-section of a thin wire, the latter may 
be regarded as straight. The electric field is parallel to the axis of the wire, and the magnetic 
field vector H is in a plane perpendicular to the axis. 

Let us consider a wire of circular cross-section. This is a particularly simple case, because
the form of the field outside the wire is immediately obvious. By symmetry, E = constant
over the surface of the wire (though the value of the constant varies with time). With this
boundary condition, the only solution of the equations div E = 0, curl E = 0 outside the
wire is E = constant. Similarly, the magnetic field outside the wire must be the same as it
would be outside a wire carrying a constant current equal to the instantaneous value of the
variable current.

† The term skin effect is used, more generally, in all cases where a variable electromagnetic field (and therefore
the currents due to it) penetrates only to a relatively small depth in a conductor.

Inside the wire, the electric field satisfies the equation $\triangle\mathbf{E} = (4\pi\sigma/c^2)\,\partial\mathbf{E}/\partial t$, which is the
same as equation (58.6) for $\mathbf{H}$; it is obtained by eliminating $\mathbf{H}$ from (58.1) and (58.4), just as
(58.6) was obtained by eliminating $\mathbf{E}$. In cylindrical polar coordinates, with the $z$-axis along
the axis of the wire, the only non-zero component of $\mathbf{E}$ is $E_z$, which depends only on $r$. For a
periodic field with frequency $\omega$ we have

$$\frac{1}{r}\frac{\partial}{\partial r}\left(r\frac{\partial E}{\partial r}\right) + k^2E = 0, \qquad k = \frac{\sqrt{2i}}{\delta} = \frac{1+i}{\delta}, \tag{60.1}$$

where $\delta$ is the penetration depth (59.4). The solution of this equation which remains finite
at $r = 0$ is

$$E = E_z = \text{constant} \times J_0(kr)\,e^{-i\omega t}, \tag{60.2}$$

where $J_0$ is the Bessel function. The current density $j = \sigma E$ is similarly distributed.

The magnetic field $H_\phi = H$ is found from the electric field by equation (58.1):

$$i\omega H_\phi/c = (\operatorname{curl}\mathbf{E})_\phi = -\partial E_z/\partial r. \tag{60.3}$$

Since $J_0'(u) = -J_1(u)$, we obtain

$$H = H_\phi = -\text{constant} \times i\sqrt{4\pi\sigma i/\omega}\;J_1(kr)\,e^{-i\omega t}, \tag{60.4}$$

the constant being the same as in (60.2); it is easily determined from the condition that
$H = 2J/ca$ on the surface of the wire, $a$ being the radius of the wire and $J$ the total current in
it.

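In the limiting case of low frequencies ($a/\delta \ll 1$) we can take just the first few terms of the
expansions of the Bessel functions at every point in the cross-section:

$$E_z = \text{constant} \times \left[1 - \frac{i}{2}\left(\frac{r}{\delta}\right)^2 - \frac{1}{16}\left(\frac{r}{\delta}\right)^4\right]e^{-i\omega t},$$

$$H_\phi = \text{constant} \times \frac{2\pi\sigma r}{c}\left[1 - \frac{i}{4}\left(\frac{r}{\delta}\right)^2 - \frac{1}{48}\left(\frac{r}{\delta}\right)^4\right]e^{-i\omega t}. \tag{60.5}$$

The amplitude of $E$, and therefore that of the current density, increase as $1 + (r/2\delta)^4$ with
increasing distance $r$ from the axis.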

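In the opposite limiting case of high frequencies ($a/\delta \gg 1$) we can use the asymptotic
formula

$$J_0[u\sqrt{2i}] \approx \text{constant} \times u^{-1/2}e^{(1-i)u}, \tag{60.6}$$

which is valid for large values of the argument of the Bessel function, over most of the
cross-section. Retaining only the rapidly varying exponential factor, we have

$$E_z = \text{constant} \times e^{-(1-i)(a-r)/\delta}\,e^{-i\omega t},$$

$$H_\phi = \text{constant} \times (1+i)\sqrt{2\pi\sigma/\omega}\;e^{-(1-i)(a-r)/\delta}\,e^{-i\omega t}. \tag{60.7}$$

These formulae are, of course, the same as (59.3)–(59.5), which are valid near the surface of
a conductor of any shape when the skin effect is strong.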
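The two limits of the current distribution can be illustrated numerically from (60.2). The sketch below is not part of the original derivation: `bessel_j0` is a hypothetical helper summing the power series of $J_0$ for complex argument, and radii are measured in units of $\delta$.

```python
import math

def bessel_j0(z, terms=80):
    """Power series for J_0(z); adequate for the moderate complex arguments used here."""
    term = 1.0 + 0j
    total = term
    for m in range(1, terms):
        term *= -(z / 2) ** 2 / (m * m)
        total += term
    return total

def current_profile(r_over_delta, a_over_delta):
    """|j(r)| / |j(a)| in a round wire, from E_z ~ J_0(kr) with k = (1 + i)/delta."""
    k_delta = 1 + 1j
    return abs(bessel_j0(k_delta * r_over_delta)) / abs(bessel_j0(k_delta * a_over_delta))

if __name__ == "__main__":
    # Weak skin effect: surface-to-axis ratio compared with 1 + (a / 2 delta)^4.
    print(1 / current_profile(0.0, 0.5), 1 + 0.25**4)
    # Strong skin effect: one penetration depth below the surface of a wire with a = 8 delta.
    print(current_profile(7.0, 8.0), math.sqrt(8 / 7) * math.exp(-1))
```

In the strong-skin case the comparison value keeps the factor $u^{-1/2}$ of (60.6) as well as the exponential; dropping it, as in (60.7), changes the result by a few per cent for these parameters.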
In the general case of a wire whose cross-section is not circular, the exact calculation of
the skin effect is considerably more involved, since the fields inside and outside the wire 
must be determined simultaneously. Only in the limiting case of strong skin effect is the 
problem again simplified, because the field outside the wire may then be determined as the 
static field outside a superconductor of the same shape. 


§61. The complex resistance 

If the frequency of the variable current is low, the instantaneous current $J(t)$ in a linear
circuit is determined by the instantaneous e.m.f. $\mathscr{E}$:

$$\mathscr{E}(t) = RJ(t), \tag{61.1}$$


where R is the resistance of the wire to a constant current. 

There is no reason, however, to expect a direct relation between the values of $\mathscr{E}$ and $J$ at
the same instant for all frequencies. We can say only that the value of $J(t)$ must be a linear
function of the values of $\mathscr{E}(t)$ at all previous instants. This relation may be symbolically
written as $J = \hat{Z}^{-1}\mathscr{E}$ or, conversely,

$$\mathscr{E} = \hat{Z}J, \tag{61.2}$$

where $\hat{Z}$ is some linear operator.† If the functions $\mathscr{E}(t)$ and $J(t)$ are expanded as Fourier
integrals, then for each monochromatic component (depending on time through a factor
$e^{-i\omega t}$), the effect of the linear operator $\hat{Z}$ is simply multiplication by a quantity $Z$ which
depends on the frequency:

$$\mathscr{E} = Z(\omega)J. \tag{61.3}$$

The function $Z(\omega)$ is in general complex. It is called the complex resistance or impedance of
the conductor.

It is evident from a comparison of (61.3) and (61.1) that the ordinary resistance $R$ is the
zero-order term in an expansion of the function $Z(\omega)$ in powers of $\omega$. To find the next term,
we must take account both of $R$ and of the self-inductance $L$ of the conductor.‡

Let us consider a linear circuit containing a variable e.m.f. $\mathscr{E}(t)$. By the definition of $\mathscr{E}$, the
work done per unit time by the electric field on the charges moving in the wire is $\mathscr{E}J$. This
work goes partly into Joule heat and partly to change the energy of the magnetic field of the
current. By the definition of $R$ and $L$, the Joule heat evolved in the wire per unit time is $RJ^2$,
and the magnetic energy of the current is $LJ^2/2c^2$. The law of conservation of energy
therefore gives the equation

$$\mathscr{E}J = RJ^2 + \frac{d}{dt}\left(\frac{LJ^2}{2c^2}\right) = RJ^2 + \frac{1}{c^2}LJ\frac{dJ}{dt},$$

or

$$\mathscr{E} = RJ + \frac{1}{c^2}L\frac{dJ}{dt}. \tag{61.4}$$

† We shall not pause here to discuss the general properties of this operator, since they are entirely analogous to
those of the operator $\hat{\varepsilon}$, which will be examined in detail in §§77 and 82.
‡ Here, and in what follows, $R$ and $L$ denote the values for a steady current.
In order to use the quadratic expressions $\mathscr{E}J$ and $J^2$ we must write $\mathscr{E}$ and $J$ as real
functions. Having derived the linear equation (61.4), however, we can take complex
monochromatic forms: $\mathscr{E} = \mathscr{E}_0e^{-i\omega t}$, $J = J_0e^{-i\omega t}$. Then equation (61.4) gives the algebraic
relation $\mathscr{E} = ZJ$ with

$$Z = R - \frac{i}{c^2}\,\omega L. \tag{61.5}$$

Taking the real part in $J = \mathscr{E}/Z$, we have

$$J(t) = \frac{\mathscr{E}_0}{\sqrt{R^2 + \omega^2L^2/c^4}}\cos(\omega t - \phi), \qquad \tan\phi = \omega L/c^2R, \tag{61.6}$$

which determines the amplitude of the current and the phase difference between the
current and the e.m.f.
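As a quick numerical illustration of (61.5) and (61.6), the following sketch compares the real part of $\mathscr{E}/Z$ with the amplitude and phase form. The function names and the numerical values (Gaussian units, chosen arbitrarily) are assumptions made for the example only.

```python
import cmath
import math

C_LIGHT = 2.998e10   # velocity of light in cm/s (Gaussian units, as in the text)

def impedance_rl(omega, R, L, c=C_LIGHT):
    """Z = R - i omega L / c^2, equation (61.5)."""
    return R - 1j * omega * L / c**2

def current_from_complex(t, E0, omega, R, L, c=C_LIGHT):
    """Re[E0 exp(-i omega t) / Z]: the real current driven by E0 cos(omega t)."""
    return (E0 * cmath.exp(-1j * omega * t) / impedance_rl(omega, R, L, c)).real

def current_from_amplitude_phase(t, E0, omega, R, L, c=C_LIGHT):
    """The same current written as in (61.6)."""
    x = omega * L / c**2
    return E0 / math.sqrt(R**2 + x**2) * math.cos(omega * t - math.atan2(x, R))

if __name__ == "__main__":
    args = dict(E0=1.0, omega=1.0e4, R=1.0e-12, L=1.0e5)   # illustrative values only
    for t in (0.0, 1.0e-4, 2.5e-4):
        print(t, current_from_complex(t, **args), current_from_amplitude_phase(t, **args))
```

The two columns should agree to rounding error; the phase $\phi$ is positive, so the current lags behind the e.m.f.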

The real part of the expression (61.5) is the resistance $R$, which determines the energy
dissipation in the circuit. It is easy to see that, whatever the function $Z(\omega)$, a similar relation
holds between re $Z$ and the energy dissipation for a given current. On averaging with
respect to time the power $\mathscr{E}J$ required to maintain the periodic current in the circuit, we
obtain the part of this power which continually makes good the dissipative losses. The
energy dissipation in the circuit per unit time is therefore $Q = \tfrac{1}{2}\,\mathrm{re}\,(\mathscr{E}J^*)$, where $\mathscr{E}$ and $J$ are
expressed in complex form; see the last footnote to §59. Substituting $\mathscr{E} = ZJ$ and denoting
the real and imaginary parts of $Z$ by $Z'$ and $Z''$ respectively,†

$$Z = Z' + iZ'', \tag{61.7}$$

we obtain $Q = \tfrac{1}{2}Z'|J|^2$ or, in terms of the real function $J(t)$,

$$Q = Z'(\omega)\,\overline{J^2}, \tag{61.8}$$

which gives the required relation.

It may be noted that, since $Q$ is necessarily positive, $Z'$ is also positive:

$$Z' > 0. \tag{61.9}$$

We may calculate $Z(\omega)$ for a wire of circular cross-section for any frequency which
satisfies the quasi-steady condition, i.e. without neglecting the skin effect. To do so, we
again use the law of conservation of energy, but in a different form. We divide the power $\mathscr{E}J$
(where $\mathscr{E}$ and $J$ are real) into two parts, one being the change in the magnetic field energy
outside the wire, and the other the total energy consumed inside the wire (both in changing
the field and in evolution of heat). The second part can be calculated as the total energy flux
entering the conductor through its surface per unit time. Thus we have

$$\mathscr{E}J = \frac{d}{dt}\left(\frac{L_eJ^2}{2c^2}\right) + \frac{cEH}{4\pi}\,2\pi al = \frac{1}{c^2}L_eJ\frac{dJ}{dt} + \tfrac{1}{2}cEHal,$$

where $L_e$ is the external part of the self-inductance of the wire, $E$ and $H$ the electric and
magnetic fields at its surface, $a$ its radius, and $l$ its length. The field $H$ is related to the current
by $H = 2J/ca$. Hence, dividing the above equation by $J$, we have

$$\mathscr{E} = \frac{1}{c^2}L_e\frac{dJ}{dt} + El.$$

† Sometimes called the resistance and reactance (in Russian: active and reactive resistances).

This is a linear equation, and hence we can use complex quantities. Then

$$\mathscr{E} = ZJ = -\frac{i\omega L_e}{c^2}J + El,$$

whence

$$Z = -\frac{i\omega}{c^2}L_e + \frac{El}{J} = -\frac{i\omega}{c^2}L_e + \frac{2El}{caH}. \tag{61.10}$$

For general frequencies, $E$ and $H$ are given by (60.2) and (60.4), and we have

$$Z = -\frac{i\omega}{c^2}L_e + \tfrac{1}{2}Rka\,\frac{J_0(ka)}{J_1(ka)}, \tag{61.11}$$


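where $R = l/\pi a^2\sigma$. When the skin effect is weak, we use the expansions (60.5); taking terms
as far as $(a/\delta)^4$ and separating the real part, we find

$$Z' = R\left[1 + \frac{1}{48}\left(\frac{a}{\delta}\right)^4\right] = R\left[1 + \frac{1}{12}\left(\frac{\pi\sigma\omega a^2}{c^2}\right)^2\right]. \tag{61.11a}$$

In the opposite case of a strong skin effect we use the expressions (60.7), obtaining

$$Z' = Ra/2\delta = (l/ca)\sqrt{\omega/2\pi\sigma},$$

$$Z'' = -\frac{\omega}{c^2}\left[L_e + \frac{l\delta}{a}\right] = -\frac{\omega}{c^2}\left[L_e + \frac{lc}{a\sqrt{2\pi\sigma\omega}}\right]. \tag{61.12}$$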
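The internal part of (61.11), $\tfrac{1}{2}Rka\,J_0(ka)/J_1(ka)$, interpolates between the weak and strong skin-effect limits. The sketch below evaluates it in units of $R$ as a function of `x` $= a/\delta$; `bessel_j` is a hypothetical power-series helper introduced for this example, not a library routine, and it is suitable only for moderate arguments.

```python
import math

def bessel_j(n, z, terms=80):
    """Power series for J_n(z), integer n >= 0; fine for moderate |z|."""
    term = (z / 2) ** n / math.factorial(n)
    total = term
    for m in range(1, terms):
        term *= -(z / 2) ** 2 / (m * (m + n))
        total += term
    return total

def internal_impedance_ratio(x):
    """(Z + i omega L_e / c^2) / R = (ka/2) J_0(ka) / J_1(ka), with x = a/delta."""
    ka = (1 + 1j) * x
    return 0.5 * ka * bessel_j(0, ka) / bessel_j(1, ka)

if __name__ == "__main__":
    for x in (0.2, 1.0, 3.0, 8.0):
        z = internal_impedance_ratio(x)
        print(f"a/delta = {x}: Z'/R = {z.real:.6f}, internal Z''/R = {z.imag:.6f}")
```

For small `x` the real part should follow $1 + (a/\delta)^4/48$ and the imaginary part $-(a/\delta)^2/4$, which corresponds to the internal self-inductance $l/2$; for large `x` both parts approach $a/2\delta$ in magnitude, although at `x = 8` the real part is still several per cent above that leading value.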

It is seen from (61.11a) that we can put $Z' = R$ if $(\pi\sigma\omega a^2/c^2)^2 \ll 12$. We also have
$|Z''|/Z' = \omega L/c^2R = (\pi\sigma\omega a^2/c^2)\cdot 2\log(l/a)$, where $L$ is given by (34.1). Comparing this with
the inequality just given, we see that the range of frequencies in which the expression (61.5)
can be used to take the self-inductance into account depends on the ratio $l/a$ and is fairly
narrow.

In practice, however, the most important case is that in which the self-inductance of the
circuit is due mainly to coils in it, whose self-inductance is large compared with that of
uncoiled wire (see §34). In such circuits formula (61.5) (i.e. equation (61.4) with constant $R$
and $L$) can be used over a fairly wide range of frequencies.

Let us consider a circuit in a variable external magnetic field $\mathbf{H}_e$, which may be produced
in any manner. We denote by $\mathbf{E}_e$ the electric field which would be induced by the variable
field $\mathbf{H}_e$ in the absence of conductors. Both $\mathbf{E}_e$ and $\mathbf{H}_e$ vary only very slightly over the
thickness of a thin wire (unlike the field of the currents in the wire). We can therefore
discuss the circulation of $\mathbf{E}_e$ round the current circuit without specifying the exact position
of the contour of integration in the wire. This circulation is just the e.m.f. $\mathscr{E}$ induced in the
circuit by the variable external magnetic field. By the integral form of Maxwell’s equations
we have

$$\mathscr{E} = \oint\mathbf{E}_e\cdot d\mathbf{l} = -\frac{1}{c}\frac{d}{dt}\int\mathbf{H}_e\cdot d\mathbf{f} = -\frac{1}{c}\frac{d\Phi_e}{dt}, \tag{61.13}$$

where $\Phi_e$ is the flux of the external field through the circuit. Substituting this expression in
equation (61.4), we obtain

$$RJ + \frac{1}{c^2}L\frac{dJ}{dt} = -\frac{1}{c}\frac{d\Phi_e}{dt}.$$
Taking the self-inductance term to the right-hand side, we have

$$RJ = -\frac{1}{c}\frac{d\Phi_e}{dt} - \frac{1}{c^2}L\frac{dJ}{dt} = -\frac{1}{c}\frac{d\Phi}{dt}, \tag{61.14}$$

where $\Phi = \Phi_e + LJ/c$ is the total magnetic flux from the external magnetic field and the
field of the current. In this form the equation gives Ohm’s law for the whole circuit, i.e. the
equality of $RJ$ to the total e.m.f. in the circuit.

The formulation of equation (61.14) as expressing Ohm’s law makes possible a
generalization of it to the case where the shape of the circuit also varies with time. The
self-inductance $L$ is then a function of time, and (61.14) becomes

$$RJ + \frac{1}{c^2}\frac{d}{dt}(LJ) = -\frac{1}{c}\frac{d\Phi_e}{dt}. \tag{61.15}$$

In deriving this from the law of conservation of energy we should have to take into account
also the work done in deforming the conductor.

If there are several circuits in proximity, carrying currents $J_a$, then for each of them $\Phi_e$ in
equation (61.14) is the sum of the magnetic fluxes due to all the other circuits (and to the
external field, if any). The magnetic flux through the $a$th circuit due to the current $J_b$ is
$L_{ab}J_b/c$, where $L_{ab}$ is the mutual inductance of the two circuits. We therefore have the
following set of equations for the variable currents in the circuits:

$$R_aJ_a + \frac{1}{c^2}\sum_b L_{ab}\frac{dJ_b}{dt} = \mathscr{E}_a. \tag{61.16}$$

The sum over $b$ includes the self-inductance term ($b = a$), and $\mathscr{E}_a$ is the e.m.f. produced in
the $a$th circuit by sources external to the system of currents considered.

For monochromatic periodic currents, the system of differential equations (61.16)
becomes a set of algebraic equations:

$$\sum_b Z_{ab}J_b = \mathscr{E}_a, \tag{61.17}$$

where the quantities

$$Z_{ab} = \delta_{ab}R_a - \frac{i\omega}{c^2}L_{ab} \tag{61.18}$$

form the impedance matrix. Like (61.5), the expressions (61.18) represent the first terms in
an expansion of the functions $Z_{ab}(\omega)$ in powers of the frequency.
It should be noted that, in this approximation, the circuits have no mutual effect on the
real parts of their impedances. Such an effect arises because the magnetic field of the
variable current in one conductor generates eddy currents, and therefore an additional
dissipation of energy, in another conductor. For linear conductors this effect is negligible,
but it may become important if extended conductors are located near them.

Finally, let us consider how the equations of variable currents in linear circuits obtained
in this section are related to the general equations of a variable magnetic field in arbitrary
conductors. We shall take the simple example of the current set up in a circuit when a
constant e.m.f. $\mathscr{E}_0$ is removed at time $t = 0$. From equation (61.4) we have†

$$J = \mathscr{E}_0/R \quad\text{for } t < 0, \qquad J = (\mathscr{E}_0/R)\,e^{-c^2Rt/L} \quad\text{for } t > 0. \tag{61.19}$$

We see that, after the removal of the e.m.f., the current decays exponentially with time, the
decrement being

$$\gamma = c^2R/L. \tag{61.20}$$

If the problem is exactly formulated, this $\gamma$ is the smallest of the $\gamma_m$ obtained by solving the
exact equation (58.10) for the conductor in question. Among the $\gamma_m$ for a linear conductor
there is one, the smallest, which is less than the others by a factor of the order of $\log(l/a)$,
and this is (61.20).
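For completeness, the step from (61.4) to (61.19) can be written out as follows. This is only a sketch of the elementary integration; $\mathscr{E}_0$ denotes the e.m.f. acting before $t = 0$, and the continuity of $J$ at $t = 0$ is taken as the initial condition, since the magnetic energy $LJ^2/2c^2$ cannot change discontinuously.

```latex
% For t > 0 the e.m.f. vanishes, so (61.4) reduces to a homogeneous first-order equation:
\frac{1}{c^2} L \frac{dJ}{dt} + RJ = 0
\quad\Longrightarrow\quad
\frac{dJ}{J} = -\frac{c^2 R}{L}\,dt
\quad\Longrightarrow\quad
J(t) = J(0)\,e^{-c^2 R t/L}.
% For t < 0 the current is steady, J = \mathscr{E}_0/R; continuity at t = 0 gives
J(0) = \frac{\mathscr{E}_0}{R},
\qquad
J(t) = \frac{\mathscr{E}_0}{R}\,e^{-\gamma t},
\qquad
\gamma = \frac{c^2 R}{L}.
```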


§62. Capacitance in a quasi-steady current circuit 

A variable current, unlike a steady one, can flow in an open circuit as well as in a closed 
one. Let us consider a linear circuit whose ends are connected to the plates of a capacitor, 
which are at a small distance apart. When a variable current flows in the circuit, the 
capacitor plates will be periodically charged and discharged, thereby acting as sources and 
sinks of current in the open circuit. 

Since the distance between the capacitor plates is small, the magnetic energy of the
current can again be taken as $LJ^2/2c^2$, where $L$ is the self-inductance of the closed circuit
which would be obtained by joining the capacitor plates by a short piece of wire.‡ The only
change in equation (61.4) is then to add to the voltage drop $RJ$ across the resistance the
potential difference $e/C$ between the capacitor plates, where $C$ is the capacitance and $\pm e(t)$
the charges on the plates. We obtain

$$\mathscr{E} = RJ + \frac{1}{c^2}L\frac{dJ}{dt} + \frac{e}{C}.$$

The current $J$ is equal to the rate of decrease and increase of the charges on the two plates,
$J = de/dt$. Expressing $J$ in terms of $e$, we have

$$\frac{1}{c^2}L\frac{d^2e}{dt^2} + R\frac{de}{dt} + \frac{e}{C} = \mathscr{E}. \tag{62.1}$$

This is the required equation for a variable current in a circuit with a capacitance.

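If $\mathscr{E}$ is a periodic function of time having frequency $\omega$, then equation (62.1) reduces to an
algebraic relation between $\mathscr{E}$ and the charge $e$, or between $\mathscr{E}$ and the current $J = -i\omega e$. We
have, in fact, $JZ = \mathscr{E}$, where the impedance $Z$ is defined by

$$Z = R - i\left(\frac{\omega L}{c^2} - \frac{1}{\omega C}\right). \tag{62.2}$$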


t strictly speaking, these formulae are invalid for very small t, when 


time, however, the current J ca 
times with sufficient accuracy. 

t In the present section we neglect 


change significantly, and so formula (61.19) gives the current at st 
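Taking real parts in the relation $J = \mathscr{E}/Z$, we obtain

$$J(t) = \frac{\mathscr{E}_0\cos(\omega t - \phi)}{\sqrt{R^2 + \left(\dfrac{\omega L}{c^2} - \dfrac{1}{\omega C}\right)^2}}, \qquad
\tan\phi = \left(\frac{\omega L}{c^2} - \frac{1}{\omega C}\right)\frac{1}{R}, \tag{62.3}$$

which give the current in a circuit to which an external e.m.f. $\mathscr{E} = \mathscr{E}_0\cos\omega t$ is applied.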

If $\mathscr{E} = 0$, the current in the circuit consists of free electric oscillations. The (complex)
frequency of these oscillations is given by $Z = 0$, whence

$$\omega = -\frac{iRc^2}{2L} \pm \sqrt{\frac{c^2}{LC} - \left(\frac{Rc^2}{2L}\right)^2}. \tag{62.4}$$

We may have either periodic oscillations damped with decrement $Rc^2/2L$ or
an aperiodically damped discharge, depending on the sign of the radicand. In the limit
as $R \to 0$ we have undamped oscillations whose frequency is given by Thomson’s
formula:

$$\omega = c/\sqrt{LC} \tag{62.5}$$

(W. Thomson, 1853).
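The free-oscillation frequencies can be checked directly against the condition $Z = 0$. The sketch below is illustrative only: the function names and the numerical values (Gaussian units, chosen arbitrarily) are assumptions, and the two regimes are selected simply by changing $R$.

```python
import cmath

C_LIGHT = 2.998e10   # cm/s

def impedance_rlc(omega, R, L, C, c=C_LIGHT):
    """Z = R - i (omega L / c^2 - 1 / (omega C)), equation (62.2)."""
    return R - 1j * (omega * L / c**2 - 1 / (omega * C))

def free_frequencies(R, L, C, c=C_LIGHT):
    """The two complex roots of Z(omega) = 0, equation (62.4)."""
    g = R * c**2 / (2 * L)                       # damping decrement
    root = cmath.sqrt(c**2 / (L * C) - g**2)     # real if periodic, imaginary if aperiodic
    return (-1j * g + root, -1j * g - root)

if __name__ == "__main__":
    for R in (1.0e-12, 1.0e-9):                  # weakly damped, then aperiodic
        for w in free_frequencies(R, L=1.0e5, C=1.0e3):
            print(R, w, abs(impedance_rlc(w, R, 1.0e5, 1.0e3)))
```

In the first case the roots have equal and opposite real parts and a common negative imaginary part; in the second both roots are purely imaginary, which corresponds to the aperiodic discharge.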

Equation (62.1) can be immediately generalized to a system of several inductively
coupled circuits containing capacitors. The current $J_a$ in the $a$th circuit is related to the
charges $\pm e_a$ on the corresponding capacitor by $J_a = de_a/dt$, and equation (62.1) is
replaced by the set of equations

$$\frac{1}{c^2}\sum_b L_{ab}\frac{d^2e_b}{dt^2} + R_a\frac{de_a}{dt} + \frac{e_a}{C_a} = \mathscr{E}_a. \tag{62.6}$$

For periodic (monochromatic) currents, these give the algebraic equations

$$\sum_b Z_{ab}J_b = \mathscr{E}_a, \tag{62.7}$$

the matrix elements $Z_{ab}$ being given by the formulae

$$Z_{ab} = \delta_{ab}\left(R_a + \frac{i}{\omega C_a}\right) - \frac{i\omega}{c^2}L_{ab}. \tag{62.8}$$

The eigenfrequencies of the current system are given by the condition of compatibility of
equations (62.7) when $\mathscr{E}_a = 0$, i.e. by the condition

$$\det|Z_{ab}| = 0. \tag{62.9}$$

If the resistances $R_a$ are not zero, all the “frequencies” have a non-zero imaginary part, and
the electric oscillations are therefore damped.

It should be noticed that equations (62.6) are formally identical with the mechanical
equations of motion of a system with several degrees of freedom which executes small
damped oscillations. The generalized coordinates are represented by the charges $e_a$, and
the generalized velocities by the currents $J_a = de_a/dt$. The Lagrangian of the system is

$$\mathscr{L} = \frac{1}{2c^2}\sum_{a,b}L_{ab}\dot{e}_a\dot{e}_b - \sum_a\frac{e_a^2}{2C_a} + \sum_a e_a\mathscr{E}_a. \tag{62.10}$$

The kinetic and potential energies of the mechanical system are represented by the magnetic
and electric energies of the current system, and the quantities $\mathscr{E}_a$ correspond to the
externally applied forces which cause the forced oscillations of the system. The quantities
$R_a$ appear in the dissipative function

$$\mathscr{R} = \tfrac{1}{2}\sum_a R_a\dot{e}_a^2. \tag{62.11}$$

Equations (62.6) are the analogues of Lagrange’s equations

$$\frac{d}{dt}\frac{\partial\mathscr{L}}{\partial\dot{e}_a} - \frac{\partial\mathscr{L}}{\partial e_a} = -\frac{\partial\mathscr{R}}{\partial\dot{e}_a}. \tag{62.12}$$


PROBLEMS 


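PROBLEM 1. Determine the eigenfrequencies of electric oscillations in two inductively coupled circuits
containing self-inductances $L_1$ and $L_2$ and capacitances $C_1$ and $C_2$, neglecting the resistances $R_1$ and $R_2$.

SOLUTION. The required frequencies are determined from the condition

$$\det|Z_{ab}| = Z_{11}Z_{22} - Z_{12}^2 = 0,$$

where

$$Z_{11} = -\frac{i}{c^2}\left(\omega L_1 - \frac{c^2}{\omega C_1}\right), \qquad
Z_{22} = -\frac{i}{c^2}\left(\omega L_2 - \frac{c^2}{\omega C_2}\right), \qquad
Z_{12} = -\frac{i\omega}{c^2}L_{12}.$$

Calculation gives

$$\omega_{1,2}^2 = c^2\,\frac{L_1C_1 + L_2C_2 \mp \sqrt{(L_1C_1 - L_2C_2)^2 + 4C_1C_2L_{12}^2}}{2C_1C_2(L_1L_2 - L_{12}^2)}.$$

Both frequencies are purely real, owing to the fact that $R_1$ and $R_2$ have been neglected. As $L_{12} \to 0$, $\omega_1$ and $\omega_2$ tend
to $c/\sqrt{L_1C_1}$ and $c/\sqrt{L_2C_2}$. These are the frequencies for the two circuits separately.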
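The eigenfrequency formula of Problem 1 can be verified by substituting the result back into $\det|Z_{ab}|$. The following is a minimal sketch; the function names and the parameter values (arbitrary units with $c = 1$) are assumptions made only for the check.

```python
import math

def eigenfrequencies(L1, L2, L12, C1, C2, c=1.0):
    """The two real eigenfrequencies of two coupled loss-free circuits (needs L1*L2 > L12**2)."""
    a, b = L1 * C1, L2 * C2
    disc = math.sqrt((a - b) ** 2 + 4 * C1 * C2 * L12**2)
    denom = 2 * C1 * C2 * (L1 * L2 - L12**2)
    return (c * math.sqrt((a + b - disc) / denom), c * math.sqrt((a + b + disc) / denom))

def det_z(omega, L1, L2, L12, C1, C2, c=1.0):
    """det|Z_ab| = Z11 Z22 - Z12^2 with R1 = R2 = 0."""
    z11 = -1j * (omega * L1 / c**2 - 1 / (omega * C1))
    z22 = -1j * (omega * L2 / c**2 - 1 / (omega * C2))
    z12 = -1j * omega * L12 / c**2
    return z11 * z22 - z12**2

if __name__ == "__main__":
    params = dict(L1=2.0, L2=3.0, L12=1.0, C1=0.5, C2=0.25)
    for w in eigenfrequencies(**params):
        print(w, abs(det_z(w, **params)))        # determinant should vanish to rounding error
    print(eigenfrequencies(2.0, 3.0, 0.0, 0.5, 0.25))   # uncoupled limit
```

With the sample values the squared frequencies come out as 0.8 and 2.0, and in the uncoupled limit the two roots reduce to $c/\sqrt{L_1C_1}$ and $c/\sqrt{L_2C_2}$.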

PROBLEM 2. The same as Problem 1, but for a circuit consisting of a resistance $R$, a capacitance $C$ and an
inductance $L$ connected in parallel.

SOLUTION. The impedances of the three branches are $Z_1 = R$, $Z_2 = i/\omega C$, $Z_3 = -i\omega L/c^2$, and the currents in
them are such that $J_1 + J_2 + J_3 = 0$, $Z_1J_1 = Z_2J_2 = Z_3J_3$. Hence we have $1/Z_1 + 1/Z_2 + 1/Z_3 = 0$, whence

$$\omega = -\frac{i}{2RC} \pm \sqrt{\frac{c^2}{LC} - \frac{1}{4R^2C^2}}.$$


PROBLEM 3. Discuss the propagation of electric oscillations in a circuit consisting of an infinite succession of
identical meshes containing impedances

$$Z_1 = -i\left(\frac{\omega L_1}{c^2} - \frac{1}{\omega C_1}\right), \qquad Z_2 = -i\left(\frac{\omega L_2}{c^2} - \frac{1}{\omega C_2}\right),$$

as shown in Fig. 37. Find the range of frequencies which can be propagated in the circuit without damping.†

[Fig. 37]

† The condition for the quasi-steady theory to be applicable to such a periodic circuit is that the dimension of
one mesh should be small compared with the “wavelength” $c/\omega$.


SOLUTION. The current in mesh $a$ is denoted by $i_a$ as shown in Fig. 37. Kirchhoff’s second law gives for this
mesh $Z_1i_a + Z_2(2i_a - i_{a-1} - i_{a+1}) = 0$. This is a linear difference equation in the integral variable $a$, with constant
coefficients. We seek the solution in the form $i_a = \text{constant} \times q^a$, obtaining for the parameter $q$ the equation

$$q^2 - (2 + Z_1/Z_2)\,q + 1 = 0. \tag{1}$$

Let $-4 \le Z_1/Z_2 \le 0$, corresponding to values of $\omega$ lying between $c/\sqrt{L_1C_1}$ and $c\sqrt{(4/C_2 + 1/C_1)/(4L_2 + L_1)}$.
Then equation (1) has two complex conjugate roots with moduli $|q| = 1$. This means that the current does not
decrease from one mesh to the next, i.e. the electric oscillations are propagated in the circuit without damping.
Putting $q = e^{ikl}$, where $l$ is the length of one mesh and $k$ is the “wave number” of the oscillations propagated in the
circuit, we can calculate the velocity of propagation $u$ from the general result $u = d\omega/dk$.

If, however, $\omega$ is outside the range mentioned, equation (1) has two real roots $q_1$ and $q_2$, say; since $q_1q_2 = 1$, one
root ($q_1$, say) is less than 1 in absolute magnitude, while $q_2$ is greater. It is easy to see that the propagation of
undamped oscillations in the circuit is then impossible. To elucidate the reason for this, let us consider a circuit of
large but finite length. An initial oscillatory impulse is given to one end of the circuit, the other end being closed in
some manner. This closure corresponds mathematically to a certain boundary condition, by means of which we
can determine the ratio of the coefficients $c_1$ and $c_2$ in the general solution $i_a = c_1q_1^{a-A} + c_2q_2^{a-A}$, where $A$
is the “coordinate” of the end of the circuit. This ratio is of the order of unity. As $A - a$ increases, the second term
in the solution rapidly becomes very small compared with the first term, because $|q_2| > |q_1|$. Thus the solution is
$i_a = c_1q_1^{a-A}$ everywhere except for a small part near the end of the circuit, and $|i_a|$ decreases towards the end of
the circuit.

It should be emphasized that this damping does not involve dissipative absorption, because there is no
resistance in the circuit; it can be imagined as being the result of reflection of the oscillatory impulse from each
successive mesh of the circuit.
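The pass band of the mesh circuit can be explored numerically from equation (1) of the solution. In the sketch below the series forms $Z_1 = -i(\omega L_1/c^2 - 1/\omega C_1)$ and $Z_2 = -i(\omega L_2/c^2 - 1/\omega C_2)$ are assumed (they reproduce the band limits quoted in the solution), and the function names and parameter values (arbitrary units with $c = 1$) are hypothetical.

```python
import cmath
import math

def z_ratio(omega, L1, C1, L2, C2, c=1.0):
    """Z1/Z2 for the assumed loss-free series L-C impedances; the ratio is real."""
    z1 = -1j * (omega * L1 / c**2 - 1 / (omega * C1))
    z2 = -1j * (omega * L2 / c**2 - 1 / (omega * C2))
    return (z1 / z2).real

def q_roots(ratio):
    """Roots of q^2 - (2 + Z1/Z2) q + 1 = 0."""
    b = 2 + ratio
    s = cmath.sqrt(b * b - 4)
    return ((b - s) / 2, (b + s) / 2)

def band_edges(L1, C1, L2, C2, c=1.0):
    """Frequencies at which Z1/Z2 equals 0 and -4, in increasing order."""
    e0 = c / math.sqrt(L1 * C1)
    e4 = c * math.sqrt((4 / C2 + 1 / C1) / (4 * L2 + L1))
    return tuple(sorted((e0, e4)))

if __name__ == "__main__":
    p = dict(L1=1.0, C1=1.0, L2=1.0, C2=0.25)
    print("band edges:", band_edges(**p))
    for omega in (0.5, 1.5, 1.95):
        q1, q2 = q_roots(z_ratio(omega, **p))
        print(omega, abs(q1), abs(q2))
```

Inside the band both roots should have modulus 1; outside it one modulus is below 1 and the other above, their product remaining equal to 1.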


§63. Motion of a conductor in a magnetic field 

Hitherto we have tacitly assumed that a conductor in an electromagnetic field is at rest in
the frame of reference $K$ in which $\mathbf{E}$, $\mathbf{H}$, etc. are defined. In particular, the relation $\mathbf{j} = \sigma\mathbf{E}$
between the current and the field is generally valid only for conductors at rest.

To determine the corresponding relation in a moving conductor, we change from the
frame $K$ to another frame $K'$ in which the conductor, or some part of it, is at rest at the
instant considered. In this frame we have $\mathbf{j} = \sigma\mathbf{E}'$, where $\mathbf{E}'$ is the electric field in $K'$. The
well-known formula for the transformation of fields† gives $\mathbf{E}'$ in terms of the field in $K$:

$$\mathbf{E}' = \mathbf{E} + \mathbf{v}\times\mathbf{B}/c, \tag{63.1}$$

where $\mathbf{v}$ is the velocity of $K'$ relative to $K$, i.e. in this case the velocity of the conductor,
which we of course suppose small compared with the velocity of light. Thus we find

$$\mathbf{j} = \sigma(\mathbf{E} + \mathbf{v}\times\mathbf{B}/c). \tag{63.2}$$

This gives the relation between the current and the field in moving conductors. The
following remark should be made concerning its derivation. In going from one frame of
reference to the other we have transformed the field but left the current $\mathbf{j}$ unaltered. The
correct transformation of the current density gives only terms of a higher order of
smallness if $v \ll c$. In formula (63.2) the second term, which appears as a result of the field
transformation, is in general not small compared with the first term, despite the factor $v/c$.
For example, if the electric field is due to electromagnetic induction from a variable
magnetic field, its order of magnitude contains a factor $1/c$ as compared with the magnetic
field.

† See Fields, §24. The microscopic values of the electric and magnetic fields are replaced by their averaged
values.

The energy dissipation in a conductor when a given current flows in it cannot, of course,
depend on the motion of the conductor. The rate of evolution of Joule heat per unit volume
in a moving conductor is therefore given in terms of the current density by the same
expression $j^2/\sigma$ as for a conductor at rest. The expression $\mathbf{j}\cdot\mathbf{E}$, however, is replaced by†
$j^2/\sigma = \mathbf{j}\cdot(\mathbf{E} + \mathbf{v}\times\mathbf{B}/c)$.

Thus, in a moving conductor, the sum $\mathbf{E} + \mathbf{v}\times\mathbf{B}/c$ acts as an “effective” electric field
producing the conduction current. Hence the e.m.f. acting in a closed linear circuit $C$ is
given by the integral

$$\mathscr{E} = \oint(\mathbf{E} + \mathbf{v}\times\mathbf{B}/c)\cdot d\mathbf{l}. \tag{63.3}$$

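This expression can be transformed as follows. According to Maxwell’s equation,
$\operatorname{curl}\mathbf{E} = -(1/c)\,\partial\mathbf{B}/\partial t$, and so

$$\oint\mathbf{E}\cdot d\mathbf{l} = \int\operatorname{curl}\mathbf{E}\cdot d\mathbf{f} = -\frac{1}{c}\int\frac{\partial\mathbf{B}}{\partial t}\cdot d\mathbf{f}$$

or, denoting by $\Phi$ the magnetic flux through the surface $S$ which spans the circuit $C$,

$$\oint\mathbf{E}\cdot d\mathbf{l} = -\frac{1}{c}\left(\frac{\partial\Phi}{\partial t}\right)_{\mathbf{v}=0}.$$

The time derivative with the suffix $\mathbf{v} = 0$ denotes the rate of change of the magnetic flux due
to the time variation of the magnetic field, the position of the contour $C$ remaining
unchanged.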

In the second term in (63.3), we put $\mathbf{v} = d\mathbf{u}/dt$, where $d\mathbf{u}$ is an infinitesimal displacement
of the circuit element $d\mathbf{l}$. Then

$$\oint\mathbf{v}\times\mathbf{B}\cdot d\mathbf{l} = \oint d\mathbf{u}\times\mathbf{B}\cdot d\mathbf{l}/dt = -\oint\mathbf{B}\cdot d\mathbf{f}/dt,$$

where $d\mathbf{f} = d\mathbf{u}\times d\mathbf{l}$ is an element of area on the “side” surface between two infinitely close
positions $C$ and $C'$ of the current circuit, which it occupies at times $t$ and $t + dt$.
Since the total magnetic flux through any closed surface is zero, the flux through the side
surface must evidently equal the difference of the fluxes through surfaces spanning $C$ and
$C'$. Thus

$$\frac{1}{c}\oint\mathbf{v}\times\mathbf{B}\cdot d\mathbf{l} = -\frac{1}{c}\left(\frac{\partial\Phi}{\partial t}\right)_{\mathbf{B}=\text{constant}},$$

where the time derivative denotes the rate of change of the magnetic flux due to the motion
of the conductor in a constant field.

† It is seen from this formula that the additional heat evolved in time $\delta t$ in a conductor moving in a magnetic
field is

$$\delta t\int\mathbf{j}\cdot\mathbf{v}\times\mathbf{B}\,dV/c = -\int\delta\mathbf{u}\cdot\mathbf{j}\times\mathbf{B}\,dV/c,$$

where $\delta\mathbf{u} = \mathbf{v}\,\delta t$ is the displacement in time $\delta t$. This expression is equal and opposite to the work done on the
conductor in time $\delta t$ by the volume forces $\mathbf{f} = \mathbf{j}\times\mathbf{B}/c$. This explains the apparent contradiction mentioned in §…

Adding the two terms, we have finally 


$$ \mathscr{E} = -(1/c)\,d\Phi/dt, \tag{63.4} $$


where the time derivative now denotes the total rate of change of the magnetic flux through 
the moving circuit. Thus the expression (63.4), which is Faraday’s law, is valid whatever the 
reason for the change in the magnetic flux, whether variation of the field itself (already 
discussed in §61, formula (61.13)) or motion of the conductor. 
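
The equality of the motional e.m.f. and the flux rule can be checked numerically in a simple case. The sketch below is only an illustration under stated assumptions: a rigid rectangular circuit in the xy-plane moves with velocity u along x through a static field B_z(x); the field profile, the dimensions and all function names are hypothetical and are not taken from the text. It compares (1/c)∮ v × B · dl with −(1/c) dΦ/dt obtained from a finite difference of the flux.

```python
C = 2.998e10  # speed of light, cm/s (Gaussian units)


def b_z(x, b0=100.0, k=0.05):
    """Static field B_z(x) in gauss; this profile is an assumption for illustration."""
    return b0 * (1.0 + k * x * x)


def flux(x0, w, h, n=2000):
    """Flux through the rectangle x0 <= x <= x0 + w, 0 <= y <= h (midpoint rule)."""
    dx = w / n
    return h * dx * sum(b_z(x0 + (i + 0.5) * dx) for i in range(n))


def motional_emf(x0, w, h, u):
    """(1/c) * closed integral of (v x B) . dl, counter-clockwise, with v = (u, 0, 0).

    (v x B)_y = -u * B_z, so only the two sides parallel to y contribute.
    """
    right_side = -u * b_z(x0 + w) * h   # traversed in the +y direction
    left_side = -u * b_z(x0) * (-h)     # traversed in the -y direction
    return (right_side + left_side) / C


def flux_rule_emf(x0, w, h, u, dt=1e-4):
    """-(1/c) dPhi/dt, with the derivative taken as a central difference."""
    d_phi = flux(x0 + u * dt, w, h) - flux(x0 - u * dt, w, h)
    return -d_phi / (2.0 * dt) / C


emf_line = motional_emf(1.0, 2.0, 3.0, 50.0)
emf_flux = flux_rule_emf(1.0, 2.0, 3.0, 50.0)
print(emf_line, emf_flux)
```

The two numbers agree to within the accuracy of the finite difference, as (63.4) requires for motion in a constant field.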

In a static magnetic field, the change in the flux may be due entirely to the motion of the 
circuit. If the circuit moves in such a way that every point of it moves along a line of force, 
then the flux through the circuit does not vary. This is an obvious result of the fact that the 
magnetic flux through any closed surface is zero, and the flux through the side surface 
described by the moving circuit is in this case identically zero (since $\mathbf{B}\cdot d\mathbf{f} = 0$ on this surface).
Thus we can say that, to induce an e.m.f., the conductor must certainly move so as to cross
lines of magnetic force.

The electromagnetic field in a moving conductor is given by the equations 


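$$ \begin{aligned} \operatorname{curl}\mathbf{E} &= -(1/c)\,\partial\mathbf{B}/\partial t, \\ \operatorname{curl}\mathbf{H} &= 4\pi\mathbf{j}/c = (4\pi\sigma/c)(\mathbf{E} + \mathbf{v}\times\mathbf{B}/c), \\ \operatorname{div}\mathbf{B} &= 0. \end{aligned} \tag{63.5} $$

Expressing $\mathbf{E}$ in terms of $\mathbf{H}$ by means of the second equation and substituting in the first,
we obtain

$$ \frac{\partial\mathbf{B}}{\partial t} - \operatorname{curl}(\mathbf{v}\times\mathbf{B}) = -\frac{c^2}{4\pi}\operatorname{curl}\left(\frac{1}{\sigma}\operatorname{curl}\mathbf{H}\right). \tag{63.6} $$

In a homogeneous conductor with constant conductivity $\sigma$ and constant magnetic
permeability $\mu$, we have

$$ \frac{\partial\mathbf{H}}{\partial t} - \operatorname{curl}(\mathbf{v}\times\mathbf{H}) = \frac{c^2}{4\pi\sigma\mu}\,\Delta\mathbf{H}, \qquad \operatorname{div}\mathbf{H} = 0. \tag{63.7} $$

These equations generalize those obtained in §58.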
It should be pointed out, however, that, if there is only one conductor moving as a whole
(without change of shape) in an external magnetic field, then the solution of the problem is
considerably simplified if we use a system of coordinates fixed in the conductor. In this 
system the conductor is at rest, and the external field varies with time in a given manner, so 
that we return to the eddy-current problems discussed in §59. This possibility does not 
depend on Galileo’s (or on Einstein’s) relativity principle, since the new system of 
coordinates is in general not inertial. The equivalence of the problems results from the 
above-mentioned fact that the electromagnetic induction is independent of the cause of the 
change in the magnetic flux. This equivalence can also be demonstrated mathematically. To 
do so, we expand the expression curl (v x B), using the facts that div B = 0 and (for motion 
of the body as a whole) $\operatorname{div}\mathbf{v} = 0$ (i.e. the body is incompressible). Then the left-hand side of
equation (63.6) becomes

$$ \partial\mathbf{B}/\partial t + (\mathbf{v}\cdot\operatorname{grad})\mathbf{B} - (\mathbf{B}\cdot\operatorname{grad})\mathbf{v}. \tag{63.8} $$

This sum is just the time derivative of $\mathbf{B}$ with respect to axes fixed in a rotating body. For
the sum of the first two terms is the “substantial” time derivative (derivative following the
motion) $d\mathbf{B}/dt$, which gives the rate of change of $\mathbf{B}$ at a point moving with velocity $\mathbf{v}$. The
third term takes into account the change in the direction of $\mathbf{B}$ relative to the body; it is zero
for pure translation ($\mathbf{v}$ = constant) and equals $-\boldsymbol{\Omega}\times\mathbf{B}$ for rotation ($\mathbf{v} = \boldsymbol{\Omega}\times\mathbf{r}$, where $\boldsymbol{\Omega}$ is
the angular velocity).
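
The statement about the third term of (63.8) can be verified numerically. The following is a minimal sketch, assuming a rigid rotation v = Ω × r and arbitrary illustrative vectors; the helper names are hypothetical. It evaluates (B · grad)v as a directional derivative of v along B and compares it with Ω × B, so that −(B · grad)v = −Ω × B.

```python
def cross(a, b):
    """Vector product of two 3-component tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def velocity(r, omega):
    """Velocity field of rigid rotation, v = omega x r."""
    return cross(omega, r)


def b_dot_grad_v(r, b, omega, eps=1e-6):
    """(B . grad) v at the point r, from a central difference along B."""
    r_plus = tuple(ri + eps * bi for ri, bi in zip(r, b))
    r_minus = tuple(ri - eps * bi for ri, bi in zip(r, b))
    v_plus = velocity(r_plus, omega)
    v_minus = velocity(r_minus, omega)
    return tuple((p - m) / (2.0 * eps) for p, m in zip(v_plus, v_minus))


omega = (0.3, -1.2, 2.0)   # angular velocity (illustrative values)
b = (1.5, 0.4, -0.7)       # field vector at the point (illustrative values)
r = (0.2, 0.9, -0.5)       # point at which the derivative is taken

numeric = b_dot_grad_v(r, b, omega)
exact = cross(omega, b)
print(numeric, exact)
```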

To conclude this section, let us consider the phenomenon of unipolar induction, which 
occurs when a magnetized conductor rotates. If a stationary wire is connected to the 
rotating magnet by means of two sliding contacts A and B (Fig. 39) then a current flows in 
the wire. It is not difficult to calculate the e.m.f. which produces the current; the simplest 
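procedure is to use a system of coordinates rotating with the magnet. If $\boldsymbol{\Omega}$ is the angular
velocity of rotation of the magnet, then in the new system the wire rotates with angular
velocity $-\boldsymbol{\Omega}$, while the magnet is at rest. Thus we have a conductor moving in a given
static magnetic field $\mathbf{B}$ due to a fixed magnet. We neglect the distortion of the field by the
wire itself. According to formula (63.3), the e.m.f. between the ends of the wire is

$$ \mathscr{E} = \frac{1}{c}\int_{ACB} \mathbf{v}\times\mathbf{B}\cdot d\mathbf{l} = -\frac{1}{c}\int_{ACB} \mathbf{B}\times(\mathbf{r}\times\boldsymbol{\Omega})\cdot d\mathbf{l}, \tag{63.9} $$

taken along the wire (here $\mathbf{v} = -\boldsymbol{\Omega}\times\mathbf{r} = \mathbf{r}\times\boldsymbol{\Omega}$ is the velocity of the wire element). This is the required solution.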
Problem 1. Determine the magnetic moment of a conducting sphere (with $\mu = 1$) rotating uniformly in a
uniform static magnetic field, and the torque on the sphere.

Solution. Let the external field have components $\mathfrak{H}_x$, 0, $\mathfrak{H}_z$ in a fixed system of coordinates $x$, $y$, $z$ with
the $z$-axis in the direction of the angular velocity vector $\boldsymbol{\Omega}$. In a coordinate system $\xi$, $\eta$, $z$ which rotates with the
sphere, the field components are $\mathfrak{H}_\xi = \mathfrak{H}_x\cos\Omega t$, $\mathfrak{H}_\eta = -\mathfrak{H}_x\sin\Omega t$, or, in complex form,
$\mathfrak{H}_\xi = \mathfrak{H}_x e^{-i\Omega t}$, $\mathfrak{H}_\eta = -i\mathfrak{H}_x e^{-i\Omega t}$, together with the constant component $\mathfrak{H}_z$.

Thus variable fields with frequency $\Omega$ act along the $\xi$ and $\eta$ axes, and the magnetic moment which they induce is

$$ \mathscr{M}_\xi = V\,\mathrm{re}(\alpha\mathfrak{H}_\xi) = V\mathfrak{H}_x(\alpha'\cos\Omega t + \alpha''\sin\Omega t), $$

$$ \mathscr{M}_\eta = V\,\mathrm{re}(\alpha\mathfrak{H}_\eta) = V\mathfrak{H}_x(-\alpha'\sin\Omega t + \alpha''\cos\Omega t), $$

where $V\alpha$ is the complex magnetic polarizability of the sphere, which has been determined in §59, Problem 1.
Along the $z$-axis, on the other hand, the magnetic field is constant, and therefore causes no magnetic moment
(if $\mu = 1$). The components of the magnetic moment in the fixed system of coordinates are $\mathscr{M}_x = V\alpha'\mathfrak{H}_x$,
$\mathscr{M}_y = V\alpha''\mathfrak{H}_x$, $\mathscr{M}_z = 0$. Thus in this problem $\alpha'$ and $\alpha''$ give the components of the magnetic moment of the sphere
respectively parallel and perpendicular to the plane of the vectors $\boldsymbol{\Omega}$ and $\boldsymbol{\mathfrak{H}}$.

The torque on the sphere is $\mathbf{K} = \boldsymbol{\mathscr{M}}\times\boldsymbol{\mathfrak{H}}$. Its components relative to the fixed axes are

$$ K_x = V\alpha''\mathfrak{H}_x\mathfrak{H}_z, \qquad K_y = -V\alpha'\mathfrak{H}_x\mathfrak{H}_z, \qquad K_z = -V\alpha''\mathfrak{H}_x^2. $$

The above reduction of the problem of a sphere rotating in a field to that of a sphere at rest in a variable field is
perfectly natural, in view of the comment at the end of the solution in §59, Problem 1. An interesting feature of the
analogy is that, as the frequency $\omega$ of the variable magnetic field increases, the field is squeezed out of the sphere; in
the limit $\omega\to\infty$, all the lines of force pass round the sphere, not through it. In a similar way, a magnetic field
perpendicular to the axis of rotation is squeezed out of a rapidly rotating sphere.

Problem 2. Determine the e.m.f. due to unipolar induction between the pole and the equator (Fig. 39) of a
uniformly magnetized sphere rotating uniformly about the direction of magnetization.

Solution. When the sphere rotates about its direction of magnetization, it generates a static field, and,
since no currents flow within the sphere, we find from (63.6) that $\operatorname{curl}(\mathbf{v}\times\mathbf{B}) = 0$. Hence the integral of $\mathbf{v}\times\mathbf{B}$ along
the closed contour OACBO (Fig. 39) is zero, and so the integration along ACB in formula (63.9) may be replaced
by one along the path AOB, which lies inside the sphere. The integral along the segment AO of the axis of rotation
is zero, since $\boldsymbol{\Omega}$ and $\mathbf{r}$ are parallel; the integral along the radius OB gives, since $\mathbf{B}$ and $\boldsymbol{\Omega}$ are parallel within the
sphere,

$$ \mathscr{E} = -\frac{1}{c}\int_0^a B_0\Omega r\,dr = -B_0\Omega a^2/2c, $$

where $a$ is the radius of the sphere and $B_0$ the magnetic induction in it. In a uniformly magnetized sphere (in
the absence of an external field) the induction is related to the magnetization by $B_0 + 2H = 0$ (cf. (8.1)) and
$B_0 - H = 4\pi M$, whence $B_0 = 8\pi M/3$. In terms of the total magnetic moment $\mathscr{M} = 4\pi a^3 M/3$ of the sphere we have finally

$$ \mathscr{E} = -\Omega\mathscr{M}/ca. $$

Problem 3. Determine the total charge which flows along a closed linear circuit when the magnetic flux
through the circuit changes for any reason from one constant value ($\Phi_1$) to another ($\Phi_2$).

Solution. The required total charge is the integral $\int_{-\infty}^{\infty} J\,dt$, where $J(t)$ is the induction current in the
circuit. This integral is the Fourier component of the function $J(t)$ that has the frequency $\omega = 0$, and is therefore related to
the corresponding component of the e.m.f. by

$$ \int_{-\infty}^{\infty} \mathscr{E}\,dt = Z(0)\int_{-\infty}^{\infty} J\,dt; $$

see (61.3). Putting $Z(0) = R$, where $R$ is the resistance of the circuit to a steady current, and $\mathscr{E} = -(1/c)\,d\Phi/dt$,
we have

$$ \int_{-\infty}^{\infty} J\,dt = (\Phi_1 - \Phi_2)/cR. $$


§64. Excitation of currents by acceleration 

In discussing the motion of a conductor in §63 we have neglected possible effects of the
acceleration, if any. The accelerated motion of a metal, however, is equivalent to the action
of additional inertia forces on the conduction electrons. If $\dot{\mathbf{v}}$ is the acceleration of the
conductor and $m$ the mass of the electron, then the force on an electron is $-m\dot{\mathbf{v}}$. It affects
the electron in the same way as an electric field $m\dot{\mathbf{v}}/e$, where $-e$ is the charge on the
electron. Thus the effective electric field on the conduction electrons in an accelerated
conductor is

$$ \mathbf{E}' = \mathbf{E} + m\dot{\mathbf{v}}/e. \tag{64.1} $$

The current density is accordingly

$$ \mathbf{j} = \sigma\mathbf{E}' = \sigma(\mathbf{E} + m\dot{\mathbf{v}}/e). \tag{64.2} $$

Expressing $\mathbf{E}$ in terms of $\mathbf{E}'$ from (64.1), we substitute in the equation
$\operatorname{curl}\mathbf{E} = -(1/c)\,\partial\mathbf{H}/\partial t$ (as usual, we put $\mu = 1$). Then

$$ \operatorname{curl}\mathbf{E}' = -\frac{1}{c}\frac{\partial\mathbf{H}}{\partial t} + \frac{m}{e}\operatorname{curl}\dot{\mathbf{v}}. \tag{64.3} $$

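We write $\mathbf{v}$ as a sum $\mathbf{v} = \mathbf{u} + \boldsymbol{\Omega}\times\mathbf{r}$, where $\mathbf{u}$ is the translational velocity and $\boldsymbol{\Omega}$ the angular
velocity of rotation of the body. Differentiating with respect to time, we find the
acceleration to be

$$ \dot{\mathbf{v}} = \dot{\mathbf{u}} + \boldsymbol{\Omega}\times\mathbf{v} + \dot{\boldsymbol{\Omega}}\times\mathbf{r} = \dot{\mathbf{u}} + \boldsymbol{\Omega}\times\mathbf{u} + \boldsymbol{\Omega}\times(\boldsymbol{\Omega}\times\mathbf{r}) + \dot{\boldsymbol{\Omega}}\times\mathbf{r}. $$

The first two terms are independent of $\mathbf{r}$, and therefore give zero on differentiation with respect to the
coordinates. The third term can be written as $\boldsymbol{\Omega}\times(\boldsymbol{\Omega}\times\mathbf{r}) = -\tfrac{1}{2}\operatorname{grad}(\boldsymbol{\Omega}\times\mathbf{r})^2$, and its curl
is therefore zero. Finally, $\operatorname{curl}(\dot{\boldsymbol{\Omega}}\times\mathbf{r}) = 2\dot{\boldsymbol{\Omega}}$. Thus, substituting for $\dot{\mathbf{v}}$ in equation (64.3), we
have $\operatorname{curl}\mathbf{E}' = -(1/c)\,\partial\mathbf{H}/\partial t + 2m\dot{\boldsymbol{\Omega}}/e$, or

$$ \operatorname{curl}\mathbf{E}' = -\frac{1}{c}\frac{\partial\mathbf{H}'}{\partial t}, \tag{64.4} $$

where

$$ \mathbf{H}' = \mathbf{H} - 2mc\boldsymbol{\Omega}/e. \tag{64.5} $$

Since $\boldsymbol{\Omega}$ is independent of the coordinates, the equation $\operatorname{curl}\mathbf{H} = 4\pi\mathbf{j}/c$ is still valid if $\mathbf{H}$
is replaced by $\mathbf{H}'$:

$$ \operatorname{curl}\mathbf{H}' = 4\pi\sigma\mathbf{E}'/c. \tag{64.6} $$

Eliminating $\mathbf{E}'$ from equations (64.4) and (64.6), we obtain for $\mathbf{H}'$ the equation

$$ \Delta\mathbf{H}' = (4\pi\sigma/c^2)\,\partial\mathbf{H}'/\partial t, \tag{64.7} $$

which is the same as the equation for $\mathbf{H}$ in a conductor at rest.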


Outside the body, the field satisfies the equation $\Delta\mathbf{H} = 0$ (the wavelength being
supposed large compared with the dimension of the body), and $\mathbf{H}'$ satisfies the same
equation.

Finally, on the surface of the conductor $\mathbf{H}'$, like $\mathbf{H}$, is continuous. The only difference is
in the condition at infinity, where $\mathbf{H}$ tends to zero but $\mathbf{H}'$ tends to the limit $-2mc\boldsymbol{\Omega}/e$.

Thus the problem of determining the variable magnetic field $\mathbf{H}$ near a non-uniformly
rotating body is equivalent to that of determining the field $\mathbf{H}'$ near a body at rest in a
uniform external magnetic field

$$ \boldsymbol{\mathfrak{H}} = -2mc\boldsymbol{\Omega}/e. \tag{64.8} $$

The required field outside the conductor is obtained by subtracting $\boldsymbol{\mathfrak{H}}$ from the
solution $\mathbf{H}'$ of this latter problem.
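
To give a feeling for the magnitude of the equivalent field (64.8), the following sketch evaluates 2mcΩ/e in Gaussian units. The rounded values of the electron mass and charge and the chosen angular velocity are illustrative inputs, and the function name is hypothetical.

```python
M_ELECTRON = 9.109e-28   # electron mass, g
E_CHARGE = 4.803e-10     # elementary charge, esu
C = 2.998e10             # speed of light, cm/s


def equivalent_field(omega):
    """Component along the rotation axis of the field (64.8), in gauss.

    omega is the angular velocity in rad/s; the sign follows (64.8).
    """
    return -2.0 * M_ELECTRON * C * omega / E_CHARGE


field_at_1000 = equivalent_field(1000.0)
print(field_at_1000)
```

For Ω of the order of 10³ rad/s the equivalent field is of the order of 10⁻⁴ G, which indicates how small the effects discussed in this section are.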

The magnetic field thus produced, like any variable field, induces electric currents in the 
conductor itself. In a simply-connected body, these currents appear in the form of a 
magnetic moment. In a non-uniformly rotating ring, the effect appears as an e.m.f.—the 
Stewart-Tolman effect. 

Misunderstanding may arise from the appearance of the angular velocity $\boldsymbol{\Omega}$ itself, and
not its time derivative, in formula (64.8). We may therefore emphasize that the above
discussion, and therefore the significance here attached to the quantity (64.8), pertain only
to non-uniform rotation. When $\boldsymbol{\Omega}$ is constant, equation (64.7) with the appropriate
boundary condition at infinity is identically satisfied by $\mathbf{H}' = \boldsymbol{\mathfrak{H}}$, and the definition (64.5)
then gives $\mathbf{H} = 0$. The magnetic field which arises from the gyromagnetic effect (§36) in
uniform rotation is a small quantity which is here neglected.

The derivation has also ignored the deformation of the body which results from non- 
uniform rotation. It can be seen that including this deformation would not affect the result 
if the characteristic time of variation of the angular velocity is (as we assume) much longer 
than the relaxation time of the conduction electrons in the deformation: the electric 
current in the conductor is due to the gradient of $\phi + \zeta_0/e$, where $\phi$ is the field potential
and $\zeta_0$ the chemical potential of the conduction electrons (see §26). A non-uniform
deformation produces a gradient of $\zeta_0$, but this is compensated by the electric field which
results from the thermodynamic equilibrium condition $e\phi + \zeta_0$ = constant.


Problem 1. Determine the magnetic moment of a non-uniformly rotating sphere with radius $a$. The rate of
rotation is assumed so small that the penetration depth $\delta \gg a$.

Solution. The magnetic moment of the sphere in the field $\boldsymbol{\mathfrak{H}}(t)$ (64.8) is $\boldsymbol{\mathscr{M}} = V\hat{\alpha}\boldsymbol{\mathfrak{H}}$, where $\hat{\alpha}$ is an operator
whose effect on the Fourier components of the function $\boldsymbol{\mathfrak{H}}(t)$ is given by the formulae of §59, Problem 1. For the
frequencies $\omega$ such that $\delta \gg a$ we have $\boldsymbol{\mathscr{M}} = V\alpha(\omega)\boldsymbol{\mathfrak{H}} \approx -4\pi i m a^5\sigma\omega\boldsymbol{\Omega}/15ce$. This formula,
when written as

$$ \boldsymbol{\mathscr{M}} = \frac{4\pi m a^5\sigma}{15ce}\frac{d\boldsymbol{\Omega}}{dt}, $$

does not contain $\omega$ explicitly, and is therefore valid also for the
functions $\boldsymbol{\mathscr{M}}(t)$ and $\boldsymbol{\Omega}(t)$ themselves, as well as their individual Fourier components (on the assumption that the
expansions contain chiefly terms whose frequencies satisfy the above condition).

Problem 2. Determine the total charge which flows along a thin circular ring when it ceases a uniform
rotation about an axis perpendicular to its plane.

Solution. In the formula obtained in §63, Problem 3, $\Phi$ must be taken as the flux of the field $\boldsymbol{\mathfrak{H}}$ (64.8). The
total charge transferred when the angular velocity changes from $\Omega$ to zero is

$$ \int J\,dt = \frac{1}{cR}\,\mathfrak{H}\pi b^2 = -\frac{m\sigma\Omega V}{2\pi e}, $$

where $b$ is the radius of the ring and $V$ its volume.

Problem 3. Determine the current in a superconducting circular ring which ceases to rotate uniformly.

Solution. From the condition that the total magnetic flux through the ring be constant (see (54.5)), we have

$$ J = -\frac{2\pi m c^2 b^2\Omega}{eL} = -\frac{m c^2 b\,\Omega}{2e[\log(8b/a) - 2]}. $$

See the third footnote to §54 concerning the value of $L$.



CHAPTER VIII 


MAGNETOHYDRODYNAMICS 


§65. The equations of motion for a fluid in a magnetic field 

If a conducting fluid moves in a magnetic field, electric fields are induced in it and electric
currents flow. The magnetic field exerts forces on these currents which may considerably 
modify the flow. Conversely, the currents themselves modify the magnetic field. Thus we 
have a complex interaction between the magnetic and the fluid-dynamic phenomena, and 
the flow must be examined by combining the field equations with those of fluid dynamics. 

The applications of magnetohydrodynamics cover a very wide range of physical objects, 
from liquid metals to cosmic plasmas. We shall not discuss here the specific conditions that 
exist in various particular objects, but simply mention that for magnetohydrodynamics to 
be strictly applicable it is of course necessary that the characteristic distances and time 
intervals for the motion in question should be much greater than the mean free path and 
mean free time of the current-carriers (electrons or ions). In some cases, however, 
equations that are formally identical with those of magnetohydrodynamics in an ideal 
fluid may describe also the motion of a medium with a long mean free path. Such a 
situation occurs, for example, in a non-equilibrium plasma in which the electron 
temperature considerably exceeds the ion temperature (cf PK, §38). 

The magnetic permeability of the media considered in magnetohydrodynamics differs
only slightly from unity, and the difference is unimportant as regards the phenomena
under discussion. We shall therefore take $\mu = 1$ throughout the present chapter.†

† In the literature on magnetohydrodynamics, the magnetic field under these conditions is often denoted by $\mathbf{B}$
to emphasize that it is the averaged microscopic field, $\bar{\mathbf{h}} = \mathbf{B}$. We shall here use the notation $\mathbf{H}$, however, for the
sake of uniformity with the other chapters, in which non-magnetic media are discussed.

Let us first set up the equations of magnetohydrodynamics for conditions such that all
dissipative processes may be neglected, i.e. for an ideal fluid. This means that no account is
taken of viscosity and thermal conduction, or of the finite electrical conductivity $\sigma$ of the
medium, which is regarded as being indefinitely great.

Putting $\sigma\to\infty$ in equations (63.7), we write

$$ \operatorname{div}\mathbf{H} = 0, \tag{65.1} $$

$$ \partial\mathbf{H}/\partial t = \operatorname{curl}(\mathbf{v}\times\mathbf{H}). \tag{65.2} $$

The equations of fluid dynamics are the equation of continuity

$$ \partial\rho/\partial t + \operatorname{div}(\rho\mathbf{v}) = 0, \tag{65.3} $$

where $\rho$ is the fluid density, and the Navier-Stokes (Euler) equation

$$ \frac{\partial\mathbf{v}}{\partial t} + (\mathbf{v}\cdot\operatorname{grad})\mathbf{v} = -\frac{1}{\rho}\operatorname{grad}P + \frac{\mathbf{f}}{\rho}, $$

where $\mathbf{f}$ is the volume density of the external (in this case, electromagnetic) forces. By
formula (35.4) we have $\mathbf{f} = \mathbf{j}\times\mathbf{H}/c = (\operatorname{curl}\mathbf{H})\times\mathbf{H}/4\pi$. Thus the equation of motion of the
fluid is

$$ \frac{\partial\mathbf{v}}{\partial t} + (\mathbf{v}\cdot\operatorname{grad})\mathbf{v} = -\frac{1}{\rho}\operatorname{grad}P - \frac{1}{4\pi\rho}\,\mathbf{H}\times\operatorname{curl}\mathbf{H}. \tag{65.4} $$

To these equations we must add the equation of state 

$$ P = P(\rho, T), \tag{65.5} $$

which relates the pressure, density and temperature of the fluid, and the equation of
conservation of entropy, which expresses the fact that the motion is adiabatic in the
absence of dissipation:

$$ ds/dt \equiv \partial s/\partial t + \mathbf{v}\cdot\operatorname{grad}s = 0, \tag{65.6} $$

where $s$ is the entropy per unit mass of the fluid and $d/dt = \partial/\partial t + \mathbf{v}\cdot\operatorname{grad}$ denotes the
“substantial” derivative giving the rate of change of a quantity at a point moving with the
fluid particle. Equations (65.1)-(65.6) form a complete system of equations of
magnetohydrodynamics for an ideal fluid.

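The Navier-Stokes equation can be written (using the equation of continuity) in a form
expressing the law of conservation of momentum:

$$ \partial(\rho v_i)/\partial t = -\partial\Pi_{ik}/\partial x_k, \tag{65.7} $$

where $\Pi_{ik}$ is the momentum flux density tensor (see FM, §7). In the absence of external
forces, $\Pi_{ik} = \rho v_i v_k + P\delta_{ik}$. Transforming the last term in (65.4) by means of
$\mathbf{H}\times\operatorname{curl}\mathbf{H} = \tfrac{1}{2}\operatorname{grad}H^2 - (\mathbf{H}\cdot\operatorname{grad})\mathbf{H}$, and $\operatorname{div}\mathbf{H} \equiv \partial H_k/\partial x_k = 0$, we have in
magnetohydrodynamics

$$ \Pi_{ik} = \rho v_i v_k + P\delta_{ik} - \frac{1}{4\pi}\left(H_i H_k - \tfrac{1}{2}H^2\delta_{ik}\right). \tag{65.8} $$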


The added term is the Maxwell stress tensor, as it should be. 

The equation of conservation of energy in ordinary fluid dynamics is 


$$ \partial(\tfrac{1}{2}\rho v^2 + \rho\varepsilon)/\partial t = -\operatorname{div}\mathbf{q}, \qquad \mathbf{q} = \rho\mathbf{v}(\tfrac{1}{2}v^2 + w); $$

$\varepsilon$ and $w = \varepsilon + P/\rho$ are respectively the internal energy and heat function per unit mass of
fluid; this necessarily follows from the equation of motion (see FM, §6). When a magnetic
field is present in the conducting medium, the energy density includes also the magnetic
energy $H^2/8\pi$, and the energy flux density includes also the Poynting vector
$\mathbf{S} = c\mathbf{E}\times\mathbf{H}/4\pi$. Expressing $\mathbf{E}$ in the latter in terms of $\mathbf{H}$ by

$$ \mathbf{E} = -\mathbf{v}\times\mathbf{H}/c, \tag{65.9} $$

which follows from (63.2) when $\sigma\to\infty$ and $\mathbf{j}$ is finite†, we find that the equation of
conservation of energy in magnetohydrodynamics is

$$ \frac{\partial}{\partial t}\left(\tfrac{1}{2}\rho v^2 + \rho\varepsilon + \frac{H^2}{8\pi}\right) = -\operatorname{div}\mathbf{q}, \tag{65.10} $$

where the energy flux density is

$$ \mathbf{q} = \rho\mathbf{v}(\tfrac{1}{2}v^2 + w) + \mathbf{H}\times(\mathbf{v}\times\mathbf{H})/4\pi. \tag{65.11} $$

It is not difficult to verify this by direct calculation.

† This expression corresponds to a zero field $\mathbf{E}'$ (63.1) in a frame of reference moving with the fluid volume
element concerned: in a perfectly conducting medium, the electric field is completely screened.

The above equations of magnetohydrodynamics are based on neglecting the displacement
current in Maxwell’s equations. We thus assume that

$$ (1/c)\,|\partial\mathbf{E}/\partial t| \ll |\operatorname{curl}\mathbf{H}|. \tag{65.12} $$

Expressing $\mathbf{E}$ in terms of $\mathbf{H}$ by (65.9) then gives

$$ vl/c^2\tau \ll 1, \tag{65.13} $$

where $l$ and $\tau$ are length and time parameters characteristic of the motion concerned. From
(65.2), $l/\tau \sim v$, and we then find from (65.13) the condition $v \ll c$: the motion must be
non-relativistic (as we have assumed from the start). Equation (65.4) gives $\rho v/\tau \sim H^2/l$; in
combination with (65.13), this yields a condition on the magnetic field:

$$ H^2 \ll \rho c^2. \tag{65.14} $$

It should be noted that the left-hand side of (65.10) does not include the electrical energy
$E^2/8\pi$, and that of (65.7) does not include the electromagnetic field momentum $\mathbf{S}/c^2$. This
is a necessary consequence of neglecting the displacement current. The smallness of the
electrical energy in comparison with the magnetic energy corresponds to the inequality
$E \sim vH/c \ll H$, and that of $S/c^2 \sim EH/c \sim vH^2/c^2$ in comparison with $\rho v$ corresponds to
the inequality (65.14).

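Let us return to the equation (65.2), which has an important physical interpretation
(H. Alfvén, 1942). We expand the right-hand side, using the fact that $\operatorname{div}\mathbf{H} = 0$:

$$ \partial\mathbf{H}/\partial t = (\mathbf{H}\cdot\operatorname{grad})\mathbf{v} - (\mathbf{v}\cdot\operatorname{grad})\mathbf{H} - \mathbf{H}\operatorname{div}\mathbf{v}. $$

Substituting from the equation of continuity (65.3)

$$ \operatorname{div}\mathbf{v} = -\frac{1}{\rho}\frac{\partial\rho}{\partial t} - \frac{1}{\rho}\,\mathbf{v}\cdot\operatorname{grad}\rho, $$

we obtain after a simple rearrangement of terms

$$ \frac{d}{dt}\frac{\mathbf{H}}{\rho} = \left(\frac{\mathbf{H}}{\rho}\cdot\operatorname{grad}\right)\mathbf{v}. \tag{65.15} $$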


Let us now consider some “fluid line”, i.e. a line which moves with the fluid particles
composing it. Let $\delta\mathbf{l}$ be an element of length of this line; we shall determine how $\delta\mathbf{l}$ varies
with time. If $\mathbf{v}$ is the fluid velocity at one end of the element $\delta\mathbf{l}$, then the fluid velocity at the
other end is $\mathbf{v} + (\delta\mathbf{l}\cdot\operatorname{grad})\mathbf{v}$. During a time interval $dt$, the length of $\delta\mathbf{l}$ therefore changes
by $dt\,(\delta\mathbf{l}\cdot\operatorname{grad})\mathbf{v}$, i.e. $d(\delta\mathbf{l})/dt = (\delta\mathbf{l}\cdot\operatorname{grad})\mathbf{v}$. We see that the rates of change of the vectors
$\delta\mathbf{l}$ and $\mathbf{H}/\rho$ are given by identical formulae. Hence it follows that, if these vectors are
initially in the same direction, they will remain parallel, and their lengths will remain in the
same ratio. In other words, if two infinitely close fluid particles are on the same line of force
at any time, then they will always be on the same line of force, and the value of $H/\rho$ will be
proportional to the distance between the particles.

Passing now from particles at an infinitesimal distance apart to those at any distance
apart, we conclude that every line of force moves with the fluid particles which lie on it. We
can picture this by saying that (in the limit $\sigma\to\infty$) the lines of magnetic force are “frozen”
in the fluid and move with it. The quantity $H/\rho$ varies at every point proportionally to the
extension of the corresponding “fluid line”. If the fluid may be supposed incompressible,
$\rho$ = constant, and the field $H$ varies as the extension of the lines of force.

These results can be viewed in another way: as any closed fluid contour moves about in
the course of time, it cuts no line of force. This means (cf. §63) that the flux of the magnetic
field through any surface spanning the fluid contour does not vary with time.
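
The “freezing-in” of the field can be illustrated numerically in a very simple case. The sketch below assumes an incompressible plane strain flow v = (kx, −ky) and a spatially uniform field, for which (65.15) reduces to dH/dt = (H · grad)v with ρ constant; the flow, the initial values and the function names are illustrative assumptions. Two neighbouring fluid particles, initially separated along the field direction, are advected with the flow, and their separation is compared with H.

```python
import math

K = 0.7  # strain rate of the assumed flow, 1/s


def velocity(p):
    """Incompressible strain flow v = (K x, -K y)."""
    return (K * p[0], -K * p[1])


def h_rate(h):
    """dH/dt = (H . grad) v for a uniform field in the flow above."""
    return (K * h[0], -K * h[1])


def rk4_step(f, state, dt):
    """One classical Runge-Kutta step for d(state)/dt = f(state)."""
    k1 = f(state)
    k2 = f(tuple(s + 0.5 * dt * k for s, k in zip(state, k1)))
    k3 = f(tuple(s + 0.5 * dt * k for s, k in zip(state, k2)))
    k4 = f(tuple(s + dt * k for s, k in zip(state, k3)))
    return tuple(s + dt * (a + 2 * b + 2 * c + d) / 6.0
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))


def evolve(t_end=2.0, n=2000):
    """Advance the field and two neighbouring fluid particles to time t_end."""
    dt = t_end / n
    h = (3.0, 4.0)
    p1 = (1.0, 1.0)
    p2 = (1.0 + 3.0e-6, 1.0 + 4.0e-6)  # separation initially parallel to H
    for _ in range(n):
        h = rk4_step(h_rate, h, dt)
        p1 = rk4_step(velocity, p1, dt)
        p2 = rk4_step(velocity, p2, dt)
    dl = (p2[0] - p1[0], p2[1] - p1[1])
    return h, dl


h_final, dl_final = evolve()
ratio_x = h_final[0] / dl_final[0]
ratio_y = h_final[1] / dl_final[1]
print(ratio_x, ratio_y)
```

The two component ratios remain equal, so the separation vector stays parallel to H and their lengths stay in the same ratio, as stated above for ρ = constant.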


§66. Dissipative processes in magnetohydrodynamics 

In ordinary fluid dynamics the dissipative processes are governed by three quantities,
namely two coefficients of viscosity and the thermal conductivity. In magnetohydrodynamics
the number is considerably greater, both because new electrical quantities occur
and because there is at each point a distinctive direction, that of $\mathbf{H}$, which makes the fluid
no longer isotropic. We shall, however, take only the simple case where all the kinetic
coefficients may be regarded as constant throughout the medium, and in particular as
being independent of the magnitude and direction of the magnetic field. There is then only
one quantity, the electrical conductivity $\sigma$, to be added to the usual viscosity coefficients $\eta$
and $\zeta$ and the thermal conductivity $\kappa$.†

The assumption that the kinetic coefficients are independent of the magnetic field
implies that certain conditions are satisfied which considerably restrict the range of validity
of the equations in comparison with that of the equations of magnetohydrodynamics
for an ideal fluid. The mean free path of the current-carriers must be much less
than the radius of curvature of their orbits in the magnetic field; that is, the collision
frequency must be large compared with the Larmor frequency of the carriers. This
condition is not satisfied in a highly rarefied medium or in a strong magnetic field.‡

When viscosity and electrical conduction are taken into account, equation (65.2) is 
replaced by the complete equation (63.7): 

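$$ \partial\mathbf{H}/\partial t = \operatorname{curl}(\mathbf{v}\times\mathbf{H}) + (c^2/4\pi\sigma)\,\Delta\mathbf{H}, \tag{66.1} $$

and equation (65.4) by the (complete) Navier-Stokes equation

$$ \frac{\partial\mathbf{v}}{\partial t} + (\mathbf{v}\cdot\operatorname{grad})\mathbf{v} = -\frac{1}{\rho}\operatorname{grad}P + \frac{\eta}{\rho}\,\Delta\mathbf{v} + \frac{1}{\rho}\left(\zeta + \tfrac{1}{3}\eta\right)\operatorname{grad}\operatorname{div}\mathbf{v} - \frac{1}{4\pi\rho}\,\mathbf{H}\times\operatorname{curl}\mathbf{H}. \tag{66.2} $$

Equation (66.1) does not involve the viscosity, and so the “freezing” of the lines of force as
$\sigma\to\infty$ continues to occur even in a viscous perfectly conducting fluid.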

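† The relation between the current and the electric field in a thermodynamically non-uniform medium
isotropic at every point involves also the thermoelectric coefficient $\alpha$ (§26). However, if this coefficient is constant
it does not appear in the equations of motion.

‡ The problem of the equations of magnetohydrodynamics in a plasma where these conditions are not satisfied
is dealt with elsewhere; see PK, §§58, 59.

The adiabatic equation (65.6) is replaced by the equation of heat transfer. In ordinary
fluid dynamics the latter is

$$ \rho T\left(\frac{\partial s}{\partial t} + \mathbf{v}\cdot\operatorname{grad}s\right) = \sigma'_{ik}\frac{\partial v_i}{\partial x_k} + \operatorname{div}(\kappa\operatorname{grad}T); $$

see §49. The left-hand side of the equation is the quantity of heat generated per unit
time and volume in a moving fluid particle. The right-hand side is the energy dissipated per
unit time and volume. The first term is due to viscosity; $\sigma'_{ik}$ is the viscous stress tensor:

$$ \sigma'_{ik} = \eta\left(\frac{\partial v_i}{\partial x_k} + \frac{\partial v_k}{\partial x_i} - \tfrac{2}{3}\delta_{ik}\operatorname{div}\mathbf{v}\right) + \zeta\delta_{ik}\operatorname{div}\mathbf{v}. $$

The second term gives the dissipation due to thermal conduction. In a conducting fluid, a
term giving the Joule heat must be added. The rate of evolution of this heat per unit volume
is $j^2/\sigma = (c^2/16\pi^2\sigma)(\operatorname{curl}\mathbf{H})^2$. The equation of heat transfer in magnetohydrodynamics
is therefore

$$ \rho T\left(\frac{\partial s}{\partial t} + \mathbf{v}\cdot\operatorname{grad}s\right) = \sigma'_{ik}\frac{\partial v_i}{\partial x_k} + \kappa\,\Delta T + \frac{c^2}{16\pi^2\sigma}(\operatorname{curl}\mathbf{H})^2. \tag{66.3} $$

The momentum flux density tensor contains in addition the viscous stress tensor:

$$ \Pi_{ik} = \rho v_i v_k + P\delta_{ik} - \sigma'_{ik} - \frac{1}{4\pi}\left(H_i H_k - \tfrac{1}{2}H^2\delta_{ik}\right). \tag{66.4} $$

The heat flux density is now

$$ \mathbf{q} = \rho\mathbf{v}(\tfrac{1}{2}v^2 + w) - (\mathbf{v}\cdot\sigma') - \kappa\operatorname{grad}T + \frac{1}{4\pi}\,\mathbf{H}\times(\mathbf{v}\times\mathbf{H}) - \frac{c^2}{16\pi^2\sigma}\,\mathbf{H}\times\operatorname{curl}\mathbf{H}, \tag{66.5} $$

where $(\mathbf{v}\cdot\sigma')$ is a vector whose components are $v_i\sigma'_{ik}$. This includes extra terms due to
viscosity, thermal conduction and electrical conduction, the last of which is obtained by
substituting in the Poynting vector the field $\mathbf{E}$ from (63.2):

$$ \mathbf{E} = \mathbf{j}/\sigma - \mathbf{v}\times\mathbf{H}/c = (c/4\pi\sigma)\operatorname{curl}\mathbf{H} - \mathbf{v}\times\mathbf{H}/c. \tag{66.6} $$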

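The equations are somewhat simplified if the moving fluid can be supposed incompressible.
The equation of continuity then reduces to $\operatorname{div}\mathbf{v} = 0$, and in equation (66.2) the
penultimate term is zero. We shall write out here the appropriate system of equations (in
equations (66.1) and (66.2) it is convenient to transform the terms $\operatorname{curl}(\mathbf{v}\times\mathbf{H})$ and
$\mathbf{H}\times\operatorname{curl}\mathbf{H}$ by means of the appropriate formulae of vector analysis):

$$ \operatorname{div}\mathbf{H} = 0, \qquad \operatorname{div}\mathbf{v} = 0, \tag{66.7} $$

$$ \partial\mathbf{H}/\partial t + (\mathbf{v}\cdot\operatorname{grad})\mathbf{H} = (\mathbf{H}\cdot\operatorname{grad})\mathbf{v} + (c^2/4\pi\sigma)\,\Delta\mathbf{H}, \tag{66.8} $$

$$ \frac{\partial\mathbf{v}}{\partial t} + (\mathbf{v}\cdot\operatorname{grad})\mathbf{v} = -\frac{1}{\rho}\operatorname{grad}\left(P + \frac{H^2}{8\pi}\right) + \frac{1}{4\pi\rho}(\mathbf{H}\cdot\operatorname{grad})\mathbf{H} + \nu\,\Delta\mathbf{v}, \tag{66.9} $$

where $\nu = \eta/\rho$ is the kinematic viscosity. Equation (66.3) is not needed in solving the
problem of incompressible flow unless we are interested in the temperature distribution.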

In ordinary fluid dynamics, the Reynolds number describes the role of viscosity terms in
the equations of motion, in relation to the convection terms: $R = ul/\nu$, where $l$ and $u \sim l/\tau$
are the characteristic parameters of length and velocity for a particular motion of the fluid.
In magnetohydrodynamics, we can supplement this by the magnetic Reynolds number

$$ R_m = ul/\nu_m, \qquad \nu_m = c^2/4\pi\sigma, \tag{66.10} $$

which describes the role of the conduction term in equation (66.1). This term is analogous
to the term $\nu\,\Delta\mathbf{v}$ in the Navier-Stokes equation, and $\nu_m$ acts as a “diffusion coefficient” of
the magnetic field. When $R_m \gg 1$, this term may prove to be negligible. However, there is no
general answer to the question of when in fact dissipative processes in the fluid may be
neglected, since the relevant conditions depend greatly on the nature of the motion and are,
for instance, completely different for steady and non-steady flows.
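
As a rough numerical illustration of the definition (66.10), the sketch below evaluates the magnetic diffusion coefficient and the magnetic Reynolds number in Gaussian units. The conductivity, velocity and length used here are illustrative orders of magnitude chosen for the example (they are assumptions, not data from the text), and the function names are hypothetical.

```python
import math

C = 2.998e10  # speed of light, cm/s


def magnetic_diffusivity(sigma):
    """nu_m = c^2 / (4 pi sigma), in cm^2/s, with sigma in s^-1 (Gaussian units)."""
    return C * C / (4.0 * math.pi * sigma)


def magnetic_reynolds(u, length, sigma):
    """R_m = u l / nu_m for a characteristic velocity u (cm/s) and length l (cm)."""
    return u * length / magnetic_diffusivity(sigma)


# Illustrative laboratory-scale values (assumed): sigma = 1e16 s^-1, u = 10 cm/s, l = 10 cm.
nu_m = magnetic_diffusivity(1.0e16)
r_m = magnetic_reynolds(10.0, 10.0, 1.0e16)
print(nu_m, r_m)
```

With these assumed values $R_m$ is small compared with unity, which corresponds to the poorly conducting limit; much larger $l$ or $u$ would be needed to approach the frozen-in regime.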

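In the opposite limiting case of a poorly conducting fluid, $R_m \ll 1$, the equations of
magnetohydrodynamics can be considerably simplified (S. I. Braginskiĭ, 1959). The reason
is that in this case the magnetic field is only slightly perturbed by the motion of the fluid. If
the unperturbed field $\boldsymbol{\mathfrak{H}}$ is independent of time (as we shall assume), the change $\mathbf{H}'$ in this
field in the moving fluid may be estimated by comparing the two terms on the right of
(66.1): $\operatorname{curl}(\mathbf{v}\times\boldsymbol{\mathfrak{H}}) \sim \nu_m\,\Delta\mathbf{H}'$, whence $H' \sim R_m\mathfrak{H}$, and when $R_m \ll 1$ we in fact have
$H' \ll \mathfrak{H}$. Neglecting this change, we can suppose that the magnetic field $\mathbf{H}$ is the same as the
field $\boldsymbol{\mathfrak{H}}$ that would result from the external sources in vacuum. Since $\boldsymbol{\mathfrak{H}}$ is constant,
$\operatorname{curl}\mathbf{E} = -(1/c)\,\partial\boldsymbol{\mathfrak{H}}/\partial t = 0$, i.e. the electric field has a potential: $\mathbf{E} = -\operatorname{grad}\phi$. An equation for
the potential $\phi$ can be derived from $\operatorname{div}\mathbf{j} = 0$, which is satisfied identically when the
displacement current is neglected (i.e. because of the relation $\operatorname{curl}\mathbf{H} = 4\pi\mathbf{j}/c$). Substituting
the current density $\mathbf{j} = \sigma(-\operatorname{grad}\phi + \mathbf{v}\times\boldsymbol{\mathfrak{H}}/c)$ and noting that $\operatorname{curl}\boldsymbol{\mathfrak{H}} = 0$ for the
unperturbed field, we find (if $\sigma$ = constant)

$$ \Delta\phi = \boldsymbol{\mathfrak{H}}\cdot\operatorname{curl}\mathbf{v}/c. \tag{66.11} $$

The second equation is the Navier-Stokes equation

$$ \frac{\partial\mathbf{v}}{\partial t} + (\mathbf{v}\cdot\operatorname{grad})\mathbf{v} = -\frac{1}{\rho}\operatorname{grad}P + \nu\,\Delta\mathbf{v} + \frac{\mathbf{f}}{\rho} \tag{66.12} $$

(for an incompressible fluid), in which the volume density of external forces is

$$ \mathbf{f} = \mathbf{j}\times\boldsymbol{\mathfrak{H}}/c = (\sigma/c)\,[\boldsymbol{\mathfrak{H}}\times\operatorname{grad}\phi + \boldsymbol{\mathfrak{H}}\times(\boldsymbol{\mathfrak{H}}\times\mathbf{v})/c]. \tag{66.13} $$

Equations (66.11)-(66.13) are the appropriate ones for this case.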

§67. Magnetohydrodynamic flow between parallel planes 

An instructive example of the magnetohydrodynamic flow of a viscous conducting fluid
occurs in steady flow in the space between two parallel solid planes when a uniform
magnetic field $\boldsymbol{\mathfrak{H}}$ is applied perpendicular to the planes (J. Hartmann, 1937). This is the
simplest analogue of Poiseuille flow in ordinary fluid dynamics.

It is natural to assume that the velocity of the fluid is everywhere in the same direction
(which we take as the $x$-direction), and depends only on the coordinate $z$ (whose direction is
perpendicular to the planes). The same is true of the longitudinal field $H_x$ which results
from the movement of the fluid. The pressure $P$, however, depends on $x$ also, because there
must be a pressure gradient in the direction of the motion in order to maintain a steady
flow. The equation $\operatorname{div}\mathbf{v} = 0$ is satisfied identically, and from $\operatorname{div}\mathbf{H} = 0$ it follows that
$H_z$ = constant = $\mathfrak{H}$. The $z$-component of (66.9) shows that $P + H_x^2/8\pi$ is a function of $x$
only. Since $H_x$ is independent of $x$, the pressure gradient $\partial P/\partial x$ might be a function of $x$
alone, but actually (because of the uniformity in the $x$-direction) it is a constant $-\Delta P/l$
(where $\Delta P$ is the pressure drop over a distance $l$).


The $x$-components of (66.8) and (66.9) give

$$ \mathfrak{H}\frac{dv}{dz} + \frac{c^2}{4\pi\sigma}\frac{d^2H_x}{dz^2} = 0, \tag{67.1} $$

$$ \eta\frac{d^2v}{dz^2} + \frac{\mathfrak{H}}{4\pi}\frac{dH_x}{dz} = \text{constant} = -\Delta P/l. \tag{67.2} $$

The boundary conditions on the solid surfaces are that the velocity of the viscous fluid be
zero and that the tangential component of the magnetic field be continuous:

$$ v = 0, \qquad H_x = 0 \qquad \text{for } z = \pm a, $$

where $2a$ is the distance between the solid planes and $z = 0$ lies half-way between them. The
solution of (67.1) and (67.2) which satisfies these conditions is

$$ v = v_0\,\frac{\cosh(a/\delta) - \cosh(z/\delta)}{\cosh(a/\delta) - 1}, \qquad \delta = \frac{c}{\mathfrak{H}}\sqrt{\frac{\eta}{\sigma}}, $$

$$ H_x = -\frac{4\pi}{c}\,v_0\sqrt{\sigma\eta}\;\frac{(z/a)\sinh(a/\delta) - \sinh(z/\delta)}{\cosh(a/\delta) - 1}. \tag{67.3} $$

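The constant $v_0$ is the velocity of the fluid in the medial plane $z = 0$. Its relation to the
pressure gradient may be found by substituting (67.3) in (67.2). The velocity averaged over
the cross-section is

$$ \bar{v} = \frac{1}{2a}\int_{-a}^{a} v\,dz = \frac{\Delta P}{l}\,\frac{ac}{\mathfrak{H}\sqrt{\sigma\eta}}\left[\coth\frac{a}{\delta} - \frac{\delta}{a}\right]. \tag{67.4} $$

The effect of the magnetic field on the flow, in comparison with that of the viscosity, is
described by the quantity

$$ G = a/\delta = (a\mathfrak{H}/c)\sqrt{\sigma/\eta}, \tag{67.5} $$

called the Hartmann number. When $G \ll 1$ we have

$$ v = v_0(1 - z^2/a^2), \qquad \bar{v} = (\Delta P/l)\,a^2/3\eta, \tag{67.6} $$

that is, ordinary Poiseuille flow. When $G \gg 1$,

$$ v = v_0\left[1 - e^{-(a - |z|)/\delta}\right], \qquad \bar{v} = v_0 = \frac{\Delta P}{l}\,\frac{ac}{\mathfrak{H}\sqrt{\sigma\eta}}. \tag{67.7} $$

When the magnetic field increases, the velocity profile is flattened over the greater part of
the cross-section, and the mean velocity is reduced (for a given pressure gradient); the
decrease in the velocity occurs mainly in layers of thickness $\sim\delta$ near the planes.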
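
The flattening of the velocity profile with increasing Hartmann number can be seen from a short numerical sketch. It evaluates the profile v = v₀[cosh(a/δ) − cosh(z/δ)]/[cosh(a/δ) − 1] and its average over the cross-section by a midpoint rule; the units (a = 1, v₀ = 1), the two values of G = a/δ and the function names are assumptions made for the illustration.

```python
import math


def profile(z, a, delta, v0=1.0):
    """Velocity at height z between planes at z = -a and z = +a."""
    top = math.cosh(a / delta) - math.cosh(z / delta)
    return v0 * top / (math.cosh(a / delta) - 1.0)


def mean_velocity(a, delta, v0=1.0, n=20000):
    """Average of the profile over -a <= z <= a (midpoint rule)."""
    dz = 2.0 * a / n
    total = sum(profile(-a + (i + 0.5) * dz, a, delta, v0) for i in range(n))
    return total * dz / (2.0 * a)


a = 1.0
mean_weak_field = mean_velocity(a, a / 0.01)    # G = 0.01: nearly parabolic profile
mean_strong_field = mean_velocity(a, a / 50.0)  # G = 50: flat core, thin boundary layers
print(mean_weak_field, mean_strong_field)
```

For small G the mean is close to 2v₀/3, the value for a parabolic profile; for large G it approaches v₀, the difference being of order 1/G because the velocity falls off only in layers of thickness of order δ.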

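The motion of the fluid gives rise to an electric field $E_y$ in the $y$-direction. Since the
motion is steady, $\operatorname{curl}\mathbf{E} = 0$, and hence $E_y$ = constant. The current density in the fluid
is $j_y = \sigma(E_y - v\mathfrak{H}/c)$. The total current through a cross-section of the fluid is zero: since also
$j_y = (c/4\pi)(\operatorname{curl}\mathbf{H})_y$, we have

$$ \int_{-a}^{a} j_y\,dz = \frac{c}{4\pi}\Big[H_x\Big]_{-a}^{a} = 0. $$

Hence

$$ \int_{-a}^{a} j_y\,dz = 2a\sigma E_y - \frac{\sigma\mathfrak{H}}{c}\int_{-a}^{a} v\,dz = 0, $$

and so

$$ E_y = \bar{v}\,\mathfrak{H}/c. \tag{67.8} $$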

§68. Equilibrium configurations 

The equilibrium of a perfectly conducting fluid (referred to here for clarity as a plasma) 
at rest in a constant magnetic field is described by the equations 

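$$ \operatorname{grad} P = \mathbf j\times\mathbf H/c, \tag{68.1} $$

$$ \mathbf j = c\,\operatorname{curl}\mathbf H/4\pi, \tag{68.2} $$

$$ \operatorname{div}\mathbf H = 0. \tag{68.3} $$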


The first of these is equation (65.4) with v = 0 and written in terms of the electric current 
density, which is related to the magnetic field by Maxwell’s equation (68.2). In the present 
section, we shall consider some general properties of equilibrium configurations derivable 
from these equations, but without entering into the complex and manifold problems of the 
stability of such configurations.†

Scalar multiplication of (68.1) by H and by j gives

$$ (\mathbf H\cdot\operatorname{grad})P = 0, \qquad (\mathbf j\cdot\operatorname{grad})P = 0; \tag{68.4} $$

that is, the pressure gradient is zero along the lines of magnetic force and along the current
lines. Thus both sets of lines lie on surfaces

$$ P(x, y, z) = \text{constant}, \tag{68.5} $$

called magnetic surfaces. In principle, every magnetic surface could be the boundary of an
equilibrium configuration.‡

The equilibrium equations (68.1), (68.2) can also be put in the form

$$ \partial\Pi_{ik}/\partial x_k = 0, \qquad \Pi_{ik} = P\delta_{ik} - (H_iH_k - \tfrac12 H^2\delta_{ik})/4\pi, \tag{68.6} $$

if we start from the equation of motion in the momentum conservation form (65.7).
We multiply this equation by $x_i$ and integrate over some volume bounded by a closed
surface.

† The main results on this subject (within the scope of magnetohydrodynamics) are given by B. B. Kadomtsev,
Reviews of Plasma Physics 2, 153, 1966.

‡ The pressure P is determined by equations (68.1) and (68.2) only to within an arbitrary additive constant.
Hence any of the magnetic surfaces can be P = 0.

Integration by parts gives,† since $\partial x_i/\partial x_k = \delta_{ik}$,

$$ \int\Pi_{ii}\,dV = \oint\Pi_{ik}x_i\,df_k. \tag{68.7} $$

On substitution of $\Pi_{ik}$ from (68.6), this equation becomes

$$ \int(3P + H^2/8\pi)\,dV = \oint\{(P + H^2/8\pi)\mathbf r - (\mathbf H\cdot\mathbf r)\mathbf H/4\pi\}\cdot d\mathbf f \tag{68.8} $$

(S. Chandrasekhar and E. Fermi, 1953).

Let the plasma occupy some finite volume outside which P = 0, and let there be no fixed 
field sources (rigid current-carrying conductors) outside it. Then the field far from the 
plasma decreases as $1/r^3$, and, if the integration is taken over all space, the surface integral
is zero. The integral of the positive quantity $3P + H^2/8\pi$ cannot, however, be zero. It is
therefore not possible for an equilibrium configuration bounded in space to exist without 
being maintained by a magnetic field from external sources; when such sources are present, 
the right-hand side of (68.8) becomes an integral over their surfaces, and the condition can 
in principle be satisfied (V. D. Shafranov, 1957). 

Let us consider the simplest unbounded configuration, a cylindrical plasma pinch of
unlimited length and uniform along its length. In cylindrical polar coordinates r, φ, z, with
the z-axis along the pinch, all quantities depend on the radial coordinate r only. The radial
component $H_r$ must be zero, since otherwise it would become infinite as $r \to 0$, in
accordance with the equations

$$ \operatorname{div}\mathbf H = \frac{1}{r}\frac{d}{dr}(rH_r) = 0, \qquad H_r = \text{constant}/r. $$

The same is true of $j_r$, because of the equation div j = 0 which necessarily follows from
(68.2).

The components of equation (68.2) give

$$ j_\varphi = -\frac{c}{4\pi}\frac{dH_z}{dr}, \qquad j_z = \frac{c}{4\pi r}\frac{d}{dr}(rH_\varphi). $$

From the second of these,

$$ H_\varphi = 2J(r)/cr, \qquad J(r) = \int_0^r j_z\cdot 2\pi r\,dr. \tag{68.9} $$

Then equation (68.1) becomes

$$ \frac{dP}{dr} = -\frac{1}{2\pi c^2r^2}\frac{dJ^2(r)}{dr} - \frac{1}{8\pi}\frac{dH_z^2}{dr}. \tag{68.10} $$

† This derivation is similar to the familiar derivation of the virial theorem (Fields, §34).

Two particular cases can occur here which are significantly different. In one, the z pinch,
$H_z = 0$ and $j_\varphi = 0$. Multiplying (68.10) by $r^2$ and integrating with respect to r from zero to
the pinch radius a, with the boundary condition P(a) = 0, we find as the equilibrium
condition

$$ \int_0^a P(r)\cdot 2\pi r\,dr = J^2(a)/2c^2, \tag{68.11} $$

where J(a) is the total current through the pinch (W. Bennett, 1934). The equilibrium
configuration is here contained by the field of the longitudinal current.
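The z-pinch condition, that the integral of $P\cdot 2\pi r$ over the cross-section equals $J^2(a)/2c^2$, can be checked on a simple case. The sketch below assumes a uniform longitudinal current density, so that $J(r) = J_a r^2/a^2$; integrating $dP/dr = -(1/2\pi c^2r^2)\,dJ^2/dr$ with P(a) = 0 then gives $P(r) = J_a^2(a^2 - r^2)/\pi c^2a^4$. The uniform-current assumption, the function name `bennett_check` and the numerical values are illustrative only.

```python
import math


def bennett_check(J_a=3.0, a=2.0, c=1.0, n=4000):
    """Return (integral of P * 2 pi r dr over 0..a, J_a^2 / 2 c^2)."""

    def pressure(r):
        # Pressure of a z pinch carrying a uniform current density, P(a) = 0.
        return J_a ** 2 * (a ** 2 - r ** 2) / (math.pi * c ** 2 * a ** 4)

    h = a / n
    lhs = 0.0
    for i in range(n):
        r = (i + 0.5) * h                      # midpoint of the i-th radial cell
        lhs += pressure(r) * 2.0 * math.pi * r * h
    rhs = J_a ** 2 / (2.0 * c ** 2)
    return lhs, rhs


if __name__ == "__main__":
    print(bennett_check())
```

The two returned numbers should agree to within the quadrature error, whatever values of the total current, radius and c are supplied.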

In the other case, the theta pinch,† $H_\varphi = 0$ and $j_z = 0$. In this case, we have from (68.10)

$$ P + H_z^2/8\pi = \mathfrak{H}^2/8\pi, \tag{68.12} $$

where $\mathfrak{H}$ is the longitudinal magnetic field outside the pinch. Here the plasma is contained
by the external longitudinal field.

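In an arbitrary space-bounded axially symmetric configuration, the radial components
$H_r$ and $j_r$ may be non-zero (a toroidal configuration). Moreover, all quantities may then
depend on z as well as on r.

Equations (68.1)-(68.3), written in components, are

$$ j_\varphi H_z - j_zH_\varphi = c\,\partial P/\partial r, \qquad j_rH_\varphi - j_\varphi H_r = c\,\partial P/\partial z, \qquad j_zH_r - j_rH_z = 0, \tag{68.13} $$

$$ j_r = -\frac{c}{4\pi}\frac{\partial H_\varphi}{\partial z}, \qquad j_z = \frac{c}{4\pi r}\frac{\partial}{\partial r}(rH_\varphi), \qquad j_\varphi = \frac{c}{4\pi}\left(\frac{\partial H_r}{\partial z} - \frac{\partial H_z}{\partial r}\right), \tag{68.14} $$

$$ \frac{1}{r}\frac{\partial}{\partial r}(rH_r) + \frac{\partial H_z}{\partial z} = 0. \tag{68.15} $$

A consequence of these equations, which is already evident from the vector form (68.1), is
that if the current density distribution is azimuthal ($j_r = j_z = 0$, $j_\varphi \ne 0$) the magnetic field is
meridional ($H_\varphi = 0$). If the magnetic field is azimuthal, a stronger conclusion is possible:
not only is the current density meridional, but the whole equilibrium configuration must be
a z pinch ($j_r = 0$, and independent of z), as is easily seen by eliminating P from the first
two equations (68.13) and then using the other equations.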

Equations (68.13)-(68.15) can be reduced to a single equation in the following way
(V. D. Shafranov, 1957; H. Grad, 1958). We define the quantities

$$ \psi(r, z) = \int_0^r H_z\cdot 2\pi r\,dr, \qquad J(r, z) = \int_0^r j_z\cdot 2\pi r\,dr, \tag{68.16} $$

the magnetic flux and total current through a circle of radius r perpendicular to the z-axis.
From these definitions and the equations div H = 0, div j = 0, we find the meridional
components of the field and the current density:

$$ H_r = -\frac{1}{2\pi r}\frac{\partial\psi}{\partial z}, \qquad H_z = \frac{1}{2\pi r}\frac{\partial\psi}{\partial r}, \qquad j_r = -\frac{1}{2\pi r}\frac{\partial J}{\partial z}, \qquad j_z = \frac{1}{2\pi r}\frac{\partial J}{\partial r}. \tag{68.17} $$

These expressions show that the gradients of ψ and J are orthogonal to the line of magnetic
force and the current line respectively. In view of the discussion of the surfaces (68.5) at the
beginning of this section, we can conclude that ψ and J are constant on the magnetic
surfaces, and therefore any two of ψ, J and P can be expressed as functions of the third.
In particular,

$$ P = P(\psi), \qquad J = J(\psi). \tag{68.18} $$

† The name refers to the angle in cylindrical polar coordinates, which is often denoted by θ.

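The azimuthal components of the field and the current are expressed in terms of ψ and J by
means of equations (68.14):

$$ H_\varphi = \frac{2J}{cr}, \qquad j_\varphi = -\frac{c}{8\pi^2r}\left(\frac{\partial^2\psi}{\partial r^2} - \frac{1}{r}\frac{\partial\psi}{\partial r} + \frac{\partial^2\psi}{\partial z^2}\right). \tag{68.19} $$

Finally, substituting these results in the first equation (68.13), we find

$$ \frac{\partial^2\psi}{\partial r^2} - \frac{1}{r}\frac{\partial\psi}{\partial r} + \frac{\partial^2\psi}{\partial z^2} = -16\pi^3r^2\frac{dP}{d\psi} - \frac{8\pi^2}{c^2}\frac{dJ^2}{d\psi}. \tag{68.20} $$

On specifying any particular functions P(ψ) and J(ψ) and solving this equation, we
obtain some equilibrium configuration that is in principle possible; the field and current
distributions in this configuration are given by (68.17) and (68.19), and the magnetic
surfaces are ψ(r, z) = constant.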

As an illustration, the expression

$$ \psi/\psi_0 = \tfrac12(bR^2 + r^2)z^2 + \tfrac18(a - 1)(r^2 - R^2)^2 \tag{68.21} $$

is a solution of (68.20) with dP/dψ = constant and dJ²/dψ = constant; $\psi_0$, a, b and R
are constants, with

$$ 16\pi^3\,dP/d\psi = -a\psi_0, \qquad (8\pi^2/c^2)\,dJ^2/d\psi = -bR^2\psi_0. $$

This solution describes a toroidal configuration consisting of nested toroidal magnetic
surfaces ψ = constant, any of which may be taken as the plasma boundary P = 0. The
innermost surface degenerates to a curve, the circle r = R, z = 0, called the magnetic axis.
Near the magnetic axis,

$$ \psi/\psi_0 = \tfrac12(b + 1)R^2z^2 + \tfrac12(a - 1)R^2(r - R)^2. $$

Thus, if b + 1 > 0 and a > 1, the cross-sections of the magnetic surfaces near the axis are
ellipses. With increasing distance from the axis, ψ increases and the pressure falls. Outside
the surface P = 0 (the plasma boundary), the magnetic field needed to maintain
equilibrium is given by equation (68.20) with zero on the right-hand side and the boundary
conditions that ψ and its normal derivative be continuous.
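That a polynomial flux function of this kind satisfies the equilibrium equation can be checked by finite differences. The sketch below assumes the form $\psi/\psi_0 = \tfrac12(bR^2 + r^2)z^2 + \tfrac18(a-1)(r^2 - R^2)^2$ and compares the operator $\partial^2\psi/\partial r^2 - (1/r)\partial\psi/\partial r + \partial^2\psi/\partial z^2$ with the right-hand side, which for constant dP/dψ and dJ²/dψ normalized as in the text reduces to $\psi_0(ar^2 + bR^2)$. The function names, the step size and the parameter values are hypothetical choices for this illustration.

```python
def psi(r, z, a=2.0, b=0.5, R=1.0, psi0=1.0):
    """Polynomial flux function with constant dP/dpsi and dJ^2/dpsi."""
    return psi0 * (0.5 * (b * R ** 2 + r ** 2) * z ** 2
                   + 0.125 * (a - 1.0) * (r ** 2 - R ** 2) ** 2)


def gs_operator(f, r, z, h=1e-3):
    """Central-difference estimate of f_rr - f_r / r + f_zz at (r, z)."""
    d2r = (f(r + h, z) - 2.0 * f(r, z) + f(r - h, z)) / h ** 2
    dr = (f(r + h, z) - f(r - h, z)) / (2.0 * h)
    d2z = (f(r, z + h) - 2.0 * f(r, z) + f(r, z - h)) / h ** 2
    return d2r - dr / r + d2z


def gs_rhs(r, a=2.0, b=0.5, R=1.0, psi0=1.0):
    """Right-hand side -16 pi^3 r^2 dP/dpsi - (8 pi^2/c^2) dJ^2/dpsi."""
    return psi0 * (a * r ** 2 + b * R ** 2)


if __name__ == "__main__":
    for r, z in ((1.3, 0.4), (0.8, -0.3), (1.0, 0.0)):
        print(r, z, gs_operator(psi, r, z), gs_rhs(r))
```

The two printed columns should agree up to the truncation error of the difference scheme, which is of order h².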


§69. Hydromagnetic waves

Let us consider the propagation of small disturbances in a homogeneous conducting
medium in a uniform constant magnetic field $\mathbf H_0$. We shall assume that the fluid is ideal, i.e.
neglect all dissipation processes in it.†

† The condition for this approximation to be valid is that the wave damping coefficient (calculated in the
Problem at the end of this section) should be small.

We start from the equations of magnetohydrodynamics, (65.1)-(65.4). The adiabaticity
equation (65.6) signifies only that, if the unperturbed medium is homogeneous,
s = constant in the perturbed medium also, i.e. the flow is isentropic.

We write

$$ \mathbf H = \mathbf H_0 + \mathbf h, \qquad \rho = \rho_0 + \rho', \qquad P = P_0 + P', $$

where the suffix 0 denotes the constant equilibrium value, and h, ρ' and P' are the small
variations in the wave. The velocity v, which is zero in equilibrium, is a small quantity of the
same order. Since the flow is isentropic, the changes in pressure and density are related by
$P' = u_0^2\rho'$, where $u_0^2 = (\partial P/\partial\rho)_s$ is the square of the ordinary velocity of sound in the
medium concerned. Neglecting terms of higher order than the first in equations
(65.1)-(65.4), we obtain the linear equations

$$ \operatorname{div}\mathbf h = 0, \qquad \partial\mathbf h/\partial t = \operatorname{curl}(\mathbf v\times\mathbf H), $$

$$ \partial\rho'/\partial t + \rho\operatorname{div}\mathbf v = 0, $$

$$ \partial\mathbf v/\partial t = -(u_0^2/\rho)\operatorname{grad}\rho' - (\mathbf H\times\operatorname{curl}\mathbf h)/4\pi\rho. \tag{69.1} $$

Here and in what follows we omit, for brevity, the suffix zero to the equilibrium values.

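We shall seek solutions of these equations as plane waves proportional to
$\exp[i(\mathbf k\cdot\mathbf r - \omega t)]$. The system of equations (69.1) then gives the algebraic equations

$$ -\omega\mathbf h = \mathbf k\times(\mathbf v\times\mathbf H), \qquad \omega\rho' = \rho\,\mathbf k\cdot\mathbf v, $$

$$ -\omega\mathbf v + (u_0^2/\rho)\rho'\mathbf k = -\mathbf H\times(\mathbf k\times\mathbf h)/4\pi\rho; \tag{69.2} $$

the equation k · h = 0, which follows from div h = 0, is automatically satisfied and need not
be considered separately.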

The first of these equations shows that the vector h is perpendicular to the wave vector k,
which we shall take to be along the x-axis, with the plane of k and H as the xy-plane. We
also introduce the phase velocity of the wave, u = ω/k. Eliminating ρ' from the third
equation by means of the second equation, and rewriting the result in components, we have

$$ uh_z = -v_zH_x, \qquad uv_z = -H_xh_z/4\pi\rho, \tag{69.3} $$

$$ uh_y = v_xH_y - v_yH_x, \qquad uv_y = -H_xh_y/4\pi\rho, \qquad uv_x(1 - u_0^2/u^2) = H_yh_y/4\pi\rho. \tag{69.4} $$

We have here separated the equations into two groups, the first involving only $h_z$ and $v_z$,
and the second only $h_y$, $v_x$ and $v_y$. It therefore follows that perturbations of the two groups
of variables are propagated independently. The density, and therefore the pressure, is
propagated with the perturbations $h_y$, $v_x$ and $v_y$, being related to $v_x$ by

$$ \rho' = \rho v_x/u. \tag{69.5} $$

The compatibility condition for the two equations (69.3) is

$$ u = |H_x|/\sqrt{4\pi\rho} \equiv u_A; \tag{69.6} $$

we shall assume that $H_x > 0$, and omit the modulus sign. In these waves the component $h_z$
of the magnetic field which is perpendicular to the directions of propagation and of the
constant field H oscillates, and with it the velocity $v_z$, which is related to $h_z$ by

$$ v_z = -h_z/\sqrt{4\pi\rho}. \tag{69.7} $$

The relation between ω and k (the dispersion relation, as it is called) given by (69.6)
involves the direction of the wave vector:

$$ \omega = \mathbf H\cdot\mathbf k/\sqrt{4\pi\rho}. \tag{69.8} $$

The physical velocity of propagation of the waves is called the group velocity and is given by
the derivative ∂ω/∂k. In the present case we have

$$ \partial\omega/\partial\mathbf k = \mathbf H/\sqrt{4\pi\rho}, \tag{69.9} $$

which does not involve the direction of k. The direction of propagation of the wave, in the
sense of the direction of its group velocity, is the direction of H. These are called Alfvén
waves (H. Alfvén, 1942), and (69.6) is the Alfvén velocity.

Let us now consider waves described by the equations (69.4), called magnetosonic waves.
Equating to zero the determinant of these equations, we find an equation quadratic in $u^2$,
whose roots are

$$ u_{f,s}^2 = \frac12\left\{\frac{H^2}{4\pi\rho} + u_0^2 \pm \left[\left(\frac{H^2}{4\pi\rho} + u_0^2\right)^2 - \frac{H_x^2u_0^2}{\pi\rho}\right]^{1/2}\right\}. \tag{69.10} $$

We thus obtain two further types of wave. The waves corresponding to the plus and minus
signs in (69.10) are called respectively fast and slow magnetosonic waves.
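The quadratic equation mentioned here can be written out explicitly. The following is a sketch of the elimination for the group of variables $h_y$, $v_x$, $v_y$, starting from the three relations $uh_y = v_xH_y - v_yH_x$, $uv_y = -H_xh_y/4\pi\rho$ and $uv_x(1 - u_0^2/u^2) = H_yh_y/4\pi\rho$; it assumes only that $u \ne 0$ and $u^2 \ne u_0^2$, so that the divisions are allowed.

```latex
% Express v_y and v_x through h_y by the second and third relations:
v_y = -\frac{H_x h_y}{4\pi\rho\,u}, \qquad
v_x = \frac{H_y h_y\,u}{4\pi\rho\,(u^2 - u_0^2)} .

% Substitute in u h_y = v_x H_y - v_y H_x, cancel h_y,
% and multiply by u (u^2 - u_0^2):
u^2 (u^2 - u_0^2) = \frac{H_y^2}{4\pi\rho}\,u^2
                  + \frac{H_x^2}{4\pi\rho}\,(u^2 - u_0^2) .

% With H^2 = H_x^2 + H_y^2 this is a quadratic in u^2:
u^4 - \Bigl(u_0^2 + \frac{H^2}{4\pi\rho}\Bigr) u^2
    + \frac{u_0^2 H_x^2}{4\pi\rho} = 0 .
```

The product of the two roots is $u_0^2H_x^2/4\pi\rho$ and their sum is $u_0^2 + H^2/4\pi\rho$, which is a convenient check on the limiting forms of the fast and slow velocities.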

In the limiting case where $H^2 \ll 4\pi\rho u_0^2$ we have $u_f = u_0$, and it follows from equations
(69.4) that $v_y \ll v_x$. In other words, in the limit the fast magnetosonic waves become
ordinary sound waves propagated with velocity $u_0$. The weak transverse field in the wave is
related to $v_x$ by $h_y = v_xH_y/u_0$. In the same limiting case, the velocity of the slow
magnetosonic wave becomes the Alfvén velocity $u_A$, with $v_x = 0$, $v_y = -h_y/\sqrt{4\pi\rho}$, as in a
wave of the first type, but with a different polarization: the vectors v and h are in a plane
through k and H, not perpendicular to it.

In an incompressible fluid (corresponding formally to the limit $u_0 \to \infty$) only one type of
wave remains, namely Alfvén waves with two independent directions of polarization. The
dispersion relation for these waves is given by (69.8); the vectors v and h are perpendicular
to the wave vector and are related by

$$ \mathbf v = -\mathbf h/\sqrt{4\pi\rho}. \tag{69.11} $$

There is a simple interpretation of the fact that, in a longitudinal magnetic field,
transverse displacements of the fluid are propagated in the form of waves. Because the lines
of force are “frozen in”, the transverse displacement of the particles results in a curvature of
the lines, and therefore in their stretching and, at some points, in their compression. The
forces in a magnetic field (expressed by the Maxwell stress tensor) are such as would occur
if the lines of magnetic force tended to contract and at the same time to repel one another.†
Hence a curvature of the lines results in quasi-elastic forces which tend to straighten them,
leading to further oscillations.

† Let a line of force be along the z-axis. Then the longitudinal stress $\Pi_{zz}$ (65.8) contains a negative term
$-H^2/8\pi$.

Let us now return to formulae (69.4) and (69.10), and consider the opposite limiting case,
$H^2 \gg 4\pi\rho u_0^2$. We then have, in the first approximation, $u_f = H/\sqrt{4\pi\rho}$. Since this
expression is independent of the direction of k, the group velocity is of magnitude $u_f$ and
its direction is that of k. In this wave the vector v is perpendicular to H (Fig. 40, p. 238), and
its magnitude is given in terms of $h = |h_y|$ by $v = h/\sqrt{4\pi\rho}$. For $u_s$ we have in this limiting
case $u_s = u_0H_x/H$. The group velocity is $\partial\omega/\partial\mathbf k = u_0\mathbf H/H$. The vector v in this case is
anti-parallel to H, and its magnitude is given by $v = hH^2/4\pi\rho u_0H_y$.

When the relation between $H^2$ and $\rho u_0^2$ is arbitrary, both $u_f$ and $u_s$ depend on the
direction of the wave vector. When the angle between k and H increases, $u_f$ increases
monotonically and $u_s$ decreases monotonically. It is easy to see that the inequalities

$$ u_s \le u_A \le u_f, \qquad u_f \ge u_0, \qquad u_s \le u_0 \tag{69.12} $$

always hold. If k is parallel to H, $u_f$ and $u_s$ are respectively equal to the greater and the
smaller of $u_0$ and $u_A = H/\sqrt{4\pi\rho}$. If k is perpendicular to H, then

$$ u_f = \sqrt{u_0^2 + H^2/4\pi\rho}, \tag{69.13} $$

while $u_A$ and $u_s$ are zero, i.e. only the fast magnetosonic waves exist.
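The dependence of the three phase velocities on the angle between k and H is easy to tabulate. The sketch below assumes Gaussian units and takes the Alfvén velocity as $|H_x|/\sqrt{4\pi\rho}$ and the magnetosonic velocities as the two roots of $u^4 - (u_0^2 + H^2/4\pi\rho)u^2 + u_0^2H_x^2/4\pi\rho = 0$; the function names `mhd_phase_speeds` and `inequalities_hold` are hypothetical and introduced only for this illustration.

```python
import math


def mhd_phase_speeds(H, rho, u0, theta):
    """Return (u_s, u_A, u_f) for field magnitude H, density rho,
    sound speed u0 and angle theta between k and H."""
    va2 = H * H / (4.0 * math.pi * rho)       # H^2 / 4 pi rho
    ua2 = va2 * math.cos(theta) ** 2          # H_x^2 / 4 pi rho
    s = va2 + u0 * u0
    # Discriminant s^2 - H_x^2 u0^2 / (pi rho); clamp rounding noise at zero.
    root = math.sqrt(max(s * s - 4.0 * u0 * u0 * ua2, 0.0))
    uf2 = 0.5 * (s + root)
    us2 = max(0.5 * (s - root), 0.0)
    return math.sqrt(us2), math.sqrt(ua2), math.sqrt(uf2)


def inequalities_hold(H, rho, u0, n=181, tol=1e-12):
    """Check u_s <= u_A <= u_f, u_f >= u0, u_s <= u0 for angles in [0, pi/2]."""
    for i in range(n):
        theta = 0.5 * math.pi * i / (n - 1)
        us, ua, uf = mhd_phase_speeds(H, rho, u0, theta)
        ordered = us <= ua + tol and ua <= uf + tol
        if not (ordered and uf >= u0 - tol and us <= u0 + tol):
            return False
    return True


if __name__ == "__main__":
    for deg in (0, 30, 60, 90):
        print(deg, mhd_phase_speeds(3.0, 1.0, 1.0, math.radians(deg)))
```

At zero angle the fast and slow velocities reduce to the greater and smaller of the sound and Alfvén velocities, and at a right angle only the fast wave has a non-zero velocity.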

Lastly, let us consider two exact solutions of the equations of magnetohydrodynamics in
the form of a plane wave with any amplitude (not necessarily small).

One of these is a plane Alfvén wave in an incompressible fluid, propagated with velocity
$u_A$; that is, involving x and t only in the combination $x - u_At$. To show this, we go back to
the exact equations (65.1)-(65.4). The equation of continuity (65.3) in an incompressible
fluid becomes div v = 0, whence $v_x$ = constant; without loss of generality, we can put
$v_x = 0$, by an appropriate choice of the frame of reference. The equation div H = 0 gives
$H_x$ = constant. Denoting the transverse components of H by h, we find from (65.2) and
(65.4)

$$ \frac{\partial\mathbf h}{\partial t} = H_x\frac{\partial\mathbf v}{\partial x}, \qquad \frac{\partial\mathbf v}{\partial t} = \frac{H_x}{4\pi\rho}\frac{\partial\mathbf h}{\partial x}, $$

i.e. the exact equations necessarily become linear equations describing a plane wave with
the phase velocity (69.6) and v and h related by (69.11). The wave profile, i.e. the function
$\mathbf h(x - u_At)$, is arbitrary. The x-component of (65.4) gives

$$ \frac{1}{\rho}\frac{\partial P}{\partial x} + \frac{1}{4\pi\rho}\,\mathbf h\cdot\frac{\partial\mathbf h}{\partial x} = 0, $$

whence

$$ P + h^2/8\pi = \text{constant}, \tag{69.14} $$

which determines the variation of the pressure in the wave.

The other case is a simple wave propagated at right angles to the magnetic field
(S. A. Kaplan and K. P. Stanyukovich, 1954). Let the field be in the y-direction, and the
wave again propagated in the x-direction. Then $H_x = H_z = 0$, $H_y = H$, and the equation
div H = 0 is satisfied identically. Equations (65.2)-(65.4) give

$$ \partial H/\partial t + \partial(v_xH)/\partial x = 0, \tag{69.15} $$

$$ \partial\rho/\partial t + \partial(v_x\rho)/\partial x = 0, \tag{69.16} $$

$$ \frac{\partial v_x}{\partial t} + v_x\frac{\partial v_x}{\partial x} = -\frac{1}{8\pi\rho}\frac{\partial H^2}{\partial x} - \frac{1}{\rho}\frac{\partial P}{\partial x}. \tag{69.17} $$

From the first two of these equations it is easily seen that the ratio H/ρ ≡ b satisfies the
equation $\partial b/\partial t + v_x\,\partial b/\partial x = 0$ or db/dt = 0, where the total derivative signifies the rate of
change in a given fluid particle as it moves about. Hence, if the fluid is homogeneous at
some initial instant, so that b is constant, then at all subsequent instants we have
b = constant. Substituting H = ρb into the third equation, we obtain

$$ \frac{\partial v_x}{\partial t} + v_x\frac{\partial v_x}{\partial x} = -\frac{1}{\rho}\frac{\partial}{\partial x}\left(P + \frac{b^2\rho^2}{8\pi}\right). \tag{69.18} $$

Thus the magnetic field has been eliminated from the equations, and the problem reduces
to the solution of equations (69.16) and (69.18). These equations differ from those for
one-dimensional motion in ordinary fluid dynamics only by a change in the equation of state of
the gas: the true pressure P = P(ρ) (for given entropy s) must be replaced by
$P^*(\rho) = P(\rho) + b^2\rho^2/8\pi$. This fact enables us to apply the results of ordinary fluid dynamics to this
case of magnetohydrodynamic flow. In particular, the formulae giving the exact solution
for one-dimensional travelling waves (Riemann’s solution; simple waves, see FM, §94) can
be applied, the role of velocity of sound being played by

$$ u^* = \sqrt{(\partial P^*/\partial\rho)_s} = \sqrt{u_0^2 + H^2/4\pi\rho}, $$

in accordance with formula (69.13).
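The effective velocity of sound follows in one line from the modified equation of state. The short calculation below is a sketch that uses only the definitions $P^*(\rho) = P(\rho) + b^2\rho^2/8\pi$ and $b = H/\rho$ = constant given in the text.

```latex
% Differentiate the effective pressure at constant entropy and constant b:
\left(\frac{\partial P^*}{\partial\rho}\right)_s
  = \left(\frac{\partial P}{\partial\rho}\right)_s + \frac{b^2\rho}{4\pi}
  = u_0^2 + \frac{H^2}{4\pi\rho},
\qquad \text{since } b^2\rho = H^2/\rho .

% Hence the velocity playing the role of the velocity of sound is
u^* = \sqrt{u_0^2 + \frac{H^2}{4\pi\rho}} ,
% which is the fast magnetosonic velocity for k perpendicular to H.
```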



PROBLEM

Determine the absorption coefficient (assumed small) for an Alfvén wave in an incompressible fluid.

Solution. The absorption coefficient for a wave is defined as γ = Q/2q, where Q is the (time) average energy
dissipated per unit time and volume, and q is the mean energy flux density in the wave. The amplitude of the wave
decreases as $e^{-\gamma x}$ during its propagation. Q is given by the right-hand side of equation (66.3); in an incompressible
fluid we have for a wave propagated in the x-direction (so that $v_x = 0$)

$$ Q = \eta\,\overline{(\partial\mathbf v/\partial x)^2} + (c^2/16\pi^2\sigma)\,\overline{(\partial\mathbf h/\partial x)^2}. $$

In the energy flux density (66.5), we can omit the small dissipative terms, leaving $q = -H_x\,\overline{\mathbf h\cdot\mathbf v}/4\pi$. Using
(69.6) and (69.11), we have the result

$$ \gamma = \frac{\omega^2}{2u_A^3}\left(\nu + \frac{c^2}{4\pi\sigma}\right). $$
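The arithmetic of this problem can be checked numerically. The sketch below evaluates γ = Q/2q directly for a sinusoidal wave, with $Q = \eta\overline{(\partial v/\partial x)^2} + (c^2/16\pi^2\sigma)\overline{(\partial h/\partial x)^2}$, $q = -H_x\overline{\mathbf h\cdot\mathbf v}/4\pi$ and $\mathbf v = -\mathbf h/\sqrt{4\pi\rho}$, and compares it with the closed form $\omega^2(\nu + c^2/4\pi\sigma)/2u_A^3$ that these relations imply. The function names and all numerical values are hypothetical; Gaussian units are assumed.

```python
import math


def alfven_absorption(omega, H_x, rho, nu, sigma, c):
    """Closed form: omega^2 (nu + c^2 / 4 pi sigma) / (2 u_A^3)."""
    u_A = H_x / math.sqrt(4.0 * math.pi * rho)
    return omega ** 2 * (nu + c ** 2 / (4.0 * math.pi * sigma)) / (2.0 * u_A ** 3)


def absorption_from_definition(omega, H_x, rho, nu, sigma, c, h_amp=1.0e-3):
    """gamma = Q / 2q for a sinusoidal wave of field amplitude h_amp."""
    u_A = H_x / math.sqrt(4.0 * math.pi * rho)
    k = omega / u_A
    v_amp = h_amp / math.sqrt(4.0 * math.pi * rho)   # |v| = |h| / sqrt(4 pi rho)
    eta = rho * nu                                   # dynamic viscosity
    # Averaging a squared sinusoid gives the same factor 1/2 in Q and in q.
    Q = 0.5 * (eta * (k * v_amp) ** 2
               + c ** 2 / (16.0 * math.pi ** 2 * sigma) * (k * h_amp) ** 2)
    q = 0.5 * H_x * h_amp * v_amp / (4.0 * math.pi)
    return Q / (2.0 * q)


if __name__ == "__main__":
    args = (2.0, 5.0, 1.3, 0.01, 1.0e3, 3.0e2)
    print(alfven_absorption(*args), absorption_from_definition(*args))
```

The wave amplitude cancels in the ratio, so the result does not depend on `h_amp`, as expected for a linear damping coefficient.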
§70. Conditions at discontinuities


The equations of motion for an ideal magnetohydrodynamic medium admit discontinuous
flows as in ordinary fluid dynamics. To elucidate the conditions which must be satisfied
on a surface of discontinuity, let us consider an element of the surface and use a system of
coordinates in which it is at rest.†

First of all, the mass flux must be continuous at a surface of discontinuity: the mass
of gas entering from one side must be equal to the mass leaving on the other side. Thus
$\rho_1v_{1n} = \rho_2v_{2n}$, where the suffixes 1 and 2 refer to the two sides of the discontinuity, and the
suffix n denotes the component of a vector normal to the surface. In what follows we shall
denote the difference between the values of any quantity on the two sides of the surface of
discontinuity by enclosing it in square brackets. Thus $[\rho v_n] = 0$.

Next, the energy flux must be continuous. Using the expression (65.11) and omitting the
dissipative terms, we obtain

$$ [q_n] = [\rho v_n(\tfrac12v^2 + w) + v_nH^2/4\pi - H_n\mathbf v\cdot\mathbf H/4\pi] = 0. $$

The momentum flux must also be continuous. This condition means that $[\Pi_{ik}n_k] = 0$,
where $\Pi_{ik}$ is the momentum flux density tensor, and n is a unit vector normal to the surface.
Using (65.8), we therefore have

$$ [P + \rho v_n^2 + (H_t^2 - H_n^2)/8\pi] = 0, $$

$$ [\rho v_n\mathbf v_t - H_n\mathbf H_t/4\pi] = 0, $$

where the suffix t denotes the component tangential to the surface.

Finally, the normal component of the magnetic field and the tangential component
of the electric field must be continuous. If the conductivity of the medium is infinite,
the induced electric field is given by E = −v × H/c, and the condition $[\mathbf E_t] = 0$ leads to
$[H_n\mathbf v_t - \mathbf H_tv_n] = 0$.

In what follows it is more convenient to use the specific volume of the gas (V = 1/ρ) in
place of its density. The mass flux density through the discontinuity is denoted by

$$ j = \rho v_n = v_n/V. \tag{70.1} $$

Since j and $H_n$ are continuous, we can write the remaining boundary conditions in the
following form:

$$ j[w + \tfrac12j^2V^2 + \tfrac12v_t^2 + VH_t^2/4\pi] = H_n[\mathbf H_t\cdot\mathbf v_t]/4\pi, \tag{70.2} $$

$$ [P] + j^2[V] + [H_t^2]/8\pi = 0, \tag{70.3} $$

$$ j[\mathbf v_t] = H_n[\mathbf H_t]/4\pi, \tag{70.4} $$

$$ H_n[\mathbf v_t] = j[V\mathbf H_t]. \tag{70.5} $$

This is the fundamental system of equations of discontinuities in magnetohydrodynamics.


† This condition fixes only the velocity of the coordinate system in the direction normal to the surface. Any
constant vector may be added to its tangential velocity.

§71. Tangential and rotational discontinuities

In ordinary fluid dynamics, discontinuities of two different kinds are possible: shock
waves and tangential discontinuities. Mathematically, the two types occur because some
of the boundary conditions can be written as the vanishing of a product of two factors, and
two different solutions are obtained by equating the factors to zero in turn. This feature
is not present in the equations (70.2)-(70.5) of magnetohydrodynamics, and it might
therefore be supposed that only one type of discontinuity occurs. In reality, however, it is
found that essentially different types of discontinuity again occur (F. de Hoffmann and E.
Teller, 1950).

Let us consider, first of all, discontinuities for which j = 0. This means that $v_{1n} = v_{2n} = 0$,
i.e. the fluid moves parallel to the surface of discontinuity. If $H_n \ne 0$, we see from
equations (70.2)-(70.5) that the velocity, pressure and magnetic field must be continuous.
The density (and therefore the entropy, temperature, etc.) may have any discontinuity.
Such a surface may be called a contact discontinuity, and is simply the boundary between
two media at rest which have different densities and temperatures.

If both j and $H_n$ are zero, then three of the four equations (70.2)-(70.5) are satisfied
identically, and therefore this is clearly a special case. We thus find a type of discontinuity
which may be called a tangential discontinuity, as in ordinary fluid dynamics. At such a
discontinuity the velocity and the magnetic field are tangential and can have any
discontinuity in both magnitude and direction:

$$ j = 0, \qquad H_n = 0, \qquad [\mathbf v_t] \ne 0, \qquad [\mathbf H_t] \ne 0. \tag{71.1} $$

The density discontinuity also can take any value, but the pressure discontinuity is related
to that of $H_t$ by equation (70.3):

$$ [V] \ne 0, \qquad [P + H_t^2/8\pi] = 0. \tag{71.2} $$

The discontinuities of the other thermodynamic quantities (entropy, temperature, etc.) are
related to those of V and P by the equation of state.

Another type of discontinuity is one in which the gas density is continuous. Since the flux
is continuous, the normal velocity component is therefore continuous also:

$$ j \ne 0, \qquad [V] = 0, \qquad [v_n] = 0. \tag{71.3} $$

On the right-hand side of equation (70.5) we can take V outside the brackets and divide this
equation by equation (70.4), obtaining

$$ j = H_n/\sqrt{4\pi V}, \tag{71.4} $$

$$ [\mathbf v_t] = \sqrt{V/4\pi}\,[\mathbf H_t]. \tag{71.5} $$

In equation (70.2) we put w = ε + PV; since V is continuous, this equation can be rewritten

$$ [\varepsilon] + V[P + H_t^2/8\pi] + \tfrac12[(\mathbf v_t - \sqrt{V/4\pi}\,\mathbf H_t)^2] = 0, $$

where $H_n$ has been replaced in accordance with (71.4). The second term is zero by (70.3) and the third
is zero by (71.5), so that we find [ε] = 0, i.e. the internal energy also is continuous.
Every other thermodynamic quantity is determined if ε and V are given. Hence all the other
thermodynamic quantities, including the pressure, are continuous. It then follows from
(70.3) that $H_t^2$ is continuous, i.e.

$$ [P] = 0, \qquad [H_t^2] = 0. \tag{71.6} $$

The fact that $H_t^2$ and $H_n$ are both continuous means that the magnitude of H itself and its
angle to the surface are likewise continuous.

Formulae (71.3)-(71.6) give all the properties of the discontinuities under consideration.
The thermodynamic quantities are continuous, but the magnetic field is turned through an
angle about the normal, its magnitude remaining unchanged. The vector $\mathbf H_t$, and therefore
(by (71.5)) the tangential velocity component, are discontinuous, but the normal velocity
component $v_n = jV$ is continuous, and its value is

$$ v_n = H_n\sqrt{V/4\pi} = H_n/\sqrt{4\pi\rho}. \tag{71.7} $$

These are called rotational or Alfvén discontinuities.
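The properties of a rotational discontinuity can be verified against the general boundary conditions. The sketch below writes the conditions (70.2)-(70.5) as residuals, builds a pair of states in which $\mathbf H_t$ is rotated through a given angle while V, P and w are unchanged, $j = H_n/\sqrt{4\pi V}$ and $[\mathbf v_t] = \sqrt{V/4\pi}\,[\mathbf H_t]$, and evaluates the residuals. The dictionary layout of a state and the names `jump_residuals` and `rotational_pair` are hypothetical conventions of this example.

```python
import math

FOUR_PI = 4.0 * math.pi


def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1]


def jump_residuals(s1, s2, j, H_n):
    """Left minus right sides of the four boundary conditions (70.2)-(70.5).

    Each state is a dict with specific volume V, pressure P, heat function w
    and two-component tangential vectors v_t and H_t.
    """
    def energy(s):
        return (s["w"] + 0.5 * j * j * s["V"] ** 2 + 0.5 * _dot(s["v_t"], s["v_t"])
                + s["V"] * _dot(s["H_t"], s["H_t"]) / FOUR_PI)

    hv = _dot(s2["H_t"], s2["v_t"]) - _dot(s1["H_t"], s1["v_t"])
    hh = _dot(s2["H_t"], s2["H_t"]) - _dot(s1["H_t"], s1["H_t"])
    res = [j * (energy(s2) - energy(s1)) - H_n * hv / FOUR_PI,
           (s2["P"] - s1["P"]) + j * j * (s2["V"] - s1["V"]) + hh / (2.0 * FOUR_PI)]
    for i in range(2):
        dv = s2["v_t"][i] - s1["v_t"][i]
        dH = s2["H_t"][i] - s1["H_t"][i]
        dVH = s2["V"] * s2["H_t"][i] - s1["V"] * s1["H_t"][i]
        res.append(j * dv - H_n * dH / FOUR_PI)
        res.append(H_n * dv - j * dVH)
    return res


def rotational_pair(V, P, w, H_n, H_t1, v_t1, angle):
    """States on the two sides of a rotational discontinuity, and the flux j."""
    s = math.sqrt(V / FOUR_PI)
    ca, sa = math.cos(angle), math.sin(angle)
    H_t2 = (ca * H_t1[0] - sa * H_t1[1], sa * H_t1[0] + ca * H_t1[1])
    v_t2 = tuple(v_t1[i] + s * (H_t2[i] - H_t1[i]) for i in range(2))
    j = H_n / math.sqrt(FOUR_PI * V)
    s1 = dict(V=V, P=P, w=w, v_t=tuple(v_t1), H_t=tuple(H_t1))
    s2 = dict(V=V, P=P, w=w, v_t=v_t2, H_t=H_t2)
    return s1, s2, j


if __name__ == "__main__":
    s1, s2, j = rotational_pair(0.5, 1.0, 2.5, 1.2, (0.7, 0.0), (0.3, -0.1), 1.1)
    print(jump_residuals(s1, s2, j, 1.2))
```

All residuals should vanish to rounding accuracy for any rotation angle; rotating the field without the accompanying jump in tangential velocity leaves (70.4) unsatisfied.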

It is useful to note that, by a suitable choice of the coordinate system, we can always
ensure that the gas velocity is parallel to the field on each side of a rotational discontinuity.
To achieve this, we use a coordinate system moving with velocity
$\mathbf v_{1t} - \mathbf H_{1t}\sqrt{V/4\pi} = \mathbf v_{2t} - \mathbf H_{2t}\sqrt{V/4\pi}$.
(Compare the footnote to §70.) In the new system the ratio of each
component of v to the corresponding component of H on either side of the discontinuity is
$\sqrt{V/4\pi}$, i.e.

$$ \mathbf v_1 = \mathbf H_1\sqrt{V/4\pi}, \qquad \mathbf v_2 = \mathbf H_2\sqrt{V/4\pi}. \tag{71.8} $$

Thus in this system of coordinates the velocity is rotated with the magnetic field, its
magnitude and angle to the normal remaining unchanged.

The velocity $v_n$ is also minus the velocity of propagation of the discontinuity relative to
the fluid. This is equal to the phase velocity $u_A$ of the Alfvén wave. The occurrence of this
equality for all rotational discontinuities is to some extent accidental, but when the
discontinuities of the various quantities are small the equality must hold. For such a
discontinuity is a weak perturbation, in which the velocity v and the magnetic field H
receive small increments perpendicular to the plane through H and the normal n. This
perturbation is of the type whose phase velocity is $u_A$. The physical velocity of propagation
of the front of a small perturbation is the normal component of the group velocity, i.e. its
component in the direction of the wave vector k. Since the relation between ω and k is
linear, we have $\mathbf k\cdot\partial\omega/\partial\mathbf k = \omega$, and so this component is the same as the phase velocity
$\omega/k = u_A$.

Although tangential and rotational discontinuities form two different types, there are 
also discontinuities having the properties of both. These discontinuities are such that v and 
H are tangential in direction and continuous in magnitude. 

In ordinary fluid dynamics, tangential discontinuities are always unstable with respect to
infinitesimal perturbations, and so are rapidly broadened into turbulent regions. A
magnetic field, however, has a stabilizing effect on the motion of a conducting fluid, and in
this case tangential discontinuities may be stable. This result is a natural consequence of the
fact that a perturbation involving fluid displacements transverse to the field leads to a
stretching of the lines of magnetic force “frozen” in it, and therefore to the appearance of
forces which tend to restore the unperturbed flow. Let us ascertain the stability conditions
for a tangential discontinuity in an incompressible fluid (S. I. Syrovatskii, 1953). We write
$\mathbf v = \mathbf v_{1,2} + \mathbf v'$, $P = P_{1,2} + P'$, $\mathbf H = \mathbf H_{1,2} + \mathbf H'$, where $\mathbf v_{1,2}$, $P_{1,2}$ and $\mathbf H_{1,2}$ are the
unperturbed values constant on either side of the discontinuity; v', P' and H' are the small
perturbations. Substituting in equations (66.7)-(66.9), we have for an ideal fluid

$$ \operatorname{div}\mathbf u' = 0, \qquad \operatorname{div}\mathbf v' = 0, \tag{71.9} $$

$$ \partial\mathbf u'/\partial t = (\mathbf u\cdot\operatorname{grad})\mathbf v' - (\mathbf v\cdot\operatorname{grad})\mathbf u', \tag{71.10} $$

$$ \frac{\partial\mathbf v'}{\partial t} + (\mathbf v\cdot\operatorname{grad})\mathbf v' = -\frac{1}{\rho}\operatorname{grad}P' - \mathbf u\times\operatorname{curl}\mathbf u' = -\frac{1}{\rho}\operatorname{grad}(P' + \rho\mathbf u\cdot\mathbf u') + (\mathbf u\cdot\operatorname{grad})\mathbf u'. \tag{71.11} $$

For brevity, we omit here and henceforward the suffixes 1, 2, and write $\mathbf u = \mathbf H/\sqrt{4\pi\rho}$
(and correspondingly $\mathbf u' = \mathbf H'/\sqrt{4\pi\rho}$).
Taking the divergence of (71.11) and using (71.9), we find

$$ \Delta(P' + \rho\mathbf u\cdot\mathbf u') = 0. \tag{71.12} $$

Let the plane of the discontinuity be x = 0; the vectors v and u are parallel to it. In each of
the half-spaces x > 0 and x < 0, we seek all quantities v', u', P' in a form proportional to
$\exp\{i(\mathbf k\cdot\mathbf r - \omega t) + \kappa x\}$, where k is a two-dimensional vector in the yz-plane. From equation
(71.12), we find that $\kappa^2 = k^2$, so that we must take κ = k for x < 0 and κ = −k for x > 0.

Next, we eliminate $v'_x$ from the x-components of (71.10) and (71.11), obtaining

$$ P' + \rho\mathbf u\cdot\mathbf u' = \frac{i\rho u'_x}{\kappa\,\mathbf k\cdot\mathbf u}\{(\mathbf k\cdot\mathbf u)^2 - (\omega - \mathbf k\cdot\mathbf v)^2\}. \tag{71.13} $$

The case where the expression in the braces is zero is of no interest, since ω is then real,
whereas instability can arise only from complex values of ω.

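Let $\zeta(y, z, t)$ be the displacement, along the x-axis, of the discontinuity surface, as a result
of the perturbation. On the displaced surface the conditions (71.1) and (71.2) must be
satisfied:

$$ [P + P' + \tfrac12\rho(\mathbf u + \mathbf u')^2] \cong [P' + \rho\mathbf u\cdot\mathbf u'] = 0, $$

$$ u_{1n} + u'_{1n} \cong u'_{1x} - (\mathbf u_1\cdot\operatorname{grad})\zeta = 0, $$

$$ u_{2n} + u'_{2n} \cong u'_{2x} - (\mathbf u_2\cdot\operatorname{grad})\zeta = 0; $$

the condition that there be no flux of the fluid through the discontinuity surface is
automatically satisfied. Putting $\zeta = \text{constant}\times e^{i(\mathbf k\cdot\mathbf r - \omega t)}$ and eliminating ζ, $u'_{1x}$ and $u'_{2x}$
from these three equations, we get an equation determining the possible values of ω:

$$ (\omega - \mathbf k\cdot\mathbf v_1)^2 + (\omega - \mathbf k\cdot\mathbf v_2)^2 = (\mathbf k\cdot\mathbf u_1)^2 + (\mathbf k\cdot\mathbf u_2)^2. $$

This quadratic equation has no complex root if

$$ 2(\mathbf k\cdot\mathbf u_1)^2 + 2(\mathbf k\cdot\mathbf u_2)^2 - (\mathbf k\cdot\mathbf v)^2 > 0 $$

or

$$ (2u_{1i}u_{1k} + 2u_{2i}u_{2k} - v_iv_k)k_ik_k > 0, $$

where $\mathbf v = \mathbf v_2 - \mathbf v_1$ is the discontinuity of the velocity. This quadratic form is
positive-definite if the trace and determinant of the rank-two tensor
in the brackets are both positive. Hence we have the required stability conditions:†

$$ H_1^2 + H_2^2 > 2\pi\rho v^2, \qquad (\mathbf H_1\times\mathbf H_2)^2 \ge 2\pi\rho\{(\mathbf H_1\times\mathbf v)^2 + (\mathbf H_2\times\mathbf v)^2\}. \tag{71.14} $$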
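The stability criterion can be tested numerically in two equivalent ways. The sketch below evaluates the trace and determinant conditions, taken here as $H_1^2 + H_2^2 > 2\pi\rho v^2$ and $(\mathbf H_1\times\mathbf H_2)^2 \ge 2\pi\rho\{(\mathbf H_1\times\mathbf v)^2 + (\mathbf H_2\times\mathbf v)^2\}$, and compares the verdict with a direct scan of the quadratic form $2(\mathbf k\cdot\mathbf u_1)^2 + 2(\mathbf k\cdot\mathbf u_2)^2 - (\mathbf k\cdot\mathbf v)^2$ over directions of k, with $\mathbf u = \mathbf H/\sqrt{4\pi\rho}$. All vectors are two-component vectors in the plane of the discontinuity; the function names are hypothetical.

```python
import math


def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1]


def _cross(a, b):
    # Normal component of the cross product of two vectors in the plane.
    return a[0] * b[1] - a[1] * b[0]


def stable_by_conditions(H1, H2, v, rho):
    """Evaluate the trace and determinant inequalities."""
    c = 2.0 * math.pi * rho
    trace_ok = _dot(H1, H1) + _dot(H2, H2) > c * _dot(v, v)
    det_ok = _cross(H1, H2) ** 2 >= c * (_cross(H1, v) ** 2 + _cross(H2, v) ** 2)
    return trace_ok and det_ok


def stable_by_scan(H1, H2, v, rho, n=3600):
    """Check that the quadratic form is positive for n sampled directions of k."""
    s = 1.0 / math.sqrt(4.0 * math.pi * rho)      # u = H / sqrt(4 pi rho)
    for i in range(n):
        phi = math.pi * i / n
        k = (math.cos(phi), math.sin(phi))
        form = (2.0 * (s * _dot(k, H1)) ** 2 + 2.0 * (s * _dot(k, H2)) ** 2
                - _dot(k, v) ** 2)
        if form <= 0.0:
            return False
    return True


if __name__ == "__main__":
    crossed = ((1.0, 0.0), (0.0, 1.0), (0.1, 0.1), 1.0)
    parallel = ((1.0, 0.0), (1.0, 0.0), (0.0, 0.1), 1.0)
    print(stable_by_conditions(*crossed), stable_by_scan(*crossed))
    print(stable_by_conditions(*parallel), stable_by_scan(*parallel))
```

The second example illustrates that parallel fields on the two sides cannot stabilize a velocity jump with a component perpendicular to them, since perturbations with k perpendicular to the field do not stretch the lines of force.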


^^'^£is“eXed by ‘^en p in ihese 
In reality, however, the existence of a small but finite viscosity and electrical resistance in
the fluid means that such tangential discontinuities cannot exist indefinitely, even if the 
conditions (71.14) are fulfilled. Although no turbulence occurs, the sharp discontinuity is 
replaced by a gradually widening transitional region, in which the velocity and the 
magnetic field change smoothly from one value to another. This is easily seen from 
the equations of motion (66.8) and (66.9) if the dissipative terms are retained. We take 
the x-axis in the direction of the normal to the discontinuity. Assuming all quantities to 
depend only on x (and possibly on the time), we can write the transverse components of 
these equations as 

$$ \partial\mathbf H_t/\partial t = (c^2/4\pi\sigma)\,\partial^2\mathbf H_t/\partial x^2, \qquad \partial\mathbf v_t/\partial t = \nu\,\partial^2\mathbf v_t/\partial x^2, \tag{71.15} $$

the fluid being supposed incompressible. If we assume steady flow, the left-hand sides of
equations (71.15) are zero, and the only solution which remains finite as $x \to \pm\infty$ is
$\mathbf H_t$ = constant, $\mathbf v_t$ = constant, which contradicts the assumption that these quantities
undergo a change at the discontinuity. Thus a tangential discontinuity cannot have a
constant width such as is found for (e.g.) a weak shock wave. Equations (71.15) have the
form of heat-conduction equations. As we know from the theory of thermal conduction, a
discontinuity in a quantity satisfying such an equation is gradually smoothed out into a
transitional region, whose width increases as the square root of the time. Since the
coefficients in the two equations (71.15) are different, the widths $\delta_v$ and $\delta_H$ of the
transitional regions for the velocity and the magnetic field are also different:

$$ \delta_v \sim \sqrt{\nu t}, \qquad \delta_H \sim c\sqrt{t/\sigma}. $$
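The square-root growth of the width can be made explicit with the standard similarity solution of the heat-conduction equation. The block below is a sketch under the assumption of an initially sharp step at x = 0, with tangential fields $\mathbf H_1$ for x < 0 and $\mathbf H_2$ for x > 0; the symbol D for the diffusion coefficient is introduced here only for brevity.

```latex
% Step initial condition smoothed by diffusion with coefficient D:
\mathbf H_t(x,t) = \tfrac12(\mathbf H_1 + \mathbf H_2)
  + \tfrac12(\mathbf H_2 - \mathbf H_1)\,
    \operatorname{erf}\!\left(\frac{x}{2\sqrt{Dt}}\right),
\qquad D = \frac{c^2}{4\pi\sigma} .

% The same form holds for v_t with D replaced by the kinematic viscosity \nu.
% The transition is confined to |x| \lesssim 2\sqrt{Dt}, so that
\delta_H \sim \sqrt{Dt} \sim c\sqrt{t/\sigma}, \qquad
\delta_v \sim \sqrt{\nu t} .
```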

Rotational discontinuities in an incompressible fluid are stable with respect to 
infinitesimal perturbations, whatever the strength of the magnetic field (S. I. Syrovatskii, 
1953). Like tangential discontinuities, however, they cannot have constant widths, but are 
gradually smoothed out by the viscosity and electrical resistance of the fluid (see Problem). 


PROBLEM 


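Find the manner of widening of a rotational discontinuity with time.

Solution. Assuming all quantities to depend only on the coordinate x (and on the time), we find from the
equations div v = 0 and div H = 0 that $v_x$ = constant and $H_x$ = constant. Let the coordinate system be such that
the values of v and H on each side of the discontinuity (outside the transitional layer) are related by (71.8). Then
$v_x = u_x$, where u has the same meaning as in (71.9)-(71.11). For the transverse components $\mathbf u_t$ and $\mathbf v_t$ we have from
equations (66.8) and (66.9)

$$ \frac{\partial\mathbf u_t}{\partial t} = u_x\frac{\partial}{\partial x}(\mathbf v_t - \mathbf u_t) + \frac{c^2}{4\pi\sigma}\frac{\partial^2\mathbf u_t}{\partial x^2}, \qquad \frac{\partial\mathbf v_t}{\partial t} = -u_x\frac{\partial}{\partial x}(\mathbf v_t - \mathbf u_t) + \nu\frac{\partial^2\mathbf v_t}{\partial x^2}. \tag{1} $$

Since the difference $\mathbf v_t - \mathbf u_t$ tends to zero for $x \to \pm\infty$, because of the relations (71.8), this difference must be small
in the transitional layer in comparison with the sum $\mathbf v_t + \mathbf u_t$. Adding the equations (1), we can therefore neglect a
term in $\mathbf v_t - \mathbf u_t$, obtaining

$$ \frac{\partial}{\partial t}(\mathbf u_t + \mathbf v_t) = \frac12\left(\nu + \frac{c^2}{4\pi\sigma}\right)\frac{\partial^2}{\partial x^2}(\mathbf u_t + \mathbf v_t). $$

From this we see that the width of the discontinuity varies in a manner given by

$$ \delta \sim \{(\nu + c^2/4\pi\sigma)t\}^{1/2}. $$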
§72. Shock waves

Let us now consider the type of discontinuity in which

$j \neq 0, \qquad [V] \neq 0.$ (72.1)

Such discontinuities are called shock waves, as in ordinary fluid dynamics. They are
characterized by a discontinuity of density and by the fact that the gas moves through them
($v_{n1}$ and $v_{n2}$ being non-zero). The normal component of the magnetic field may or may not
be zero.

On comparing equations (70.4) and (70.5) we see that, when $H_n \neq 0$, the vectors
$\mathbf{H}_{t2} - \mathbf{H}_{t1}$ and $V_2\mathbf{H}_{t2} - V_1\mathbf{H}_{t1}$ are parallel to the same vector $\mathbf{v}_{t2} - \mathbf{v}_{t1}$, and therefore to
each other. Hence it follows that $\mathbf{H}_{t1}$ and $\mathbf{H}_{t2}$ are collinear, i.e. the vectors $\mathbf{H}_1$, $\mathbf{H}_2$ and the
normal to the surface are coplanar, unlike what happens (in general) in tangential and
Alfvén discontinuities, in which the $\mathbf{H}_1\mathbf{n}$ and $\mathbf{H}_2\mathbf{n}$ planes in general do not coincide. This
result holds also when $H_n = 0$; in this case, which we shall discuss later, it follows from
(70.5) that $V_1\mathbf{H}_{t1} = V_2\mathbf{H}_{t2}$.

The velocity discontinuity $\mathbf{v}_{t2} - \mathbf{v}_{t1}$ lies in the same plane as $\mathbf{H}_1$ and $\mathbf{H}_2$. We can, without
loss of generality, assume that the vectors $\mathbf{v}_1$ and $\mathbf{v}_2$ themselves lie in this plane, so that the
motion in the shock wave is two-dimensional. Furthermore, it is easy to see that, if $H_n \neq 0$,
a suitable transformation of the coordinates will always ensure that the vectors v and H are
collinear on each side of the discontinuity. To achieve this, we use a coordinate system
which moves with velocity $\mathbf{v}_t - (v_n/H_n)\mathbf{H}_t = \mathbf{v}_t - (jV/H_n)\mathbf{H}_t$ (the value of this expression is
the same on each side of the discontinuity, by (70.5)). In the following formulae, however,
the choice of this particular coordinate system is not implied.

Let us derive the relation for shock waves in magnetohydrodynamics which corresponds
to the shock adiabatic (Hugoniot adiabatic) in ordinary fluid dynamics. Eliminating $[\mathbf{v}_t]$
from (70.4) and (70.5) we have

$j^2 = H_n^2[H_t]/4\pi[VH_t];$ (72.2)

we have replaced $\mathbf{H}_t$ by $H_t$ since $\mathbf{H}_{t1}$ and $\mathbf{H}_{t2}$ are collinear.† In order to eliminate $[\mathbf{v}_t]$
from equation (70.2), we rewrite that equation as

$[w] + \tfrac{1}{2}j^2[V^2] + \tfrac{1}{2}[(\mathbf{v}_t - H_n\mathbf{H}_t/4\pi j)^2] + [VH_t^2]/4\pi - H_n^2[H_t^2]/32\pi^2 j^2 = 0.$

The third term is zero by equation (70.4) and so $\mathbf{v}_t$ does not appear. In the last term we
substitute $j^2$ from (72.2) and in the second term from (70.3), i.e.

$j^2 = \{P_2 - P_1 + (H_{t2}^2 - H_{t1}^2)/8\pi\}/(V_1 - V_2).$ (72.3)

A simple calculation then gives

$\varepsilon_2 - \varepsilon_1 + \tfrac{1}{2}(P_2 + P_1)(V_2 - V_1) + (V_2 - V_1)(H_{t2} - H_{t1})^2/16\pi = 0.$ (72.4)

This is the equation of the shock adiabatic in magnetohydrodynamics. It differs from the 
ordinary equation by the presence of the third term. 
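As a numerical illustration of (72.4), the following minimal sketch evaluates the left-hand side of the shock adiabatic for given states on the two sides. It assumes a polytropic gas with internal energy per unit mass $\varepsilon = PV/(\gamma - 1)$; this equation of state, the function name `mhd_hugoniot_residual` and its arguments are illustrative choices, not part of the text.

```python
import math

def mhd_hugoniot_residual(P1, V1, P2, V2, Ht1, Ht2, gamma=5.0 / 3.0):
    """Left-hand side of the MHD shock adiabatic (72.4), Gaussian units.

    Assumption (not from the text): polytropic gas, eps = P*V/(gamma - 1).
    A pair of states lies on the adiabatic when the returned value is zero.
    """
    eps1 = P1 * V1 / (gamma - 1.0)
    eps2 = P2 * V2 / (gamma - 1.0)
    ordinary = eps2 - eps1 + 0.5 * (P2 + P1) * (V2 - V1)
    magnetic = (V2 - V1) * (Ht2 - Ht1) ** 2 / (16.0 * math.pi)  # the third term
    return ordinary + magnetic

# Ordinary Mach-2 shock in a monatomic gas: P2/P1 = 4.75, V2/V1 = 7/16.
no_field = mhd_hugoniot_residual(1.0, 1.0, 4.75, 7.0 / 16.0, 0.0, 0.0)
with_field = mhd_hugoniot_residual(1.0, 1.0, 4.75, 7.0 / 16.0, 0.0, 1.0)
```

With equal tangential fields the third term vanishes and the ordinary adiabatic is recovered; a jump in $H_t$ shifts the residual by $(V_2 - V_1)(H_{t2} - H_{t1})^2/16\pi$, so the same pair $(P_2, V_2)$ no longer satisfies the equation.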

We may also write out again equation (70.4), which gives the discontinuity of $\mathbf{v}_t$ in terms
of that of $\mathbf{H}_t$:

$\mathbf{v}_{t2} - \mathbf{v}_{t1} = H_n(\mathbf{H}_{t2} - \mathbf{H}_{t1})/4\pi j.$ (72.5)

Equations (72.2)-(72.5) form a complete system of equations of shock waves. We shall
conventionally assign the suffix 1 to the medium towards which the wave is propagated; 
thus the gas passes from side 1 in front of the shock wave to side 2 behind it. As already 
mentioned, the coordinates used are such that a particular surface element of the 
discontinuity is at rest and the gas passes through it. 

In ordinary fluid dynamics, Zemplen's theorem (FM, §84) states that the pressure and
density increase in the shock wave:

$P_2 > P_1, \qquad \rho_2 > \rho_1;$ (72.6)

the shock wave is therefore a compression wave. Here it is assumed that

$(\partial^2 V/\partial P^2)_s > 0;$ (72.7)

although this is not a thermodynamic inequality, it is satisfied in almost all cases. Zemplen's
theorem follows from the law of increase of entropy.

It is easy to see that Zemplen’s theorem remains valid in magnetohydrodynamics for 
weak shock waves, provided that the condition (72.7) is satisfied. In a weak shock wave, the 
discontinuities of all quantities are small. Expanding equation (72.4) in powers of the 
pressure and entropy discontinuities, we have 

$T(s_2 - s_1) = \tfrac{1}{12}(\partial^2 V/\partial P^2)_s(P_2 - P_1)^3 - \tfrac{1}{16\pi}(\partial V/\partial P)_s(P_2 - P_1)(H_{t2} - H_{t1})^2;$ (72.8)

the first term is that found in ordinary fluid dynamics (see FM, §83). Since $-(\partial V/\partial P)_s > 0$
according to one of the thermodynamic inequalities, the condition $s_2 - s_1 > 0$ with (72.8)
gives $P_2 > P_1$, and accordingly $V_2 < V_1$.

If, in addition to the inequality (72.7), the thermal expansion coefficient is positive,
$(\partial V/\partial T)_P > 0$, Zemplen's theorem in magnetohydrodynamics can be proved as in ordinary
fluid dynamics, without assuming that the discontinuities of all quantities are small (R. V.
Polovin and G. Ya. Lyubarskiĭ, 1958; S. V. Iordanskiĭ, 1958).

Let $P_1$, $V_1$ be the specified initial state of the gas, and $s_2^{(0)}$ the entropy in the final state
for a given value of $V_2$ in the absence of a magnetic field. The entropy in the final state
of the gas in the presence of a magnetic field with the same values of $P_1$, $V_1$ and $V_2$ is
denoted by $s_2^{(H)}$. In ordinary fluid dynamics, $V_2 > V_1$ implies that $s_2^{(0)} < s_1$, so that a
rarefaction wave is impossible. We shall show that (under the conditions stated above)
$s_2^{(H)} < s_2^{(0)}$, so that certainly $s_2^{(H)} < s_1$; this proves that rarefaction waves are impossible in
magnetohydrodynamics also.

Differentiating equation (72.4) with respect to $P_2$ for a constant $V_2$, and using $(\partial\varepsilon/\partial P)_V
= T(\partial s/\partial P)_V$, we find

$T_2(\partial s_2^{(H)}/\partial P_2)_{V_2} + \tfrac{1}{2}(V_2 - V_1) + (\partial Q/\partial P_2)_{V_2} = 0,$ (72.9)

where

$Q = (V_2 - V_1)(H_{t2} - H_{t1})^2/16\pi.$

Because of the thermodynamic relations

$(\partial s/\partial P)_V = (c_v/T)(\partial T/\partial P)_V = -(c_v/T)(\partial V/\partial P)_T/(\partial V/\partial T)_P,$

the first term in (72.9) has the same sign as $(\partial V/\partial T)_P$, and is therefore, by hypothesis,
positive. Hence, if $V_2 > V_1$, then $(\partial Q/\partial P_2)_{V_2} < 0$. The presence of a magnetic field
increases Q (Q = 0 without a field, Q > 0 with one), and so decreases $P_2$ for a given $V_2$.
Since, by hypothesis, $(\partial s/\partial P)_V > 0$, it follows in turn that $s_2^{(H)} < s_2^{(0)}$. This completes the
proof.

Lastly, let us consider the case mentioned at the beginning of this section, when the
magnetic field on each side of the discontinuity surface is in the tangential plane, $H_n = 0$
(a perpendicular shock wave). From (72.5) we then have $\mathbf{v}_{t2} = \mathbf{v}_{t1}$, i.e. the tangential velocity
component is continuous. By a suitable choice of coordinates, therefore, we can always
ensure that $\mathbf{v}_t = 0$ on either side of the discontinuity, i.e. the gas moves perpendicularly to
the discontinuity, and we shall henceforth assume this. From equation (72.2) we have
$V_2H_2 = V_1H_1$. This relation shows that equations (72.3) and (72.4) can be written

$j^2 = (P_2^* - P_1^*)/(V_1 - V_2), \qquad \varepsilon_2^* - \varepsilon_1^* + \tfrac{1}{2}(P_2^* + P_1^*)(V_2 - V_1) = 0,$

which differ from the ordinary equations for shock waves in the absence of a magnetic
field only by a change in the equation of state: the true equation of state P = P(V, s)
must be replaced by P* = P*(V, s), where $P^* = P + b^2/8\pi V^2$, and b denotes the constant
product HV. Accordingly ε* must be defined so as to satisfy the thermodynamic relation
$(\partial\varepsilon^*/\partial V)_s = -P^*$, whence $\varepsilon^* = \varepsilon + b^2/8\pi V$.


§73. Evolutionary shock waves

For an actual hydrodynamic discontinuity to exist, it must be stable with regard to
disintegration into two or more others. This condition can also be formulated as stating
that any infinitesimal perturbation of the initial state must lead only to infinitesimal
changes in the discontinuity. A discontinuity that meets this requirement is said to be
evolutionary. It must be emphasized that the property in question is not the opposite of
instability in the ordinary sense. The latter signifies that an initial small perturbation will
gradually increase, which leads ultimately to the breakdown of the flow concerned; but the
perturbation remains small over a sufficiently short time ($t < 1/\gamma$) even when there is an
exponential increase as $e^{\gamma t}$ with γ > 0. In a non-evolutionary discontinuity, the perturbation
becomes large immediately, although when t is small it occupies only a small region in
space. The situation is illustrated by Fig. 41 (p. 248), which shows the splitting of a
discontinuity of the density ρ(x) into two; the perturbation δρ is not small, although when t
is small and the discontinuities have not moved apart appreciably, it occupies only a
small range δx.


A condition for an evolutionary discontinuity can be derived by counting the number
of independent parameters which define an arbitrary initial (t = 0) small perturbation of
the discontinuity, and the number of equations (linearized boundary conditions at the
discontinuity) which these parameters must satisfy. The discontinuity is evolutionary if the
two numbers are the same. The boundary conditions then uniquely determine the
subsequent development of the perturbation, which remains small for small t > 0.†
If the number of equations is greater or less than the number of unknown parameters, the
problem of a small perturbation of the discontinuity has either no solution or an infinity of
solutions. Either case may occur, and such a situation indicates that the discontinuity is not
evolutionary, the initial assumption that the perturbation is small for small t being
incorrect.

[Fig. 41]

In ordinary fluid dynamics, the requirement that a shock wave be evolutionary does not 
impose any further restriction in comparison with the condition of increasing entropy:
shock waves allowed by Zemplen's theorem are necessarily evolutionary (see FM, §84). In
magnetohydrodynamics, this is not the case, and the requirement that a shock wave be 
evolutionary imposes significant new limitations on the way in which quantities vary in the 
shock wave (A. I. Akhiezer, G. Ya. Lyubarskii and R. V. Polovin, 1958). 

In order to ascertain the actual condition for shock waves to be evolutionary in 
magnetohydrodynamics, let us first count the number of equations to be satisfied by any 
small perturbation at the discontinuity surface.

The shock wave will be regarded as lying in the yz-plane. The positive x-direction is
taken to be that of the flow across the discontinuity surface. The unperturbed fields $\mathbf{H}_1$, $\mathbf{H}_2$
and velocities $\mathbf{v}_1$, $\mathbf{v}_2$ on either side of the discontinuity are assumed to be in the xy-plane.

On each side of the discontinuity surface, seven quantities undergo perturbation: three
fluid velocity components $v_x$, $v_y$, $v_z$, two magnetic field components $H_y$, $H_z$, the density
ρ = 1/V and the entropy s. The perturbations of the other thermodynamic quantities (P
and w) are determined by those of ρ and s. The equation div H = $\partial H_x/\partial x$ = 0 shows that
the longitudinal field component $H_x$ is constant along the x-axis and does not undergo any
perturbation. In addition, the speed of propagation of the shock wave itself is perturbed, i.e.
it acquires a small velocity δU relative to the chosen system of coordinates (in which the
unperturbed discontinuity is at rest). This velocity, however, can be expressed immediately
in terms of the perturbations of ρ and $v_x$, by means of the condition for the mass flux
density j to be continuous across the discontinuity; the gas velocity relative to the
discontinuity is $v_{x0} + \delta v_x - \delta U$, where $v_{x0}$ is the unperturbed velocity and $\delta v_x$ its
perturbation; writing also $\rho = \rho_0 + \delta\rho$, linearizing the boundary condition [j] = 0, and
then omitting the suffix zero to the unperturbed quantities, we find

$[\delta j] = [\rho\,\delta v_x] + [v_x\,\delta\rho] - \delta U[\rho] = 0,$

which determines δU.

† Here the wave may be either unstable (if the eigenfrequencies of the equations include some which are
complex with a positive imaginary part) or stable (if not).

Linearization of the boundary conditions of continuity of the momentum flux
component $\Pi_{zx}$ and the electric field component $E_y$ (i.e. the z-components of equations
(70.4) and (70.5)) gives the two equations

$[\rho v_x\,\delta v_z - H_x\,\delta H_z/4\pi] = 0, \qquad [H_x\,\delta v_z - v_x\,\delta H_z] = 0$

(the unperturbed values are $v_z = 0$, $H_z = 0$). These equations involve the perturbations of
only two quantities, namely

$\delta v_z, \quad \delta H_z.$ (73.1)

The boundary conditions of continuity of the energy flux $q_x$, the momentum flux
components $\Pi_{xx}$ and $\Pi_{yx}$, and the electric field component $E_z$ (i.e. equations (70.2) and
(70.3) and the y-components of (70.4) and (70.5)) give four linear equations involving the
perturbations

$\delta v_x, \quad \delta v_y, \quad \delta H_y, \quad \delta\rho, \quad \delta s,$ (73.2)

which will not be written out here.

Let us now count the number of parameters which define the perturbation of the shock 
wave. Perturbations varying with time as $e^{-i\omega t}$ are propagated in both directions from the
discontinuity as hydromagnetic waves of three kinds (Alfven, and fast and slow magnetosonic) 
and as an entropy wave; the latter is a small perturbation of the entropy transported by the 
adiabatic gas flow and moving with the gas. All these must of course be outgoing waves 
propagated to the left or right of the discontinuity. In each wave, the changes in all 
quantities are related in a specific way, as shown in §69, and therefore each wave is 
determined by just one parameter, the amplitude of some single quantity. 

Magnetosonic and entropy waves transport the perturbations (73.2); Alfvén waves, the
perturbations (73.1). Since the equations separate for these two groups of perturbations,
the condition for an evolutionary discontinuity has to be satisfied for each separately (S. I.
Syrovatskiĭ, 1958), and this makes the restrictions even stronger.

Let us first consider the conditions as regards Alfvén perturbations. This requires that the
number of outgoing waves be equal to two, the number of equations. The Alfvén wave
phase velocities relative to the discontinuity surface can have the values $v_{x1} \pm u_{A1}$,
$v_{x2} \pm u_{A2}$, where $u_A$ is the phase velocity (69.6) of the wave relative to the gas. Because of the
direction chosen for the x-axis, $v_{x1}, v_{x2} > 0$. In region 1, in front of the discontinuity, the
wave is outgoing if its phase velocity (relative to the discontinuity) is negative; in region 2,
behind the discontinuity, a positive phase velocity is needed. The wave with velocity
$v_{x1} + u_{A1}$ never satisfies this condition, and is always ingoing; that with velocity
$v_{x1} - u_{A1}$ is outgoing if $v_{x1} < u_{A1}$. Similarly, the wave with velocity $v_{x2} + u_{A2}$ is always
outgoing; that with velocity $v_{x2} - u_{A2}$ is outgoing if $v_{x2} > u_{A2}$. There are therefore two ranges in
which the condition for an evolutionary discontinuity is satisfied for Alfvén waves:

(1) $v_{x1} > u_{A1}, \quad v_{x2} > u_{A2}$,

(2) $v_{x1} < u_{A1}, \quad v_{x2} < u_{A2}$.

These are shown by the vertical lines in Fig. 42 (p. 250), which is drawn for the case in which

$u_s < u_A < u_f.$ (73.3)
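The counting argument for Alfvén perturbations can be checked mechanically. The sketch below is only an illustration of the rule stated above (negative phase velocity is outgoing in region 1, positive in region 2); the function names are hypothetical.

```python
def outgoing_alfven_waves(vx1, uA1, vx2, uA2):
    """Count Alfven waves leaving the discontinuity (flow along +x, vx1, vx2 > 0)."""
    count = 0
    # Region 1 (in front): a wave is outgoing if its phase velocity
    # relative to the discontinuity is negative.
    for velocity in (vx1 + uA1, vx1 - uA1):
        if velocity < 0:
            count += 1
    # Region 2 (behind): a wave is outgoing if its phase velocity is positive.
    for velocity in (vx2 + uA2, vx2 - uA2):
        if velocity > 0:
            count += 1
    return count

def alfven_evolutionary(vx1, uA1, vx2, uA2):
    """True when the number of outgoing waves equals the two boundary equations."""
    return outgoing_alfven_waves(vx1, uA1, vx2, uA2) == 2

case_1 = alfven_evolutionary(2.0, 1.0, 1.5, 1.0)   # vx1 > uA1, vx2 > uA2
case_2 = alfven_evolutionary(0.5, 1.0, 0.3, 1.0)   # vx1 < uA1, vx2 < uA2
mixed_a = alfven_evolutionary(2.0, 1.0, 0.5, 1.0)  # vx1 > uA1, vx2 < uA2
mixed_b = alfven_evolutionary(0.5, 1.0, 2.0, 1.0)  # vx1 < uA1, vx2 > uA2
```

The two mixed cases give one and three outgoing waves respectively, so the two boundary conditions either cannot be satisfied or leave the perturbation undetermined.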
The condition as regards magnetosonic and entropy perturbations requires that there be
four outgoing waves. There is always an outgoing entropy wave moving with the gas but
only from side 2. The number of outgoing magnetosonic waves must therefore be three.
Arguments similar to those given above for Alfvén waves indicate two ranges in which the
condition for an evolutionary discontinuity is satisfied for this group of perturbations; they
are shown by the horizontal lines in Fig. 42.†

The intersection of the vertically and horizontally lined areas defines two regions where
the discontinuity is evolutionary with regard to all perturbations: those of (1) fast shock
waves, with

$v_{n1} > u_{f1}, \qquad u_{A2} < v_{n2} < u_{f2},$ (73.4)

and (2) slow shock waves, with

$u_{s1} < v_{n1} < u_{A1}, \qquad v_{n2} < u_{s2};$ (73.5)

here the normal component of the gas velocity is again denoted by $v_n$. In the
limit of a low-intensity wave (discontinuities of all quantities weak), the fast and slow shock
waves are propagated with the respective velocities $u_{f2} \approx u_{f1}$ and $u_{s2} \approx u_{s1}$, i.e. as the
corresponding magnetosonic waves.

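Let us apply these conditions to find how the magnetic field varies in a shock wave. We
start from equation (72.2):

$H_n^2[H_t]/4\pi j^2 = [H_t/\rho].$

Since $j = \rho v_n$, this can also be written as

$\frac{H_{t2}}{H_{t1}} = \frac{v_{n1} - H_n^2/4\pi j}{v_{n2} - H_n^2/4\pi j}$ (73.6)

or, since $H_n^2/4\pi j = u_A^2/v_n$ on each side,

$\frac{H_{t2}}{H_{t1}} = \frac{v_{n2}}{v_{n1}}\cdot\frac{v_{n1}^2 - u_{A1}^2}{v_{n2}^2 - u_{A2}^2}.$ (73.7)

From the inequalities (73.4) and (73.5), with (73.7), we see that the tangential fields on the
two sides of the shock wave are not only collinear but parallel.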

In slow shock waves we have, on both sides of the discontinuity,

$v_n < H_n^2/4\pi j = u_A^2/v_n.$

Noting also that by the continuity of the mass flux $\rho_1 v_{n1} = \rho_2 v_{n2}$, we deduce from $\rho_1 < \rho_2$
that $v_{n1} > v_{n2}$, and hence, from (73.6), $H_{t2} < H_{t1}$: in a slow shock wave, the tangential magnetic field
becomes weaker. In a fast wave, however, $v_n > H_n^2/4\pi j$, and from (73.6) it follows that
$H_{t2} > H_{t1}$, the tangential magnetic field thus becoming stronger.

A particular case of shock waves is that where the magnetic field is parallel to the normal 
on both sides of the discontinuity surface. As mentioned at the beginning of §72, we can 
always choose the coordinates so as to make v and H parallel on both sides. In the case 
considered, we then have $H_{t1} = H_{t2} = 0$, $v_{t1} = v_{t2} = 0$ (a parallel shock wave). For such a
shock wave, the boundary conditions do not involve the magnetic field, i.e. they are the 
same as in ordinary fluid dynamics. The presence of the magnetic field has the result, 
however, that over a certain range of shock parameter values the conditions for an 
evolutionary discontinuity are no longer satisfied, and such shock waves become 
impossible (see Problem 1). 

All the perpendicular shock waves described at the end of §72 are evolutionary fast
compression shock waves. They are seen to be fast from the fact that when $H_n = 0$ we have
$u_A = u_s = 0$.

Having discussed the various types of discontinuity in magnetohydrodynamics, let us 
now consider the possible existence of forms transitional between these types, i.e. 
discontinuities having the properties of two types at the same time. Such possibilities are 
severely limited by conditions which result from the requirement of the evolutionary 
property. 

An Alfvén discontinuity, first of all, cannot pass continuously into a shock wave, since
for the latter the magnetic fields on either side are coplanar with the normal to the
discontinuity surface. Such a shock wave can coincide with an Alfvén discontinuity only if
H turns through 180° in it. But the tangential field component would then
change sign, whereas in an evolutionary shock wave it does not change sign.

A continuous transition between fast and slow shock waves would be possible only with
$H_t = 0$, since in a fast shock the field $H_t$ (if not zero) is strengthened, but in a slow
shock it is weakened. In other words, a continuous transition would be possible only between
fast and slow parallel shocks. But the ranges where these are evolutionary meet only
where the slow shock disappears (see Problem 1). Thus there cannot be a
continuous transition between fast and slow shocks.

A fast shock wave cannot pass continuously into a tangential discontinuity, because of the
inequalities (73.4).

There can therefore be continuous transitions only between a tangential discontinuity
and either a contact discontinuity, an Alfvén discontinuity, or a slow shock wave.
Solution. For such a gas, the heat function w = 5P/2ρ, and the boundary conditions (70.1)-(70.3) become

$\rho_1 v_1 = \rho_2 v_2, \qquad P_1 + \rho_1 v_1^2 = P_2 + \rho_2 v_2^2, \qquad v_1^2 + 5P_1/\rho_1 = v_2^2 + 5P_2/\rho_2.$

Hence $v_2 = (v_1^2 + 3u_{01}^2)/4v_1$, where $u_0 = \sqrt{5P/3\rho}$ is the ordinary velocity of sound, and

$u_{A2} = \sqrt{\rho_1/\rho_2}\,u_{A1} = u_{A1}\sqrt{v_1^2 + 3u_{01}^2}/2v_1,$

$u_{02} = \sqrt{(5v_1^2 - u_{01}^2)(v_1^2 + 3u_{01}^2)}/4v_1,$

$u_{f1} = \max(u_{01}, u_{A1}), \qquad u_{s1} = \min(u_{01}, u_{A1}),$
$u_{f2} = \max(u_{02}, u_{A2}), \qquad u_{s2} = \min(u_{02}, u_{A2}).$

When $u_{A1} < u_{01}$ the shock wave is fast and always evolutionary. Figure 43 shows the approximate form of the
functions $v_2(v_1)$ (thick curve) and $u_{A2}(v_1)$, $u_{f2}(v_1) = u_{02}(v_1)$ in this case; the slanting dot-and-dash line bisects the
right angle.

Figure 44 shows a corresponding diagram for the case $u_{A1} > u_{01}$. The thin curves are again the functions
$u_{A2}(v_1)$ and $u_{02}(v_1)$; the various parts of these curves are marked to show whether $u_{f2}$ or $u_{s2}$ coincides with them.
The thick continuous curve is the function $v_2(v_1)$ for the evolutionary ranges; the left-hand and right-hand parts
correspond to slow and fast shock waves respectively. The thick dashed curve is the non-evolutionary range
$u_{A1} < v_1 < \sqrt{4u_{A1}^2 - 3u_{01}^2}$, ending on the right at the point $v_2 = u_{A2}$.†
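The classification in this solution can be reproduced numerically. The following sketch evaluates the downstream quantities from the formulas above and then applies the inequalities (73.4) and (73.5) with $v_n = v$; the function names and the returned labels are illustrative assumptions, not notation from the text.

```python
import math

def parallel_shock_downstream(v1, u01, uA1):
    """Return (v2, uA2, u02) behind a parallel shock in a gas with w = 5P/(2 rho)."""
    v2 = (v1**2 + 3.0 * u01**2) / (4.0 * v1)
    uA2 = uA1 * math.sqrt(v2 / v1)  # u_A varies as rho**-0.5 and rho1/rho2 = v2/v1
    u02 = math.sqrt((5.0 * v1**2 - u01**2) * (v1**2 + 3.0 * u01**2)) / (4.0 * v1)
    return v2, uA2, u02

def classify_parallel_shock(v1, u01, uA1):
    """Label a parallel shock as 'fast', 'slow' or 'non-evolutionary'."""
    if v1 <= u01:
        return "no shock"  # a compression shock needs v1 above the sound speed
    v2, uA2, u02 = parallel_shock_downstream(v1, u01, uA1)
    uf1, us1 = max(u01, uA1), min(u01, uA1)
    uf2, us2 = max(u02, uA2), min(u02, uA2)
    if v1 > uf1 and uA2 < v2 < uf2:      # inequalities (73.4)
        return "fast"
    if us1 < v1 < uA1 and v2 < us2:      # inequalities (73.5)
        return "slow"
    return "non-evolutionary"

weak_field = classify_parallel_shock(2.0, 1.0, 0.5)   # u_A1 < u_01
slow_branch = classify_parallel_shock(1.5, 1.0, 2.0)  # u_01 < v1 < u_A1
gap = classify_parallel_shock(3.0, 1.0, 2.0)          # u_A1 < v1 < sqrt(13)
fast_branch = classify_parallel_shock(4.0, 1.0, 2.0)  # v1 > sqrt(13)
```

For $u_{A1} = 2u_{01}$ the non-evolutionary gap extends from $v_1 = 2u_{01}$ to $v_1 = \sqrt{13}\,u_{01}$, which is where the sample value $v_1 = 3$ falls.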

Problem 2. In front of a shock wave the tangential magnetic field $H_{t1} = 0$, and behind it $H_{t2} \neq 0$; this is
called a switch-on shock. Find the range of values of $v_{n1}$ which such a shock wave can have, in a gas with the same
thermodynamic properties as in Problem 1.

Solution. It follows from (72.2) that when $H_{t1} = 0$ the speed of the shock wave relative to the gas behind it is
$v_{n2} = H_n/\sqrt{4\pi\rho_2} = u_{A2}$. Relative to the gas in front, $v_{n1} = \rho_2 v_{n2}/\rho_1 = u_{A1}\sqrt{\rho_2/\rho_1} > u_{A1}$. This is a fast
shock; $v_{n1}$ and $v_{n2}$ are related by $v_{n1}v_{n2} = u_{A1}^2$.‡

For a switch-on shock in a gas with the specified thermodynamic properties, the boundary conditions give

$H_{t2}^2/8\pi = (\rho_1/3u_{A1}^2)(v_{n1}^2 - u_{A1}^2)(4u_{A1}^2 - 3u_{01}^2 - v_{n1}^2).$

Since the right-hand side must be positive and $v_{n1} > u_{A1}$ we see that the possible values of $v_{n1}$ in a switch-on shock
lie in the range

$u_{A1} < v_{n1} < \sqrt{4u_{A1}^2 - 3u_{01}^2}.$




† See Problem 2 as regards the segment BB.

‡ A switch-off shock is one in which $H_{t1} \neq 0$ and $H_{t2} = 0$. This is a slow shock, with $v_{n1} = u_{A1}$.

The question may be raised as to whether the switch-on and switch-off shocks are evolutionary; for example, in a
switch-on shock $v_{n1} > u_{A1}$, and it might seem that only one Alfvén perturbation wave can leave it,
backwards with $v_{n2} + u_{A2} = 2u_{A2}$. It must not be forgotten, however, that when a perturbation is imposed the
tangential magnetic field becomes non-zero.
These inequalities show that switch-on shocks can occur only if $u_{A1} > u_{01}$. In Fig. 44 they correspond to the
segment BB of the thin continuous curve.
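The result of Problem 2 is easy to tabulate. The sketch below returns the admissible range of $v_{n1}$ and the magnetic pressure $H_{t2}^2/8\pi$ of the switched-on field from the formula in the solution; the function names are hypothetical and the numerical values are arbitrary illustrations.

```python
import math

def switch_on_range(u01, uA1):
    """Range (lower, upper) of v_n1 for a switch-on shock, or None if empty."""
    if uA1 <= u01:
        return None  # the upper limit would not exceed u_A1
    return uA1, math.sqrt(4.0 * uA1**2 - 3.0 * u01**2)

def switch_on_magnetic_pressure(vn1, rho1, u01, uA1):
    """H_t2**2 / (8 pi) behind a switch-on shock (gas with w = 5P / (2 rho))."""
    return (rho1 / (3.0 * uA1**2)) * (vn1**2 - uA1**2) * (
        4.0 * uA1**2 - 3.0 * u01**2 - vn1**2
    )

allowed = switch_on_range(1.0, 2.0)      # (2, sqrt(13)) in units of u_01
forbidden = switch_on_range(1.0, 0.5)    # u_A1 < u_01: no switch-on shock
inside = switch_on_magnetic_pressure(3.0, 1.0, 1.0, 2.0)
at_upper = switch_on_magnetic_pressure(allowed[1], 1.0, 1.0, 2.0)
```

The switched-on field vanishes at both ends of the range and is positive between them.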

§74. The turbulent dynamo

Turbulent motion of a conducting fluid has the remarkable property that it may lead to
spontaneous magnetic fields which are quite strong. This is called the turbulent dynamo
effect. There are always small perturbations in a conducting fluid, resulting from causes 
extraneous to the fluid motion itself (for example, the magnetomechanical effect in 
rotating parts of a fluid, or even thermal fluctuations), and accompanied by very weak 
electric and magnetic fields. The question is whether these perturbations are, on the 
average, amplified or damped by the turbulent motion in the course of time. 

The manner of variation with time of magnetic field perturbations, once they have 
arisen, is determined by various physical agencies. The magnetic field tends to increase, by 
the purely hydromagnetic effect of the stretching of the lines of force. We have shown in 
§65 that, when a fluid with sufficiently high conductivity is in motion, the lines of magnetic 
force move as fluid lines “frozen” in, and the magnetic field varies proportionally to the 
stretching at each point on each line of force. In turbulent motion any two neighbouring 
particles move apart, on the average, in the course of time. As a result, the lines of force are 
stretched and the magnetic field is strengthened. 

The dissipation of magnetic energy, which is converted into the Joule heat of the induced
currents, tends to diminish the field. Since the energy dissipation is proportional to
(curl H)², i.e. is quadratic in the spatial derivatives of the field, it is clear that the dissipation
will be small in flow where the spatial scale of variation of the field is sufficiently great. This
does not mean that the field will be strengthened over such distances. The reason is that the
lines of force are not only stretched but also “tangled”, which reduces the spatial scale. A
situation may therefore occur where the strengthening of a field with a given scale is
replaced merely by an energy flux from large-scale turbulent eddies to smaller-scale ones;
when the scale becomes sufficiently small, the energy is dissipated.

Such a situation would occur in “two-dimensional” turbulence, where the fluid velocity v
is always parallel to a fixed xy-plane (Ya. B. Zel’dovich, 1956). It must be stressed that the
resulting field H is not assumed to be two-dimensional. The proof is as follows.

Let us first consider the time variation of the field component $H_z$ perpendicular to the
plane of the flow. With the equations div H = 0 and div v = 0 (the fluid being assumed incompressible
in the present section), the z-component of equation (66.1) is

$\partial H_z/\partial t = -(\mathbf{v}\cdot\mathrm{grad})H_z + \nu_m\Delta H_z.$ (74.1)

The first term describes just the transport of a given value of $H_z$ with
the fluid particle to which it belongs. The second term describes the “diffusional”
equalization of $H_z$ values at various points in the fluid. It is evident that neither can cause
an increase in $H_z$: if the perturbation $H_z$ occupies a finite region of space, it will
disappear in the course of time because of the “diffusion”.

To show that the components $H_x$ and $H_y$ also decay, we can put $H_z = 0$, since what we
need to exclude is the possibility that these components remain when $H_z$ has decayed.† We
shall do this under the further restriction that all quantities (v and H) are independent of
the coordinate z.† Then the vector curl H is in the z-direction, and the same is true of v × H,
so that the electric field E is likewise in the z-direction; see (66.6). In such a case the
electromagnetic field can be described by means of a vector potential A which is in the
z-direction and independent of the coordinate z: $H_x = \partial A_z/\partial y$, $H_y = -\partial A_z/\partial x$,
$E_z = -(1/c)\,\partial A_z/\partial t$, div A = $\partial A_z/\partial z$ = 0. Substituting these expressions in (66.6), we find
after a simple rearrangement the equation for $A_z$:

$\partial A_z/\partial t = -(\mathbf{v}\cdot\mathrm{grad})A_z + \nu_m\Delta A_z,$ (74.2)

which agrees exactly in form with (74.1). Hence it again follows that in the course of time
the perturbations $A_z$ decay, and therefore so do $H_x$ and $H_y$.

The turbulent dynamo is thus an essentially three-dimensional effect. As an illustration, 
the following is an example of a flow which amplifies the field without altering its spatial 
scale (S. I. Vainshtein and Ya. B. Zel’dovich, 1972). Let us consider a set of closed lines of 
magnetic force frozen inside a torus of fluid (Fig. 45a). Let the flow stretch the torus by a 
lengthwise factor of two, say (Fig. 45b); the cross-sectional area decreases by the same
factor, and the magnetic field increases. Next, suppose that the torus is twisted by the flow 
(Fig. 45c), and that the loops are then superimposed (Fig. 45d). This gives a configuration 
of about the same size as the original one, but with twice the field in the torus. Repetitions 
of this cycle give an unlimited exponential increase of the field. Such a flow is evidently 
fundamentally three-dimensional. 

This illustration is, of course, not a proof that the turbulent dynamo actually exists; there 
are also flows which reduce the scale of the field. To settle this point, we need to make a 
direct investigation of the stability of turbulent motion of a conducting fluid with respect 
to small initial perturbations of the magnetic field. This approach has made it seem very 
likely that, when the magnetic Reynolds number is sufficiently large, a magnetic field is in 
fact generated.‡ These fairly complicated analyses will not be given here; we shall discuss
only the general form of the turbulence established by the process, on the assumption that 
the turbulent dynamo exists. 

Turbulent flow may be regarded as an assembly of turbulent eddies of various sizes, from
the basic “external” scale l to the smallest “internal” scale $\lambda_0$. The former is the same as the
characteristic dimensions of the region in which the turbulent flow takes place; the latter
gives the order of magnitude of the distances over which energy dissipation becomes
important (see FM, §§31, 32). By “steady turbulence” we mean that the mean
characteristics (averaged over times that are of the order of the periods of the
corresponding fluctuations but of course are short compared with the total observation
time) are constant. The suffix λ will denote mean values for eddies with scale λ; for example,
$v_\lambda$ and $H_\lambda$ are the mean variations of the velocity and the field over distances ~ λ.

[Fig. 45]

† This assumption is made here only for purposes of simplification and is not fundamental. The same result
is reached without this restriction by somewhat more complex arguments.

‡ See S. I. Vaĭnshteĭn, Zhurnal éksperimental’noĭ i teoreticheskoĭ fiziki 79, 2175, 1980; 83, 161, 1982.

The statement that a turbulent dynamo exists means that on the basic scale l there are
magnetic fields whose energy density $H_l^2/8\pi$ is comparable with the kinetic energy density
of the fluid. That is, the Alfvén velocity in the field $H_l$,

$u_A \sim H_l/\sqrt{4\pi\rho},$ (74.3)

is comparable with the basic scale of the fluid velocity $v_l \equiv u$ (the change in the mean
velocity over distances ~ l). For scales λ ≪ l the fields $H_\lambda \ll H_l$.†

We must immediately emphasize the chief difference between magnetic and ordinary 
turbulence. In the latter, motion on the basic scale does not significantly affect the 
properties of the small-scale eddies, but simply transports them convectively. In the 
magnetic case, however, the basic-scale field H, affects the motion on all smaller scales. 

Since for scales λ ≪ l the field $H_l$ may be regarded as locally uniform, and $H_\lambda \ll H_l$, the
small-scale flow in this case is just an assembly of small-amplitude hydromagnetic waves
with wave numbers k ~ 1/λ and velocities ~ $u_A$. According to (69.11), in these waves the
kinetic energy of the fluid is the same as the magnetic energy. That is, in the small-scale
eddies there is almost exact equipartition between the magnetic and kinetic energies:

$\rho v_\lambda^2 \sim H_\lambda^2/4\pi.$ (74.4)

This relation, as an order-of-magnitude one, can be extended to the basic scale, for which it
gives $u \sim u_A$, in agreement with the assumption made.

Let us consider the range of scales λ

$l \gg \lambda \gg \lambda_0.$ (74.5)

It must be remembered that the viscous and Joule dissipations will in general become
important for different values of λ, and in this sense magnetic turbulence may have two
internal scales, $\lambda_0$ in (74.5) being taken as the larger of the two, so that in the inertial range
(74.5) there is no dissipation.

As in ordinary turbulence theory, we introduce the mean amount of energy ε dissipated per
unit time and unit mass of fluid. This energy is derived from the large-scale flow, whence
it is gradually transferred to smaller and smaller scales until it is dissipated in eddies of size
$\lambda \lesssim \lambda_0$. It is evident that in the inertial range, where there is no dissipation, ε is also the
constant (independent of λ) flux of energy towards smaller scales. In ordinary non-magnetic
turbulence it was possible to assert that the local properties (i.e. over distances
λ ≪ l) of the turbulence must be determined only by ρ, ε and of course the distances λ
themselves, but not by the scales l and u of the sizes and velocities as a whole; this was
sufficient to give the dependence of $v_\lambda$ on λ from dimensional arguments. In magnetic
turbulence, the local properties may depend also on the field $H_l$ (or, equivalently, on the
velocity $u_A$). Dimensional arguments then do not suffice to determine $v_\lambda$, and it is necessary
to bring in the actual mechanism by which the energy flux is established.


† Here we are considering the generation of a turbulent magnetic field whose spatial variation has
characteristic dimensions ≲ l. The generation of a “large-scale” field with dimensions ≫ l is found to be possible
only if the averaged properties of the turbulent flow are not invariant under spatial inversion, a case not discussed
here. The theory is given in the books by S. I. Vaĭnshteĭn, Ya. B. Zel’dovich and A. A. Ruzmaĭkin, Turbulentnoe
dinamo v astrofizike (The Turbulent Dynamo in Astrophysics), Nauka, Moscow, 1980; H. K. Moffatt, Magnetic
Field Generation in Electrically Conducting Fluids, Cambridge University Press, Cambridge, 1978.
This mechanism is the mutual interaction of small-amplitude hydromagnetic waves
which is described by non-linear terms in the equations of motion. The energy flux ε must
therefore be expanded in powers of the small amplitudes $v_\lambda$, and this expansion must begin
with terms above the second order (the quadratic terms would correspond to ordinary
dissipation, which is absent here). The third-order terms would depend on the random
phases of the interacting waves, and would vanish on averaging over these phases. Hence
$\varepsilon \propto v_\lambda^4$. The proportionality factor can be found from dimensional arguments (the
dimensions of ε are erg/g·s = cm²/s³):

$\varepsilon \sim v_\lambda^4/u_A\lambda,$ (74.6)

$v_\lambda \sim (\varepsilon u_A\lambda)^{1/4}$ (74.7)

(R. H. Kraichnan, 1965). This replaces Kolmogorov and Obukhov’s law $v_\lambda \sim (\varepsilon\lambda)^{1/3}$ in
ordinary turbulence. Extrapolation of (74.6) to the basic scale gives for ε the estimate
$\varepsilon \sim u^4/u_A l \sim u^3/l$, as in non-magnetic fluid dynamics.
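The difference between the two scaling laws can be made concrete with a short sketch. Both relations are order-of-magnitude estimates, so the unit prefactors used below are an assumption made only for illustration, as are the function names.

```python
def kraichnan_velocity(eps, u_A, lam):
    """v_lambda from (74.7): (eps * u_A * lam)**(1/4), prefactor taken as 1."""
    return (eps * u_A * lam) ** 0.25

def kolmogorov_velocity(eps, lam):
    """Kolmogorov-Obukhov law for ordinary turbulence: (eps * lam)**(1/3)."""
    return (eps * lam) ** (1.0 / 3.0)

def energy_flux(v_lam, u_A, lam):
    """Energy flux from (74.6): v_lambda**4 / (u_A * lam)."""
    return v_lam**4 / (u_A * lam)

eps, u_A = 2.0, 5.0  # arbitrary illustrative values in consistent units
ratio_magnetic = kraichnan_velocity(eps, u_A, 0.02) / kraichnan_velocity(eps, u_A, 0.01)
ratio_ordinary = kolmogorov_velocity(eps, 0.02) / kolmogorov_velocity(eps, 0.01)
recovered_eps = energy_flux(kraichnan_velocity(eps, u_A, 0.01), u_A, 0.01)
```

Doubling the scale raises $v_\lambda$ by a factor $2^{1/4}$ in the magnetic case and $2^{1/3}$ in the ordinary case, so the velocity fluctuations fall off more slowly towards small scales when the large-scale field is present.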

The internal scale of turbulence $\lambda_0$ can be estimated by regarding the small eddies as
hydromagnetic waves against the background of the large-scale field $H_l$. The viscosity and
conductivity of the medium cause absorption of these waves; the corresponding
absorption coefficient $\gamma$ has been found in §69, Problem. The dissipation becomes important
when the absorption length $1/\gamma$ is comparable with the wavelength, i.e. with the scale $\lambda$.
Since the hydromagnetic waves are propagated with velocity $u_A$ (independent of their
wavelength), the frequency $\omega \sim u_A/\lambda$. The absorption coefficient can be estimated as
$\gamma \sim (\nu + \nu_m)/\lambda^2 u_A$. From the condition that $\gamma \sim 1/\lambda$ when $\lambda \sim \lambda_0$, we find the internal scale

$\lambda_0 \sim (\nu + \nu_m)/u_A \sim (\nu + \nu_m)/u.$ (74.8)

If $\nu \gg \nu_m$, then $\lambda_0 \sim \nu/u \sim l/R$, where $R \sim ul/\nu$ is the Reynolds number for the basic
motion. Similarly, if $\nu_m \gg \nu$, then $\lambda_0 \sim l/R_m$.

There is one further interesting property of the turbulent flow of a highly conducting
medium: the magnetic field is displaced from the turbulent region. Let us consider a finite
region occupied by the turbulence, with a magnetic field outside it. The lines of force of this
field, on entering the turbulent region, become tangled, since they are “frozen”; the
magnetic field becomes random in direction. This means that the time-average field
becomes almost zero, and the higher the conductivity, the more nearly exact is the
vanishing of the field; finite conductivity gives rise to a “slippage” of the lines of force, so
that the field does not become completely random. In other words, when a fairly weak
magnetic field is applied to a fluid in turbulent motion (within a limited region), the fluid
behaves diamagnetically with a small permeability $\mu \ll 1$ which decreases with increasing
magnetic Reynolds number $R_m$.

If the magnetic field is sufficiently strong, however, it is unable to penetrate into t ® 

This does not mean that a strong field will suppress turbulence completely. Two-
dimensional turbulence, with the fluid velocity everywhere parallel to the $xy$-plane and
independent of the coordinate $z$, can occur in any uniform external magnetic field (in the
$z$-direction), no matter how strong. For, in this case, curl (v × H) = (H · grad)v = $H\,\partial\mathbf{v}/\partial z$ = 0,
and it follows from (65.2) that the flow does not perturb the external field, which remains
uniform. No currents arise, therefore, and the Lorentz force is zero. We may say that a
two-dimensional flow “does not see” a uniform field. In a strong external field, the
turbulence degenerates into just this two-dimensional form.


CHAPTER IX 


THE ELECTROMAGNETIC WAVE EQUATIONS 


§75. The field equations in a dielectric in the absence of dispersion

In §58 we gave the equations for a variable electromagnetic field in a metal:

$\operatorname{curl}\mathbf{H} = 4\pi\sigma\mathbf{E}/c, \qquad \operatorname{curl}\mathbf{E} = -(1/c)\,\partial\mathbf{B}/\partial t,$ (75.1)

which hold when the field changes sufficiently slowly: the frequencies of the field must be
such that the dependence of j on E (and of B on H, if needed) is that corresponding to the
static case.†

We shall now examine the corresponding problem for a variable electromagnetic field in
a dielectric, and shall formulate equations valid for frequencies such that the relations
between D and E, and B and H, are the same as when the fields are constant. If, as usually
happens, these relations are simple proportionalities, this means that we can put

$\mathbf{D} = \varepsilon\mathbf{E}, \qquad \mathbf{B} = \mu\mathbf{H},$ (75.2)

with the static values of $\varepsilon$ and $\mu$.

These relations are not valid (or, as we say, $\varepsilon$ and $\mu$ exhibit dispersion) at frequencies
comparable with the eigenfrequencies of the molecular or electronic vibrations which lead
to the electric or magnetic polarization of the matter. The order of magnitude of such
frequencies depends on the substance concerned, and varies widely. It may also be entirely
different for electric and for magnetic phenomena.‡
The equations

$\operatorname{div}\mathbf{B} = 0,$ (75.3)

$\operatorname{curl}\mathbf{E} = -(1/c)\,\partial\mathbf{B}/\partial t$ (75.4)

are obtained immediately by replacing e and h in the exact microscopic Maxwell’s
equations by their averaged values E and B, and are therefore always valid. The equation

$\operatorname{div}\mathbf{D} = 0$ (75.5)

is obtained (§6) by averaging the exact microscopic equation div e = $4\pi\rho$, using only the
fact that the total charge on the body is zero. This result is evidently independent of the
assumption made in §6 that the field is static, and equation (75.5) is therefore valid in
variable fields also.


† … as they stand. In the problems …

… condition was necessary in order to justify the neglect of retardation effects in the …

‡ For example, where the electric polarization is due to the electrons, the dispersion of $\varepsilon$ begins only
in the ultra-violet. In a polar liquid such as water, the polarization is due to the orientation of molecules with
permanent dipole moments, and the dispersion of $\varepsilon$ appears at frequencies $\omega \sim 10^{11}$ sec$^{-1}$, i.e. in the centimetre
wavelength range. The dispersion of $\mu$ in ferromagnets may begin at even lower frequencies.



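When the macroscopic field depends on time, the establishment of the relation between the
mean value $\overline{\rho\mathbf{v}}$ and the other quantities is fairly difficult. It is simpler to effect the averaging
in the following more formal way.

Let us assume for the moment that extraneous charges of volume density $\rho_{\rm ex}$ are placed
in the dielectric. The motion of these charges causes an “extraneous current” $\mathbf{j}_{\rm ex}$, and the
conservation of charge is expressed by an equation of continuity:

$\partial\rho_{\rm ex}/\partial t + \operatorname{div}\mathbf{j}_{\rm ex} = 0.$

Instead of equation (75.5) we have div D = $4\pi\rho_{\rm ex}$; see (6.8). Differentiating this equation
with respect to time and using the equation of continuity, we obtain $\partial(\operatorname{div}\mathbf{D})/\partial t
= 4\pi\,\partial\rho_{\rm ex}/\partial t = -4\pi\operatorname{div}\mathbf{j}_{\rm ex}$, or

$\operatorname{div}\left(\dfrac{\partial\mathbf{D}}{\partial t} + 4\pi\mathbf{j}_{\rm ex}\right) = 0.$

Hence it follows that the vector in parentheses can be written as the curl of another vector,
which we denote by $c\mathbf{H}$. Thus

$\operatorname{curl}\mathbf{H} = \dfrac{4\pi}{c}\,\mathbf{j}_{\rm ex} + \dfrac{1}{c}\dfrac{\partial\mathbf{D}}{\partial t}.$ (75.8)
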
Outside the body this must be the same as the exact Maxwell’s equation for the field in a
vacuum, and therefore H is the magnetic field. Inside the body, in the static case, the current
$\mathbf{j}_{\rm ex}$ is related to the magnetic field by the equation curl H = $4\pi\mathbf{j}_{\rm ex}/c$, where H is the quantity
introduced in §29 and related in a definite manner to the mean field B. Hence it follows
that, in the limit of zero frequency, the vector H in equation (75.8) is the static quantity
H(B), and our present assumption that the field varies “slowly” means that the same
relation H(B) holds between these variable fields. Thus H is a definite quantity, so that we
can drop the auxiliary quantity $\mathbf{j}_{\rm ex}$ and obtain the final equation

$\operatorname{curl}\mathbf{H} = (1/c)\,\partial\mathbf{D}/\partial t.$ (75.9)

The quantity $(1/4\pi)\,\partial\mathbf{D}/\partial t$ is called the displacement current.

This equation replaces in dielectrics the first equation (75.1) for the field in metals. It
might be supposed that the term in $\partial\mathbf{E}/\partial t$ ought to be included when this equation is used
for variable fields in metals also, giving

$\operatorname{curl}\mathbf{H} = \dfrac{4\pi\sigma}{c}\,\mathbf{E} + \dfrac{\varepsilon}{c}\dfrac{\partial\mathbf{E}}{\partial t},$ (75.10)

with $\varepsilon$ constant. In good conductors such as the true metals, however, the introduction of
this term is pointless. The two terms on the right-hand side of (75.10) are essentially the first
two terms in an expansion in powers of the field frequency. Since this frequency is assumed
small, the second term must represent at most a small correction. In actual fact, in metals
the corrections for the effect of the spatial non-uniformity of the field become important
sooner than the frequency correction (see the first footnote to §59).

There are, however, substances, namely poor conductors, for which equation (75.10)
may be meaningful. For such reasons as the small number of conduction electrons in
semiconductors, or the small mobility of the ions in electrolyte solutions, these substances
exhibit anomalously low conductivity, and hence the second term on the right of equation
(75.10) may be comparable with, or even exceed, the first term at frequencies for which $\sigma$
and $\varepsilon$ may still be regarded as constants. In a field of a single frequency $\omega$, the ratio of the
second term to the first is $\varepsilon\omega/4\pi\sigma$. If this ratio is small, the body behaves as an ordinary
conductor with conductivity $\sigma$. At frequencies $\omega \gg 4\pi\sigma/\varepsilon$, it behaves as a dielectric with
permittivity $\varepsilon$.
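As a rough numerical illustration of this criterion, the sketch below evaluates the ratio $\varepsilon\omega/4\pi\sigma$ of the displacement term to the conduction term and the crossover frequency $4\pi\sigma/\varepsilon$. Gaussian units are assumed, so that $\sigma$ has the dimensions sec$^{-1}$; the function names and the sample values of $\varepsilon$ and $\sigma$ are hypothetical and are not data for any particular substance.

```python
import math


def displacement_to_conduction_ratio(eps, sigma, omega):
    """Ratio eps*omega/(4*pi*sigma) of the two terms on the right of (75.10).

    Gaussian units are assumed: sigma and omega both in sec^-1, eps dimensionless.
    """
    return eps * omega / (4.0 * math.pi * sigma)


def crossover_frequency(eps, sigma):
    """Frequency 4*pi*sigma/eps at which the two terms are equal in magnitude."""
    return 4.0 * math.pi * sigma / eps


if __name__ == "__main__":
    eps, sigma = 10.0, 1.0e6  # made-up values for a poor conductor
    omega_c = crossover_frequency(eps, sigma)
    for omega in (1e-2 * omega_c, omega_c, 1e2 * omega_c):
        ratio = displacement_to_conduction_ratio(eps, sigma, omega)
        regime = "conductor-like" if ratio < 1 else "dielectric-like"
        print(f"omega = {omega:.3e} sec^-1, ratio = {ratio:.3e} ({regime})")
```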

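In a homogeneous medium with constant $\varepsilon$ and $\mu$, equations (75.3)-(75.5) and (75.9)
become

$\operatorname{div}\mathbf{E} = 0, \qquad \operatorname{div}\mathbf{H} = 0,$ (75.11)

$\operatorname{curl}\mathbf{E} = -\dfrac{\mu}{c}\dfrac{\partial\mathbf{H}}{\partial t}, \qquad \operatorname{curl}\mathbf{H} = \dfrac{\varepsilon}{c}\dfrac{\partial\mathbf{E}}{\partial t}.$ (75.12)

Eliminating E in the usual manner, we obtain

$\operatorname{curl}\operatorname{curl}\mathbf{H} = \dfrac{\varepsilon}{c}\dfrac{\partial}{\partial t}\operatorname{curl}\mathbf{E} = -\dfrac{\varepsilon\mu}{c^2}\dfrac{\partial^2\mathbf{H}}{\partial t^2},$

and, since curl curl H = grad div H $-\,\Delta$H = $-\Delta$H, we reach the wave equation

$\Delta\mathbf{H} - \dfrac{\varepsilon\mu}{c^2}\dfrac{\partial^2\mathbf{H}}{\partial t^2} = 0.$

A similar equation for E can be obtained by eliminating H. We see that the velocity of
propagation of electromagnetic waves in a homogeneous dielectric is

$c/\sqrt{\varepsilon\mu}.$ (75.13)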
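A quick numerical check of this result can be made by inserting a plane wave into the wave equation $\Delta\mathbf{H} - (\varepsilon\mu/c^2)\,\partial^2\mathbf{H}/\partial t^2 = 0$. The sketch below is an illustration only: it takes one component $H = \cos(kx - \omega t)$ with $\omega = kc/\sqrt{\varepsilon\mu}$ and evaluates both second derivatives by central differences. The function names, the step `h` and the sample values are assumptions made for the example.

```python
import math


def wave_speed(eps, mu, c=2.998e10):
    """Propagation velocity c/sqrt(eps*mu); c defaults to cm/sec."""
    return c / math.sqrt(eps * mu)


def wave_equation_residual(eps, mu, k=1.0, x=0.3, t=0.2, h=1e-3, c=1.0):
    """Finite-difference value of d2H/dx2 - (eps*mu/c^2) d2H/dt2 for a plane wave.

    H(x, t) = cos(k*x - omega*t) with omega = k * wave_speed(eps, mu, c).
    The result should vanish up to the O(h^2) discretization error.
    """
    omega = k * wave_speed(eps, mu, c)

    def field(xx, tt):
        return math.cos(k * xx - omega * tt)

    d2x = (field(x + h, t) - 2.0 * field(x, t) + field(x - h, t)) / h ** 2
    d2t = (field(x, t + h) - 2.0 * field(x, t) + field(x, t - h)) / h ** 2
    return d2x - eps * mu / c ** 2 * d2t


if __name__ == "__main__":
    print("speed for eps = 4, mu = 1 (units of c):", wave_speed(4.0, 1.0, c=1.0))
    print("residual:", wave_equation_residual(4.0, 1.0))
```

Choosing any other relation between $\omega$ and $k$ leaves a residual of order $k^2$ rather than of order $h^2$, which is the sense in which (75.13) is the velocity of propagation.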

The energy flux density is made up of the electromagnetic field energy flux and the flux of
energy carried directly by the movement of matter. In a medium at rest (as is considered
here) the latter is zero, and the energy flux density in a dielectric is given by the same
formula (30.20) as in a metal:

$\mathbf{S} = c\,\mathbf{E}\times\mathbf{H}/4\pi.$ (75.14)

This is easily seen by calculating div S. Using equations (75.4) and (75.9), we obtain

$\operatorname{div}\mathbf{S} = -\dfrac{1}{4\pi}\left(\mathbf{E}\cdot\dfrac{\partial\mathbf{D}}{\partial t} + \mathbf{H}\cdot\dfrac{\partial\mathbf{B}}{\partial t}\right) = -\dfrac{\partial U}{\partial t},$ (75.15)

in accordance with the expression $dU = (\mathbf{E}\cdot d\mathbf{D} + \mathbf{H}\cdot d\mathbf{B})/4\pi$ for the differential of the
internal energy of a dielectric at given density and entropy.

The requirement of symmetry of the four-dimensional energy-momentum tensor for
any closed system (in this case, a dielectric in an electromagnetic field) implies that the
energy flux density must be the same, apart from a factor $c^2$, as the spatial density of
momentum (see Fields, §§32, 94), which is therefore

$\mathbf{E}\times\mathbf{H}/4\pi c.$ (75.16)

This expression must, in particular, be used in determining the forces on a dielectric in a
variable electromagnetic field. The force f per unit volume may be calculated from the
stress tensor $\sigma_{ik}$: $f_i = \partial\sigma_{ik}/\partial x_k$. Here, however, it must be remembered that $\sigma_{ik}$ is the
momentum flux density, which includes the momentum of both the matter and
the electromagnetic field. If f is taken as the force on the medium, the rate of change of the
field momentum per unit volume must be subtracted:

$f_i = \dfrac{\partial\sigma_{ik}}{\partial x_k} - \dfrac{\partial}{\partial t}\dfrac{(\mathbf{E}\times\mathbf{H})_i}{4\pi c}.$ (75.17)

In a constant field the last term is zero, and so this question did not arise previously.

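Since the field varies slowly, the stress tensor may be taken to have the same value as in a
static field. For instance, in a fluid dielectric, $\sigma_{ik}$ is given by the sum of the electric part (15.9)
and the magnetic part (35.2). In differentiating these expressions with respect to the
coordinates we must use the fact that the equations curl E = 0, curl H = 0 for a static
field (in the absence of currents) are replaced by equations (75.12). This introduces new
terms $-\varepsilon\mathbf{E}\times\operatorname{curl}\mathbf{E}/4\pi - \mu\mathbf{H}\times\operatorname{curl}\mathbf{H}/4\pi$, which are now not zero but equal to

$\dfrac{\varepsilon\mu}{4\pi c}\,\mathbf{E}\times\dfrac{\partial\mathbf{H}}{\partial t} - \dfrac{\varepsilon\mu}{4\pi c}\,\mathbf{H}\times\dfrac{\partial\mathbf{E}}{\partial t} = \dfrac{\varepsilon\mu}{4\pi c}\dfrac{\partial}{\partial t}(\mathbf{E}\times\mathbf{H}).$

The required force is therefore

$\mathbf{f} = -\operatorname{grad}P_0(\rho, T) - \dfrac{E^2}{8\pi}\operatorname{grad}\varepsilon - \dfrac{H^2}{8\pi}\operatorname{grad}\mu + \operatorname{grad}\left[\rho\left(\dfrac{\partial\varepsilon}{\partial\rho}\right)_T\dfrac{E^2}{8\pi} + \rho\left(\dfrac{\partial\mu}{\partial\rho}\right)_T\dfrac{H^2}{8\pi}\right] + \dfrac{\varepsilon\mu - 1}{4\pi c}\dfrac{\partial}{\partial t}(\mathbf{E}\times\mathbf{H}).$ (75.18)

The last term in this expression is called the Abraham force (M. Abraham, 1909).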


§76. The electrodynamics of moving dielectrics 

The motion of a medium results in an interaction between the electric and magnetic
fields. Such phenomena for conductors have been discussed in §63; we shall now examine
them for dielectrics. Here we are in practice concerned with the phenomena occurring in
moving media when external electric or magnetic fields are present. It should be
emphasized that they are in no way related to the appearance of fields as a result of the
motion itself.

… when the frame of reference is changed. … the averaging of which … the transformation
of electric and magnetic … of E and B. In dielectrics … greater number of quantities.
In the motion of macroscopic bodies, the velocities involved are usually small compared
with the velocity of light. To obtain the necessary approximate transformation formulae,
however, it is simplest to use the exact relativistic formulae which hold for all velocities.

In the electrodynamics of the field in a vacuum, the components of the electric and
magnetic field vectors e and h are actually components of an antisymmetrical four-
dimensional tensor (or four-tensor) of rank two (see Fields, §23). The same is true of E and
B, which are the mean values of e and h. Thus there is a four-tensor $F_{\mu\nu}$ whose components
are given by†

$F_{\mu\nu} = \begin{pmatrix} 0 & E_x & E_y & E_z \\ -E_x & 0 & -B_z & B_y \\ -E_y & B_z & 0 & -B_x \\ -E_z & -B_y & B_x & 0 \end{pmatrix}.$ (76.1)

Using this tensor, the first two Maxwell’s equations,

$\operatorname{div}\mathbf{B} = 0, \qquad \operatorname{curl}\mathbf{E} = -(1/c)\,\partial\mathbf{B}/\partial t,$ (76.2)

can be written in the four-dimensional form

$\dfrac{\partial F_{\mu\nu}}{\partial x^\lambda} + \dfrac{\partial F_{\nu\lambda}}{\partial x^\mu} + \dfrac{\partial F_{\lambda\mu}}{\partial x^\nu} = 0.$ (76.3)


This shows the relativistic invariance of the equations. The applicability of equations 
(76.2) to moving bodies is evident, since they are obtained directly from the exact 
microscopic Maxwell’s equations by replacing e and h by their averaged values E and B. 
The second pair of Maxwell’s equations

$\operatorname{div}\mathbf{D} = 0, \qquad \operatorname{curl}\mathbf{H} = (1/c)\,\partial\mathbf{D}/\partial t$ (76.4)


also retain their form in moving media. This is seen from the arguments given in §75, in 
which we used only general properties of bodies (e.g. that the total charge is zero), equally 
valid for moving bodies and bodies at rest. However, the relations between D and E, and B 
and H, need not be the same as in bodies at rest. 

Since they are valid for bodies both at rest and in motion, equations (76.4) must be
unaltered by the Lorentz transformation. For a field in a vacuum, the vectors D and H are
the same as E and B, and the relativistic invariance of the second pair of Maxwell’s
equations appears in the fact that they also can be written in four-dimensional form, using
the same tensor $F^{\mu\nu}$: $\partial F^{\mu\nu}/\partial x^\nu = 0$ (see Fields, §30). Hence it is clear that, to ensure the
relativistic invariance of equations (76.4), it is necessary that the components of the vectors
D and H should be transformed as the components of a four-tensor exactly similar to
$F_{\mu\nu}$, which we denote by $H_{\mu\nu}$:

$H_{\mu\nu} = \begin{pmatrix} 0 & D_x & D_y & D_z \\ -D_x & 0 & -H_z & H_y \\ -D_y & H_z & 0 & -H_x \\ -D_z & -H_y & H_x & 0 \end{pmatrix}.$ (76.5)


† In the present section the four-dimensional tensor suffixes, which take the values 0, 1, 2, 3, are denoted by
Greek letters $\lambda$, $\mu$, $\nu$.
Using this tensor, we can write equations (76.4) in the form

$\partial H^{\mu\nu}/\partial x^\nu = 0.$ (76.6)

Having elucidated that the quantities E, D, H, B form four-dimensional tensors, we
have also ascertained the law of their transformation from one frame of reference to
another. However, we are interested rather in the relations between the quantities in a
moving medium, which generalize the relations D = $\varepsilon$E and B = $\mu$H valid in a medium at
rest.

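We denote by $u^\mu$ the velocity four-vector of the medium; its components are related to
the three-dimensional velocity v by

$u^\mu = \left(\dfrac{1}{\sqrt{1 - v^2/c^2}},\ \dfrac{\mathbf{v}}{c\sqrt{1 - v^2/c^2}}\right).$

From this four-vector and the four-tensors $F^{\mu\nu}$ and $H^{\mu\nu}$ we form combinations which
become E and D in a medium at rest. These combinations are the four-vectors $F^{\lambda\mu}u_\mu$ and
$H^{\lambda\mu}u_\mu$; for v = 0 their time components are zero and their space components are E and D
respectively. The four-dimensional generalization of the equation D = $\varepsilon$E is therefore
evidently†

$H^{\lambda\mu}u_\mu = \varepsilon F^{\lambda\mu}u_\mu.$ (76.7)

Similarly, we see that the generalization of B = $\mu$H is the four-dimensional equation

$F_{\lambda\mu}u_\nu + F_{\mu\nu}u_\lambda + F_{\nu\lambda}u_\mu = \mu\,(H_{\lambda\mu}u_\nu + H_{\mu\nu}u_\lambda + H_{\nu\lambda}u_\mu).$ (76.8)

Returning from the four-dimensional to the three-dimensional notation, we derive from
these two equations the vector relations‡

$\mathbf{D} + \mathbf{v}\times\mathbf{H}/c = \varepsilon(\mathbf{E} + \mathbf{v}\times\mathbf{B}/c),$
$\mathbf{B} + \mathbf{E}\times\mathbf{v}/c = \mu(\mathbf{H} + \mathbf{D}\times\mathbf{v}/c).$ (76.9)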

These formulae, first derived by H. Minkowski (1908), are exact in the sense that no
assumption has yet been made concerning the magnitude of the velocity. If the ratio $v/c$ is
assumed small, the equations can be solved for D and B as far as terms of the first order to
give

$\mathbf{D} = \varepsilon\mathbf{E} + (\varepsilon\mu - 1)\,\mathbf{v}\times\mathbf{H}/c,$ (76.10)

$\mathbf{B} = \mu\mathbf{H} + (\varepsilon\mu - 1)\,\mathbf{E}\times\mathbf{v}/c.$ (76.11)

These formulae, together with Maxwell’s equations (76.2) and (76.4), form the basis for the
electrodynamics of dielectrics in motion.
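That the first-order formulae satisfy the exact Minkowski relations up to terms of order $v^2/c^2$ can be checked numerically. The sketch below is an illustration under stated assumptions: it builds D and B from the first-order expressions D = $\varepsilon$E + $(\varepsilon\mu-1)$v × H/$c$ and B = $\mu$H + $(\varepsilon\mu-1)$E × v/$c$, substitutes them into D + v × H/$c$ = $\varepsilon$(E + v × B/$c$) and B + E × v/$c$ = $\mu$(H + D × v/$c$), and returns the size of what is left over. The helper names and the sample vectors are hypothetical, and units with $c = 1$ are used by default.

```python
import math


def cross(a, b):
    """Vector product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def lin(*terms):
    """Linear combination sum(coef * vec) over (coef, vec) pairs."""
    return tuple(sum(coef * vec[i] for coef, vec in terms) for i in range(3))


def norm(a):
    return math.sqrt(sum(x * x for x in a))


def first_order_inductions(E, H, v, eps, mu, c=1.0):
    """D and B to first order in v/c."""
    k = (eps * mu - 1.0) / c
    D = lin((eps, E), (k, cross(v, H)))
    B = lin((mu, H), (k, cross(E, v)))
    return D, B


def minkowski_residuals(E, H, v, eps, mu, c=1.0):
    """Norms of (left side - right side) of the two exact relations."""
    D, B = first_order_inductions(E, H, v, eps, mu, c)
    r1 = lin((1.0, D), (1.0 / c, cross(v, H)), (-eps, E), (-eps / c, cross(v, B)))
    r2 = lin((1.0, B), (1.0 / c, cross(E, v)), (-mu, H), (-mu / c, cross(D, v)))
    return norm(r1), norm(r2)


if __name__ == "__main__":
    E, H = (1.0, 0.5, -0.3), (0.2, -1.0, 0.7)
    for scale in (1.0, 0.5, 0.25):
        v = tuple(scale * x for x in (0.01, 0.02, -0.015))
        print(scale, minkowski_residuals(E, H, v, eps=2.5, mu=1.2))
```

Halving the velocity reduces both residuals by a factor of four, as expected for terms of the second order in $v/c$.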

The boundary conditions on Maxwell’s equations are also somewhat modified. From
the equations div D = 0, div B = 0 the continuity of the normal components of the
inductions follows as before:

$D_{n1} = D_{n2}, \qquad B_{n1} = B_{n2}.$ (76.12)

The conditions on the tangential components of the fields are most simply obtained b) 


† It should be noted that, by writing down relations involving only the local value of the velocity, we neglect
slight effects due to the possibility of a velocity gradient.
‡ If either of the relations D = $\varepsilon$E and B = $\mu$H does not hold in a medium at rest, the corresponding relation
(76.9) is replaced by a different functional relation between the vector sums on the two sides of the equation.
changing from the fixed frame of reference $K$ to another, $K'$, which moves with the surface
element considered, whose velocity along the normal n we denote by $v_n$. The usual
conditions, namely that $\mathbf{E}'_t$ and $\mathbf{H}'_t$ are continuous, hold in the frame $K'$. By the relativistic
transformation formulae (see Fields, §24), these are equivalent to the continuity of the
tangential components of the vectors E + v × B/$c$ and H − v × D/$c$. Taking the components
perpendicular to n and using equations (76.12), we obtain the required boundary
conditions:

$\mathbf{n}\times(\mathbf{E}_2 - \mathbf{E}_1) = v_n(\mathbf{B}_2 - \mathbf{B}_1)/c,$
$\mathbf{n}\times(\mathbf{H}_2 - \mathbf{H}_1) = -v_n(\mathbf{D}_2 - \mathbf{D}_1)/c.$ (76.13)


If we substitute here the expressions (76.10) and (76.11), and neglect terms of higher
order in $v/c$, we obtain

$\mathbf{n}\times(\mathbf{E}_2 - \mathbf{E}_1) = v_n(\mu_2 - \mu_1)\mathbf{H}_t/c,$
$\mathbf{n}\times(\mathbf{H}_2 - \mathbf{H}_1) = -v_n(\varepsilon_2 - \varepsilon_1)\mathbf{E}_t/c.$ (76.14)

In this approximation the values of H and E on the two sides of the surface need not be
distinguished on the right-hand sides of equations (76.14).

If the body moves so that its surface moves tangentially to itself (e.g. a solid of
revolution rotating about its axis), then $v_n = 0$. Only in this case do the boundary
conditions (76.13) or (76.14) reduce to the usual conditions that $\mathbf{E}_t$ and $\mathbf{H}_t$ be continuous.


PROBLEMS 

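Problem 1. A dielectric sphere rotates uniformly in a vacuum in a uniform static magnetic field $\mathfrak{H}$.
Determine the resulting electric field near the sphere.

Solution. In calculating the resulting electric field, the magnetic field may be taken to be the same as for a
sphere at rest, since an allowance for the reciprocal effect of the magnetic field variation would give corrections of
a higher order of smallness. Within the sphere, the magnetic field has the uniform value $\mathbf{H}^{(i)} = 3\mathfrak{H}/(2 + \mu)$; cf. (8.2).

Since the rotation is steady, the resulting electric field is constant and, like any static electric field, has a
potential: E = − grad $\phi$. Outside the sphere, the potential satisfies the equation $\Delta\phi^{(e)} = 0$; inside the sphere, it
satisfies

$\Delta\phi^{(i)} = 2(\varepsilon\mu - 1)\,\boldsymbol{\Omega}\cdot\mathbf{H}^{(i)}/c\varepsilon,$ (1)

where $\boldsymbol{\Omega}$ is the angular velocity. The latter equation is obtained from div D = 0 by substituting for D the
expression (76.10) with $\mathbf{v} = \boldsymbol{\Omega}\times\mathbf{r}$. The condition that the normal component of D be continuous at the surface of
the sphere gives

$\left[\varepsilon\dfrac{\partial\phi^{(i)}}{\partial r} - \dfrac{\partial\phi^{(e)}}{\partial r}\right]_{r=a} = \dfrac{(\varepsilon\mu - 1)a}{c}\left[\boldsymbol{\Omega}\cdot\mathbf{H}^{(i)} - (\mathbf{n}\cdot\boldsymbol{\Omega})(\mathbf{n}\cdot\mathbf{H}^{(i)})\right],$ (2)

where $a$ is the radius of the sphere and n a unit radial vector.

From the symmetry of the sphere, the required electric field is determined by only two constant vectors, $\boldsymbol{\Omega}$ and
$\mathfrak{H}$. From these vectors we can form a bilinear scalar $\mathfrak{H}\cdot\boldsymbol{\Omega}$ and a bilinear tensor $\mathfrak{H}_i\Omega_k + \mathfrak{H}_k\Omega_i
- \tfrac{2}{3}\delta_{ik}\,\mathfrak{H}\cdot\boldsymbol{\Omega}$, whose trace is zero. Accordingly, we seek the field potential outside the sphere in the form

$\phi^{(e)} = \dfrac{1}{6}D_{ik}\dfrac{\partial^2}{\partial x_i\,\partial x_k}\dfrac{1}{r} = \dfrac{D_{ik}n_i n_k}{2r^3},$ (3)

where $D_{ik}$ is a constant tensor (with $D_{ii} = 0$), the electric quadrupole moment tensor of the sphere (see Fields).
No term of the form constant/$r$ can appear in $\phi^{(e)}$, since such a term would give a non-zero total electric
flux through a surface surrounding the sphere, whereas the sphere is uncharged. The field potential inside the sphere is

$\phi^{(i)} = \dfrac{r^2}{2a^5}D_{ik}n_i n_k + \dfrac{\varepsilon\mu - 1}{3c\varepsilon}\,\boldsymbol{\Omega}\cdot\mathbf{H}^{(i)}\,(r^2 - a^2).$ (4)

The first term is the solution of the homogeneous equation $\Delta\phi = 0$, and the coefficient is chosen so as to give
continuity of the potential, and therefore of $\mathbf{E}_t$, at the surface of the sphere. Substituting (3) and (4) in (2), we
obtain

$D_{ik} = -\dfrac{3a^5(\varepsilon\mu - 1)}{c(3 + 2\varepsilon)(2 + \mu)}\left[\mathfrak{H}_i\Omega_k + \mathfrak{H}_k\Omega_i - \tfrac{2}{3}\delta_{ik}\,\mathfrak{H}\cdot\boldsymbol{\Omega}\right].$ (5)

Thus a quadrupole electric field is formed near the rotating sphere, and the quadrupole moment of the sphere is
given by formula (5). In particular, if the axis of rotation (the $z$-axis) is parallel to the external field, $D_{ik}$ has only the
diagonal components

$D_{zz} = -\dfrac{4a^5(\varepsilon\mu - 1)}{c(3 + 2\varepsilon)(2 + \mu)}\,\mathfrak{H}\Omega, \qquad D_{xx} = D_{yy} = -\tfrac{1}{2}D_{zz}.$

Similarly, a quadrupole magnetic field occurs near a sphere rotating in a uniform electric field. The magnetic
quadrupole moment is given by (5) if the sign is changed and $\varepsilon$, $\mu$, $\mathfrak{H}$ are replaced by $\mu$, $\varepsilon$, $\mathfrak{E}$ respectively.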

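Problem 2. A magnetized sphere rotates uniformly in a vacuum about its axis, which is parallel to the
direction of magnetization. Determine the resulting electric field near the sphere.†

Solution. The magnetic field inside the sphere is uniform, and is expressed in terms of the constant
magnetization M by the equations $\mathbf{B}^{(i)} + 2\mathbf{H}^{(i)} = 0$ (cf. (8.1)) and $\mathbf{B}^{(i)} - \mathbf{H}^{(i)} = 4\pi\mathbf{M}$, whence $\mathbf{B}^{(i)} = 8\pi\mathbf{M}/3$,
$\mathbf{H}^{(i)} = -4\pi\mathbf{M}/3$. The second of formulae (76.9) does not hold in this case, because the formula B = $\mu$H is not
valid for a ferromagnet at rest; from the first of (76.9) we have, inside the sphere,

$\mathbf{D} = \varepsilon\mathbf{E} + \varepsilon\mathbf{v}\times\mathbf{B}/c - \mathbf{v}\times\mathbf{H}/c = \varepsilon\mathbf{E} + 4\pi(2\varepsilon + 1)\,\mathbf{v}\times\mathbf{M}/3c.$

The potential of the resulting electric field outside the sphere satisfies the equation $\Delta\phi^{(e)} = 0$, and that inside the
sphere satisfies $\Delta\phi^{(i)} = 8\pi(2\varepsilon + 1)M\Omega/3c\varepsilon$.

The boundary condition that $D_n$ be continuous at the surface of the sphere gives

$\varepsilon\left[\dfrac{\partial\phi^{(i)}}{\partial r}\right]_{r=a} - \dfrac{4\pi(2\varepsilon + 1)}{3c}\,a\Omega M\sin^2\theta = \left[\dfrac{\partial\phi^{(e)}}{\partial r}\right]_{r=a},$

where $\theta$ is the angle between the normal n and the direction of $\boldsymbol{\Omega}$ and M (the $z$-axis). We seek $\phi^{(e)}$ and $\phi^{(i)}$ in the
forms

$\phi^{(e)} = \dfrac{D_{ik}n_i n_k}{2r^3} = \dfrac{D_{zz}}{4r^3}(3\cos^2\theta - 1),$

$\phi^{(i)} = \dfrac{r^2 D_{zz}}{4a^5}(3\cos^2\theta - 1) + \dfrac{4\pi(2\varepsilon + 1)}{9c\varepsilon}\,M\Omega\,(r^2 - a^2).$

From the boundary condition we obtain the following expressions for the electric quadrupole moment of the
rotating sphere:

$D_{zz} = -\dfrac{4(2\varepsilon + 1)}{3c(2\varepsilon + 3)}\,a^2\Omega\mathscr{M}, \qquad D_{xx} = D_{yy} = -\tfrac{1}{2}D_{zz},$

where $\mathscr{M}$ is the total magnetic moment of the sphere. For a metal sphere we must take $\varepsilon \to \infty$, giving
$D_{zz} = -4\Omega\mathscr{M}a^2/3c$.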


† If the direction of magnetization is not the same as that of the axis of rotation, the problem is considerably
changed, since the sphere then emits electromagnetic waves.


§77. The dispersion of the permittivity

Let us now go on to study the important subject of rapidly varying electromagnetic
fields, whose frequencies are not restricted to be small in comparison with the frequencies
which characterize the establishment of the electric and magnetic polarization of the
substances concerned.

An electromagnetic field variable in time must necessarily be variable in space also. For a
frequency $\omega$, the spatial periodicity is characterized by a wavelength $\lambda \sim c/\omega$. As the
frequency increases, $\lambda$ eventually becomes comparable with the atomic dimensions $a$. The
macroscopic description of the field is thereafter invalid.

The question may arise whether there is any frequency range in which, on the one hand,
dispersion phenomena are important but, on the other hand, the macroscopic formulation
still holds good. It is easy to see that such a range must exist. The most rapid manner of
establishment of the electric or magnetic polarization in matter is the electronic
mechanism. Its relaxation time is of the order of the atomic time $a/v$, where $v$ is the velocity
of the electrons in the atom. Since $v \ll c$, even the wavelength $\lambda \sim ac/v$ corresponding to
these times is large compared with $a$.

In what follows we shall assume the condition $\lambda \gg a$ to hold.† It must be borne in mind,
however, that this condition may not be sufficient: for metals at low temperatures there is a
range of frequencies in which the macroscopic theory is inapplicable, although the
inequality $c/\omega \gg a$ is satisfied (see §87).

The formal theory given below is equally applicable to metals and to dielectrics. At
frequencies corresponding to the motion of the electrons within the atoms (optical
frequencies) and at higher frequencies, there is, indeed, not even a quantitative difference in
the properties of metals and dielectrics.

It is clear from the discussion in §75 that Maxwell’s equations 

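$\operatorname{div}\mathbf{D} = 0, \qquad \operatorname{div}\mathbf{B} = 0,$ (77.1)

$\operatorname{curl}\mathbf{E} = -(1/c)\,\partial\mathbf{B}/\partial t, \qquad \operatorname{curl}\mathbf{H} = (1/c)\,\partial\mathbf{D}/\partial t$ (77.2)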

remain formally the same in arbitrary variable electromagnetic fields. These equations are, 
however, largely useless until the relations between the quantities D, B, E and H which 
appear in them have been established. At the high frequencies at present under 
consideration, these relations bear no resemblance to those which are valid in the static case 
and which we have used for variable fields in the absence of dispersion. 

First of all, the principal property of these relations, namely the dependence of D and B 
only on the values of E and H at the instant considered, no longer holds good. In the 
general case of an arbitrary variable field, the values of D and B at a given instant are not 
determined only by the values of E and H at that instant. On the contrary, they depend in 
general on the values of E (t) and H (t) at every previous instant. This expresses the fact that 
the establishment of the electric or magnetic polarization of the matter cannot keep up 
with the change in the electromagnetic field. The frequencies at which dispersion 
phenomena first appear may be completely different for the electric and the magnetic 
properties of the substance. 

In the present section we shall refer to the dependence of D on E; the specific features of 
the dispersion of magnetic properties will be discussed in §79. 

The polarization vector P has been introduced in §6 by means of the definition
$\bar\rho = -\operatorname{div}\mathbf{P}$, $\rho$ being the true (microscopic) charge density. This equation expresses the
electric neutrality of the body as a whole, and together with the condition P = 0 outside the
body it shows that the total electric moment of the body is $\int\mathbf{P}\,dV$. This derivation is
evidently valid for variable as well as for static fields. Thus in any variable field, even if
dispersion is present, the vector P = (D − E)/$4\pi$ retains its physical significance: it is the
electric moment per unit volume.


† The effects (called the natural optical activity) resulting from terms of the next order in the small ratio $a/\lambda$ will
be considered in §§104-106.

In rapidly varying fields, the field strengths involved are usually fairly small. Hence the
relation between D and E can always be taken to be linear.† The most general linear
relation between D($t$) and the values of the function E($t$) at all previous instants can be
written in the integral form

$\mathbf{D}(t) = \mathbf{E}(t) + \int_0^\infty f(\tau)\,\mathbf{E}(t - \tau)\,d\tau.$ (77.3)

It is convenient to separate the term E($t$), for reasons which will become evident later. In
equation (77.3) $f(\tau)$ is a function of time and of the properties of the medium. By analogy
with the electrostatic formula D = $\varepsilon$E, we write the relation (77.3) in the symbolic form
D = $\hat\varepsilon$E, where $\hat\varepsilon$ is a linear integral operator whose effect is shown by (77.3).

Any variable field can be resolved by a Fourier expansion into a series of monochromatic
components, in which all quantities depend on time through the factor $e^{-i\omega t}$. For
such fields the relation (77.3) between D and E becomes

$\mathbf{D} = \varepsilon(\omega)\,\mathbf{E},$ (77.4)

where the function $\varepsilon(\omega)$ is defined as

$\varepsilon(\omega) = 1 + \int_0^\infty f(\tau)\,e^{i\omega\tau}\,d\tau.$ (77.5)

Thus, for periodic fields, we can regard the permittivity (the coefficient of proportionality
between D and E) as a function of the field frequency as well as of the properties of the
medium. The dependence of $\varepsilon$ on the frequency is called its dispersion relation.

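The function $\varepsilon(\omega)$ is in general complex. We denote its real and imaginary parts by $\varepsilon'$ and
$\varepsilon''$:

$\varepsilon(\omega) = \varepsilon'(\omega) + i\varepsilon''(\omega).$ (77.6)

From the definition (77.5) we see at once that

$\varepsilon(-\omega) = \varepsilon^*(\omega).$ (77.7)

Separating the real and imaginary parts, we have

$\varepsilon'(-\omega) = \varepsilon'(\omega), \qquad \varepsilon''(-\omega) = -\varepsilon''(\omega).$ (77.8)

Thus $\varepsilon'(\omega)$ is an even function of the frequency, and $\varepsilon''(\omega)$ is an odd function.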
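The definition $\varepsilon(\omega) = 1 + \int_0^\infty f(\tau)e^{i\omega\tau}d\tau$ and the symmetry $\varepsilon(-\omega) = \varepsilon^*(\omega)$ can be illustrated numerically. The sketch below is only an example: the exponential kernel $f(\tau) = A e^{-\tau/\tau_0}$ is an assumed model (simple relaxation) and is not taken from the text, the integral is cut off at a finite `t_max`, and the trapezoidal rule is used. For this kernel the integral is elementary, $\varepsilon(\omega) = 1 + A\tau_0/(1 - i\omega\tau_0)$, which serves as a check.

```python
import cmath
import math


def permittivity(omega, f, t_max, steps=20000):
    """Trapezoidal estimate of 1 + integral_0^t_max f(tau) exp(i*omega*tau) dtau.

    f must be a real function that is negligible beyond t_max.
    """
    h = t_max / steps
    total = 0.5 * (f(0.0) + f(t_max) * cmath.exp(1j * omega * t_max))
    for n in range(1, steps):
        tau = n * h
        total += f(tau) * cmath.exp(1j * omega * tau)
    return 1.0 + h * total


def relaxation_kernel(amplitude, tau0):
    """Assumed model kernel f(tau) = amplitude * exp(-tau / tau0)."""
    return lambda tau: amplitude * math.exp(-tau / tau0)


def relaxation_permittivity(omega, amplitude, tau0):
    """Closed form of the integral for the assumed exponential kernel."""
    return 1.0 + amplitude * tau0 / (1.0 - 1j * omega * tau0)


if __name__ == "__main__":
    f = relaxation_kernel(amplitude=2.0, tau0=1.0)
    for omega in (-0.7, 0.0, 0.7):
        print(omega, permittivity(omega, f, t_max=40.0),
              relaxation_permittivity(omega, 2.0, 1.0))
```

The real part comes out even and the imaginary part odd in $\omega$, and at $\omega = 0$ the result is the real static value $1 + A\tau_0$.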

For frequencies which are small compared with those at which the dispersion begins, we
can expand $\varepsilon(\omega)$ as a power series in $\omega$. The expansion of the even function $\varepsilon'(\omega)$ contains
only even powers, and that of the odd function $\varepsilon''(\omega)$ contains only odd powers. In the
limit as $\omega \to 0$, the function $\varepsilon(\omega)$ in dielectrics tends, of course, to the electrostatic permittivity,
which we here denote by $\varepsilon_0$. In dielectrics, therefore, the expansion of $\varepsilon'(\omega)$ begins with the
constant term $\varepsilon_0$, while that of $\varepsilon''(\omega)$ begins, in general, with a term in $\omega$.


† Here we assume that D depends linearly on E alone, and not on H. In a constant field, a linear dependence of
D on H is excluded by the requirement of invariance with respect to time reversal. In a variable field, this condition
no longer applies, and a linear relation between D and H is possible if the substance possesses symmetry of
various kinds. It is, however, a small effect of the order of $a/\lambda$, of the kind mentioned in the last footnote.

The function $\varepsilon(\omega)$ at low frequencies can also be discussed for metals, if it is defined in
such a way that, in the limit $\omega \to 0$, the equation

$\operatorname{curl}\mathbf{H} = (1/c)\,\partial\mathbf{D}/\partial t$

becomes the equation

$\operatorname{curl}\mathbf{H} = 4\pi\sigma\mathbf{E}/c$

for a static field in a conductor. Comparing the two equations, we see that for $\omega \to 0$ we
must have $\partial\mathbf{D}/\partial t \to 4\pi\sigma\mathbf{E}$. But, in a periodic field, $\partial\mathbf{D}/\partial t = -i\omega\varepsilon\mathbf{E}$, and we thus obtain
the following expression for $\varepsilon(\omega)$ in the limit of low frequencies:

$\varepsilon(\omega) = 4\pi i\sigma/\omega.$ (77.9)

Thus the expansion of the function $\varepsilon(\omega)$ in conductors begins with an imaginary term in
$1/\omega$, which is expressed in terms of the ordinary conductivity $\sigma$ for steady currents.† The
next term in the expansion of $\varepsilon(\omega)$ is a real constant, although for metals this constant does
not have the same electrostatic significance as it does for dielectrics.‡

Moreover, this term of the expansion may again be devoid of significance if the effects of
the spatial non-uniformity of the field of the electromagnetic wave appear before those of
its periodicity in time.

§78. The permittivity at very high frequencies 

In the limit as $\omega \to \infty$, the function $\varepsilon(\omega)$ tends to unity. This is evident from simple
physical considerations: when the field changes sufficiently rapidly, the polarization
processes responsible for the difference between the field E and the induction D cannot
occur at all.

It is possible to establish the limiting form of the function $\varepsilon(\omega)$ at high frequencies, which
is valid for all bodies, whether metals or dielectrics. The field frequency is assumed large
compared with the frequencies of the motion of all, or at least the majority, of the electrons
in the atoms forming the body. When this condition holds, we can calculate the
polarization of the substance by regarding the electrons as free and neglecting their
interaction with one another and with the nuclei of the atoms.

The velocities $v$ of the motion of the electrons in the atoms are small compared with the
velocity of light. Hence the distances $v/\omega$ which they traverse during one period of the
electromagnetic wave are small compared with the wavelength $c/\omega$. For this reason we can
assume the wave field uniform in determining the velocity acquired by an electron in that
field.

The equation of motion is $m\,d\mathbf{v}'/dt = e\mathbf{E} = e\mathbf{E}_0 e^{-i\omega t}$, where $e$ and $m$ are the electron
charge and mass, and $\mathbf{v}'$ is the additional velocity acquired by the electron in the wave field.


† The imaginary part of the function $\varepsilon(\omega)$ is sometimes represented in the form (77.9) for all frequencies; this
amounts to introducing a new function $\sigma(\omega)$, which has no physical significance apart from its relationship to
$\varepsilon''(\omega)$.
‡ To avoid misunderstanding, we should point out a slight change in notation in comparison with §75. In
equation (75.10) for poor conductors, $\varepsilon(\omega)$ is $(4\pi i\sigma/\omega) + \varepsilon$.
Hence $\mathbf v' = ie\mathbf E/m\omega$. The displacement $\mathbf r$ of the electron due to the field is given by $\dot{\mathbf r} = \mathbf v'$,
and therefore $\mathbf r = -e\mathbf E/m\omega^2$. The polarization $\mathbf P$ of the body is the dipole moment per unit
volume. Summing over all electrons, we find $\mathbf P = \sum e\mathbf r = -e^2 N\mathbf E/m\omega^2$, where $N$ is the
number of electrons in all the atoms in unit volume of the substance. By the definition of
the electric induction, we have $\mathbf D = \epsilon\mathbf E = \mathbf E + 4\pi\mathbf P$. We thus have the formula

$$\epsilon(\omega) = 1 - 4\pi N e^2/m\omega^2. \tag{78.1}$$

The range of frequencies over which this formula is applicable begins, in practice, at the 
far ultra-violet for light elements and at the X-ray region for heavier elements. 

If $\epsilon(\omega)$ is to retain the significance which it has in Maxwell’s equations, the frequency
must also satisfy the condition $\omega \ll c/a$, where $a$ is the atomic dimension. We shall see later (§124), however, that the
expression (78.1) can be allotted a certain physical significance even at higher frequencies.
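
As a rough numerical illustration of formula (78.1), the following sketch evaluates $\epsilon(\omega)$ in Gaussian units and finds the frequency at which the expression passes through zero. The electron density used in the example is a hypothetical round number, not a value taken from the text, and the function names are ours.

```python
import math

E_CHARGE = 4.803e-10  # electron charge in esu (Gaussian units)
E_MASS = 9.109e-28    # electron mass in grams


def eps_high_freq(n_electrons, omega):
    """Evaluate (78.1): eps = 1 - 4*pi*N*e^2 / (m*omega^2)."""
    return 1.0 - 4.0 * math.pi * n_electrons * E_CHARGE**2 / (E_MASS * omega**2)


def omega_zero(n_electrons):
    """Frequency at which the right-hand side of (78.1) vanishes."""
    return math.sqrt(4.0 * math.pi * n_electrons * E_CHARGE**2 / E_MASS)


if __name__ == "__main__":
    n = 1.0e24  # hypothetical total electron density, cm^-3 (an assumption)
    w0 = omega_zero(n)
    for factor in (2.0, 10.0, 100.0):
        # eps approaches unity as omega grows, as the text states
        print(factor, eps_high_freq(n, factor * w0))
```

The deviation of $\epsilon$ from unity falls off as $1/\omega^2$, so at ten times the zero-crossing frequency it is already only one per cent.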


§79. The dispersion of the magnetic permeability 

Unlike $\epsilon(\omega)$, the magnetic permeability $\mu(\omega)$ ceases to have any physical meaning at
relatively low frequencies. To take account of the deviation of $\mu(\omega)$ from unity would then
be an unwarrantable refinement. To show this, let us investigate to what extent the physical
meaning of the quantity $\mathbf M = (\mathbf B - \mathbf H)/4\pi$, as being the magnetic moment per unit volume,
is maintained in a variable field. The magnetic moment of a body is, by definition, the
integral

$$\frac{1}{2c}\int \mathbf r \times \overline{\rho\mathbf v}\,dV. \tag{79.1}$$

The mean value of the microscopic current density is related to the mean field by equation
(75.7):

$$\mathrm{curl}\,\mathbf B = \frac{4\pi}{c}\,\overline{\rho\mathbf v} + \frac{1}{c}\frac{\partial\mathbf E}{\partial t}. \tag{79.2}$$

Subtracting the equation $\mathrm{curl}\,\mathbf H = (1/c)\,\partial\mathbf D/\partial t$, we obtain

$$\overline{\rho\mathbf v} = c\,\mathrm{curl}\,\mathbf M + \partial\mathbf P/\partial t. \tag{79.3}$$

The integral (79.1) can, as shown in §29, be put in the form $\int\mathbf M\,dV$ only if $\overline{\rho\mathbf v} = c\,\mathrm{curl}\,\mathbf M$
and $\mathbf M = 0$ outside the body.

Thus the physical meaning of $\mathbf M$, and therefore of the magnetic susceptibility, depends
on the possibility of neglecting the term $\partial\mathbf P/\partial t$ in (79.3). Let us see to what extent the
conditions can be fulfilled which make this neglect permissible.

For a given frequency, the most favourable conditions for measuring the susceptibility
are those where the body is as small as possible (to increase the space derivatives in $\mathrm{curl}\,\mathbf M$)
and the electric field is as weak as possible (to reduce $\mathbf P$). The field of an electromagnetic
wave does not satisfy the latter condition, because $E \sim H$. Let us therefore consider a
variable magnetic field, say in a solenoid, with the body under investigation placed on the
axis. The electric field is due only to induction by the variable magnetic field, and the order
of magnitude of $E$ inside the body can be obtained by estimating the terms in the equation
$\mathrm{curl}\,\mathbf E = -(1/c)\,\partial\mathbf B/\partial t$, whence $E/l \sim \omega H/c$ or $E \sim (\omega l/c)H$, where $l$ is the dimension of
the body. Putting $\epsilon - 1 \sim 1$, we have $\partial P/\partial t \sim \omega E \sim \omega^2 lH/c$. For the space derivatives of
the magnetic moment $\mathbf M = \chi\mathbf H$ we have $c\,\mathrm{curl}\,\mathbf M \sim c\chi H/l$. If $|\partial\mathbf P/\partial t|$ is small compared
with $|c\,\mathrm{curl}\,\mathbf M|$, we must have

$$l^2 \ll \chi c^2/\omega^2. \tag{79.4}$$


It is evident that the concept of magnetic susceptibility can be meaningful only if this
inequality allows dimensions of the body which are (at least) just macroscopic, i.e. if it is
compatible with the inequality $l \gg a$, where $a$ is the atomic dimension. This condition is
certainly not fulfilled for the optical frequency range; for such frequencies, the magnetic
susceptibility is always $\sim v^2/c^2$, where $v$ is the electron velocity in the atom;† but the optical
frequencies themselves are $\sim v/a$, and therefore the right-hand side of the inequality (79.4)
is $\sim a^2$.

Thus there is no meaning in using the magnetic susceptibility from optical frequencies
onward, and in discussing such phenomena we must put $\mu = 1$. To distinguish between $\mathbf B$
and $\mathbf H$ in this frequency range would be an over-refinement. Actually, the same is true for
many phenomena even at frequencies well below the optical range.‡

The presence of a considerable dispersion of the permeability makes possible the 
existence of quasi-steady oscillations of the magnetization in ferromagnetic bodies. In 
order to exclude the possible influence of the conductivity, we shall consider ferrites, which
are non-metallic ferromagnets.

The term “quasi-steady” means, as usual (§58), that the frequency is assumed to satisfy
the condition $\omega \ll c/l$, where $l$ is the characteristic dimension of the body (or the
“wavelength” of the oscillation). In addition, we shall neglect the exchange energy related
to the inhomogeneity of the magnetization resulting from the oscillations; that is, the
spatial dispersion (§103) of the permeability is assumed to be unimportant. For this, the
dimensions $l$ must be much greater than the characteristic length for the inhomogeneity
energy: $l \gg \sqrt{\alpha}$, where $\alpha$ is of the order of the coefficients in (43.1).

We can put $\mathbf H$ and $\mathbf B$ in the forms $\mathbf H = \mathbf H_0 + \mathbf H'$, $\mathbf B = \mathbf B_0 + \mathbf B'$, where $\mathbf H_0$ and $\mathbf B_0$ are the
field and induction in the statically magnetized body, $\mathbf H'$ and $\mathbf B'$ the variable parts in the
oscillations. When the displacement current is neglected, these variable parts satisfy the
equations

curl H' = 0, div B' = 0, (79.5) 

which differ from the magnetostatic equations only in that the permeability is now (for a
monochromatic field $\propto e^{-i\omega t}$) a function of the frequency, not a constant.§ A ferromagnetic
medium is magnetically anisotropic, and its permeability is therefore a tensor
$\mu_{ik}(\omega)$, which determines the linear relation between the variable parts of the induction and
the field.


† This estimate relates to the diamagnetic susceptibility; the relaxation times of any paramagnetic or
ferromagnetic processes are certainly long compared with the optical periods. It must be emphasized, however,
that the estimates are made for an isotropic body, and must be used with caution when applied to ferromagnets. In
particular, the gyrotropic terms in the tensor $\mu_{ik}$, which decrease only slowly (as $1/\omega$) with increasing frequency
(see Problem 1), may be important even at fairly high frequencies.

‡ This is discussed from a somewhat different standpoint in §103 below; see the second footnote to that
section.

§ These oscillations are therefore called magnetostatic oscillations. The theory has been given by C. Kittel
(1947) for homogeneous (see below) magnetostatic oscillations and by L. R. Walker (1957) for inhomogeneous
ones.



The first equation (79.5) shows that the magnetic field has a potential: $\mathbf H' = -\mathrm{grad}\,\psi$.
Substituting in the second equation $B'_i = \mu_{ik}H'_k = -\mu_{ik}\,\partial\psi/\partial x_k$, we then obtain an
equation for the potential within the body:

$$\mu_{ik}\,\partial^2\psi/\partial x_i\,\partial x_k = 0. \tag{79.6}$$

Outside the body, the potential satisfies Laplace’s equation $\triangle\psi = 0$; on the surface, $\mathbf H'_t$
and $B'_n$ must as usual be continuous. The first condition is equivalent to the continuity of
the potential $\psi$ itself; the second implies the continuity of $\mu_{ik}n_i\,\partial\psi/\partial x_k$, where $\mathbf n$ is a unit
vector along the normal to the surface. Far from the body, we must have $\psi \to 0$.

The problem thus formulated has non-trivial solutions only for certain values of the
$\mu_{ik}$ regarded as parameters. Equating the functions $\mu_{ik}(\omega)$ to these, we find the natural
oscillation frequencies of the magnetization of the body, called the inhomogeneous
ferromagnetic resonance frequencies.

The simplest type of magnetostatic oscillation of a uniformly magnetized ellipsoid 
consists in oscillations which maintain the uniformity, the magnetization oscillating as a 
whole. To find their frequencies, it is not necessary to obtain a new solution of the field 
equations; they can be derived directly from the relations (29.14): 

$$H_i + n_{ik}(B_k - H_k) = \mathfrak{H}_i, \tag{79.7}$$

where $n_{ik}$ is the demagnetizing factor tensor of the ellipsoid; $\mathbf H$ and $\mathbf B$ relate to the field
within it, and $\boldsymbol{\mathfrak{H}}$ is the external magnetic field. The latter is assumed to be uniform; in $\mathbf H$ and
$\mathbf B$, we again separate the oscillatory parts $\mathbf H'$ and $\mathbf B'$, which are now uniform throughout the
body. For these we have

$$H'_i + n_{ik}(B'_k - H'_k) = 0$$

or

$$(\delta_{ik} + 4\pi n_{il}\chi_{lk})H'_k = 0,$$

with the magnetic susceptibility tensor $\chi_{ik}(\omega)$ defined by $\mu_{ik} = \delta_{ik} + 4\pi\chi_{ik}$. Equating to zero
the determinant of this system of linear homogeneous equations, we find

$$\det|\delta_{ik} + 4\pi n_{il}\chi_{lk}(\omega)| = 0, \tag{79.8}$$

the roots of which give the natural oscillation frequencies. These are called the
homogeneous ferromagnetic resonance frequencies.


PROBLEMS 


Problem 1. Using the macroscopic equation of motion of the magnetic moment (the Landau-Lifshitz
equation; see SP 2, (69.9)), derive the magnetic permeability tensor for a uniformly magnetized uniaxial
ferromagnet of the easy-axis type, in the absence of dissipation (L. D. Landau and E. M. Lifshitz, 1935).


Solution. The equation of motion of the magnetization in a ferromagnet is

$$\dot{\mathbf M} = \gamma(\mathbf H + \beta M_z\boldsymbol\nu)\times\mathbf M,$$

where $\gamma = g|e|/2mc$ ($g$ being the gyromagnetic ratio), $\beta > 0$ the anisotropy coefficient, and $\boldsymbol\nu$ a unit vector along
the axis of easy magnetization (the z-axis). We write $\mathbf H = \mathbf H_0 + \mathbf H'$, where $\mathbf H'$ is a small variable field in any
direction, and $\mathbf H_0$ a constant field which we take to be along the z-axis.† The transverse magnetization $\mathbf M_\perp$ due


t This field is 


with a view to applying the results in the subsequent Problems. 
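to $\mathbf H'$ is also small; $M_z \approx M$ = constant. Neglecting second-order small quantities, we find the equations

$$-i\omega M_x = -\gamma(H_0 + \beta M)M_y + \gamma MH'_y, \qquad -i\omega M_y = \gamma(H_0 + \beta M)M_x - \gamma MH'_x.$$

On determining $M_x$ and $M_y$ from these equations, we find the susceptibility (as the coefficients in the relations
$M'_i = \chi_{ik}H'_k$), and hence the permeability

$$\mu_{xx} = \mu_{yy} = 1 - \frac{4\pi}{\beta}\,\frac{\omega_M(\omega_M + \omega_H)}{\omega^2 - (\omega_M + \omega_H)^2} \equiv \mu, \qquad \mu_{zz} = 1, \qquad \mu_{xy} = -\mu_{yx} = \frac{4\pi i}{\beta}\,\frac{\omega\,\omega_M}{\omega^2 - (\omega_M + \omega_H)^2}, \tag{1}$$

where $\omega_M = \gamma\beta M$, $\omega_H = \gamma H_0$. A ferromagnetic medium is thus gyrotropic, in the sense defined in §101.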

Problem 2. Find the homogeneous ferromagnetic resonance frequencies for an ellipsoid with one principal
axis along the axis of easy magnetization. An external field is applied in that direction (C. Kittel, 1947).†

Solution. Within the ellipsoid there is a field $H_0 = \mathfrak{H} - 4\pi n^{(z)}M$ along the z-axis (the axis of easy
magnetization); $n^{(x)}$, $n^{(y)}$, $n^{(z)}$ are the demagnetizing factors along the principal axes of the ellipsoid. A simple
calculation of the determinant (79.8) gives

$$\omega^2 = \omega^{(x)}\omega^{(y)},$$

$$\omega^{(x)} = \gamma[M\beta + \mathfrak{H} + 4\pi M(n^{(x)} - n^{(z)})], \qquad \omega^{(y)} = \gamma[M\beta + \mathfrak{H} + 4\pi M(n^{(y)} - n^{(z)})].$$

The homogeneous resonance frequency is therefore

$$\omega = \sqrt{\omega^{(x)}\omega^{(y)}}.$$

For example, in the case of a sphere $n^{(x)} = n^{(y)} = n^{(z)} = \tfrac13$, and the resonance frequency is $\omega = \gamma(M\beta + \mathfrak{H})$. For a
plane-parallel slab whose surface is perpendicular to the easy-magnetization axis we have $n^{(x)} = n^{(y)} = 0$, $n^{(z)} = 1$,
and the resonance frequency is $\omega = \gamma(M\beta + \mathfrak{H} - 4\pi M)$; the slab is magnetized if $M\beta + \mathfrak{H} > 4\pi M$.
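
The two special cases just quoted can be checked numerically. The sketch below codes the product formula for the homogeneous resonance frequency obtained in Problem 2 and evaluates it for a sphere and for a slab normal to the easy axis. All the material parameters in the example are hypothetical values chosen only for illustration, and the function name is ours.

```python
import math


def homogeneous_resonance(gamma, M, beta, h_ext, n_x, n_y, n_z):
    """Return sqrt(w_x * w_y), with
    w_x = gamma*[M*beta + h_ext + 4*pi*M*(n_x - n_z)] and similarly for w_y."""
    w_x = gamma * (M * beta + h_ext + 4.0 * math.pi * M * (n_x - n_z))
    w_y = gamma * (M * beta + h_ext + 4.0 * math.pi * M * (n_y - n_z))
    return math.sqrt(w_x * w_y)


if __name__ == "__main__":
    # Hypothetical parameters (Gaussian units); none of them comes from the text.
    gamma, M, beta, h_ext = 1.76e7, 140.0, 1.0, 3000.0
    sphere = homogeneous_resonance(gamma, M, beta, h_ext, 1/3, 1/3, 1/3)
    slab = homogeneous_resonance(gamma, M, beta, h_ext, 0.0, 0.0, 1.0)
    print(sphere, gamma * (M * beta + h_ext))
    print(slab, gamma * (M * beta + h_ext - 4.0 * math.pi * M))
```

The demagnetizing factors must satisfy $n^{(x)} + n^{(y)} + n^{(z)} = 1$; the function does not enforce this, so the caller is responsible for supplying a consistent set.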

Problem 3. Find the dispersion relation for magnetostatic oscillations in an infinite medium. 

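Solution. With the tensor $\mu_{ik}$ from (1), equation (79.6) becomes

$$\mu\left(\frac{\partial^2\psi}{\partial x^2} + \frac{\partial^2\psi}{\partial y^2}\right) + \frac{\partial^2\psi}{\partial z^2} = 0. \tag{2}$$

Putting $\psi \propto e^{i\mathbf k\cdot\mathbf r}$, we find $\mu(\omega) = -\cot^2\theta$, where $\theta$ is the angle between $\mathbf k$ and the easy-magnetization axis (the z-
axis). With $\mu(\omega)$ from (1) ($\mathfrak{H} = 0$), the oscillation frequency is

$$\omega = \gamma M[\beta(\beta + 4\pi\sin^2\theta)]^{1/2}.$$

This depends only on the direction of the wave vector, not on its magnitude. The result agrees, as it should, with
the limit (as $k \to 0$) of the dispersion relation for spin waves in a ferromagnet; see SP 2, §70.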

Problem 4. Find the inhomogeneous resonance frequencies in an infinite plane-parallel slab whose surface
is perpendicular to the easy-magnetization axis. An external field $\mathfrak{H}$ is applied in that direction.

Solution. It is necessary to solve equation (2) for the potential $\psi^{(i)}$ within the slab, and the equation
$\triangle\psi^{(e)} = 0$ for the potential outside the slab, with the boundary conditions

$$\psi^{(i)} = \psi^{(e)}, \quad \partial\psi^{(i)}/\partial z = \partial\psi^{(e)}/\partial z \quad\text{for } z = \pm L; \qquad \psi^{(e)} \to 0 \quad\text{for } z \to \pm\infty;$$

the z-axis is perpendicular to the slab surface, the plane $z = 0$ bisects the slab, and $2L$ is the slab thickness. Such a
solution may be an even or odd function of $z$.

† In Problems 2-4 the permeability of the substance is assumed to be given by the equations (1).

In the first case,

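$$\psi^{(i)} = A\cos k_z z\; e^{ik_x x}, \qquad \psi^{(e)} = Be^{-k_x|z|}e^{ik_x x},$$

with $\mu k_x^2 = -k_z^2$ (the wave vector is in the xz-plane, and $k_x > 0$); the boundary conditions give the relation

$$\tan k_z L = k_x/k_z. \tag{3}$$

In the second case,

$$\psi^{(i)} = A\sin k_z z\; e^{ik_x x}, \qquad \psi^{(e)} = \pm Be^{-k_x|z|}e^{ik_x x},$$

and the boundary conditions give

$$\tan k_z L = -k_z/k_x. \tag{4}$$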

The demagnetizing factor of the slab is $n^{(z)} = 1$, and the demagnetizing field is therefore $-4\pi M$. With $\mu(\omega)$ from
(1), we find the oscillation frequency

$$\omega^2 = \gamma^2(M\beta + \mathfrak{H} - 4\pi M)(M\beta + \mathfrak{H} - 4\pi M\cos^2\theta), \tag{5}$$

where $\theta$ is the angle between $\mathbf k$ and the z-axis. For any value of $k_x$ there is a discrete infinity of $k_z$ values given by the
conditions (3) and (4). The corresponding frequencies are given by (5), and depend only on the ratio $k_x/k_z$. All
possible frequency values lie in the range

$$\gamma(M\beta + \mathfrak{H} - 4\pi M) \le \omega \le \gamma[(M\beta + \mathfrak{H} - 4\pi M)(M\beta + \mathfrak{H})]^{1/2}.$$

As $k \to 0$, only symmetrical oscillations are possible, and from (3) $k_x L \approx (k_z L)^2$, i.e. $k_x$ is a second-order small
quantity. Accordingly, we put $\theta = 0$ in (5) and obtain a frequency equal to the homogeneous resonance frequency,
as it should be.


§80. The field energy in dispersive media 


The formula

$$\mathbf S = c\,\mathbf E\times\mathbf H/4\pi \tag{80.1}$$


for the energy flux density remains valid in variable electromagnetic fields, even if 
dispersion is present. This is evident from the arguments given at the end of §30; on account 
of the continuity of the tangential components of E and H, formula (80.1) follows from the 
condition that the normal component of S be continuous at the boundary of the body and 
the validity of the same formula in the vacuum outside the body. 

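The rate of change of the energy in unit volume of the body is $-\mathrm{div}\,\mathbf S$. Using Maxwell’s
equations, we can write this expression as

$$-\mathrm{div}\,\mathbf S = \frac{1}{4\pi}\left(\mathbf E\cdot\frac{\partial\mathbf D}{\partial t} + \mathbf H\cdot\frac{\partial\mathbf B}{\partial t}\right); \tag{80.2}$$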


see (75.15). In a dielectric medium without dispersion, when $\epsilon$ and $\mu$ are real constants, this
quantity can be regarded as the rate of change of the electromagnetic energy

$$U = (\epsilon E^2 + \mu H^2)/8\pi, \tag{80.3}$$

which has an exact thermodynamic significance: it is the difference between the internal
energy per unit volume with and without the field, the density and entropy remaining
unchanged.

In the presence of dispersion, no such simple interpretation is possible. Moreover, in the
general case of arbitrary dispersion, the electromagnetic energy cannot be rationally
defined as a thermodynamic quantity. This is because the presence of dispersion in general
signifies a dissipation of energy, i.e. a dispersive medium is also an absorbing medium.
To determine this dissipation, let us consider a monochromatic electromagnetic field. By
averaging with respect to time the expression (80.2), we find the steady inflow of energy per
unit time and volume from the external sources which maintain the field. Since the
amplitude of the field is assumed constant, all of this energy goes to compensate the
dissipation. Under the conditions stated, therefore, the time average of (80.2) is the mean
quantity $Q$ of heat evolved per unit time and volume.

Since the expression (80.2) is quadratic in the fields, all quantities must be written in real
form. If, as is convenient for a monochromatic field, we take $\mathbf E$ and $\mathbf H$ to be complex,
then in (80.2) we must substitute for $\mathbf E$ and $\partial\mathbf D/\partial t$ respectively $\tfrac12(\mathbf E + \mathbf E^*)$ and $\tfrac12(-i\omega\epsilon\mathbf E
+ i\omega\epsilon^*\mathbf E^*)$, and similarly for $\mathbf H$ and $\partial\mathbf B/\partial t$. On averaging with respect to time, the products
$\mathbf E\cdot\mathbf E$ and $\mathbf E^*\cdot\mathbf E^*$, which contain factors $e^{\mp 2i\omega t}$, give zero, leaving

$$Q = \frac{i\omega}{16\pi}\left[(\epsilon^* - \epsilon)\mathbf E\cdot\mathbf E^* + (\mu^* - \mu)\mathbf H\cdot\mathbf H^*\right] = \frac{\omega}{8\pi}\left(\epsilon''|\mathbf E|^2 + \mu''|\mathbf H|^2\right). \tag{80.4}$$

This expression can also be written 

$$Q = \omega(\epsilon''\overline{\mathbf E^2} + \mu''\overline{\mathbf H^2})/4\pi, \tag{80.5}$$

where E and H are the real fields, and the bar denotes an average with respect to time; cf the 
last footnote to § 59. 
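
The time averaging that leads to (80.4) can be verified numerically for the electric term. In the sketch below a single field component is taken in the complex form $E = E_0e^{-i\omega t}$; the product of the real parts of $E$ and $\partial D/\partial t$ is averaged over one period and compared with $\omega\epsilon''|E_0|^2/8\pi$. The numerical values of $\epsilon$, $\omega$ and $E_0$ are arbitrary assumptions, and the function names are ours.

```python
import cmath
import math


def averaged_electric_loss(eps, omega, e_amp, steps=2000):
    """Average over one period of (1/4pi) * Re(E) * Re(dD/dt),
    with E = e_amp * exp(-i*omega*t) and dD/dt = -i*omega*eps*E."""
    period = 2.0 * math.pi / omega
    total = 0.0
    for k in range(steps):
        t = period * k / steps
        e_complex = e_amp * cmath.exp(-1j * omega * t)
        dd_dt = -1j * omega * eps * e_complex
        total += e_complex.real * dd_dt.real
    return total / steps / (4.0 * math.pi)


def electric_loss_formula(eps, omega, e_amp):
    """Electric part of (80.4): omega * eps'' * |E|^2 / (8*pi)."""
    return omega * eps.imag * abs(e_amp) ** 2 / (8.0 * math.pi)


if __name__ == "__main__":
    eps, omega, e_amp = 2.0 + 0.3j, 5.0, 1.5 - 0.7j  # arbitrary test values
    print(averaged_electric_loss(eps, omega, e_amp))
    print(electric_loss_formula(eps, omega, e_amp))
```

Only the imaginary part of $\epsilon$ survives the averaging; changing the real part of `eps` in the example leaves both numbers unchanged.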

It is easy to derive also a formula for the energy dissipation in a non-monochromatic 
field which tends sufficiently rapidly to zero as t ^ ± oo. In this case it is appropriate to 
consider the dissipation not per unit time but over the whole duration of the existence of 
the field. 

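Expanding the field $\mathbf E(t)$ as a Fourier integral, we write

$$\mathbf E(t) = \int_{-\infty}^{\infty}\mathbf E_\omega e^{-i\omega t}\,d\omega/2\pi,$$

On substituting $\epsilon = \epsilon' + i\epsilon''$, the term in $\epsilon'(\omega)$ vanishes in the integration, since the
integrand is an odd function of $\omega$. With a similar expression for the magnetic field, we have
finally

$$\int_{-\infty}^{\infty}Q\,dt = \frac{1}{4\pi}\int_{-\infty}^{\infty}\omega\left[\epsilon''(\omega)|\mathbf E_\omega|^2 + \mu''(\omega)|\mathbf H_\omega|^2\right]\frac{d\omega}{2\pi}; \tag{80.6}$$

the integral from $-\infty$ to $\infty$ can be replaced by twice the integral from 0 to $\infty$.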

These formulae show that the absorption (dissipation) of energy is determined by the
imaginary parts of $\epsilon$ and $\mu$. The two terms in (80.5) are called the electric and magnetic losses
respectively. On account of the law of increase of entropy, the sign of these losses is
determinate: the dissipation of energy is accompanied by the evolution of heat, i.e. $Q > 0$. It
therefore follows that the imaginary parts of $\epsilon$ and $\mu$ are always positive:

$$\epsilon'' > 0, \qquad \mu'' > 0 \tag{80.7}$$

for all substances and at all (positive) frequencies.† The signs of the real parts of $\epsilon$ and $\mu$ for
$\omega \ne 0$ are subject to no physical restriction.

Any non-steady process in an actual body is to some extent thermodynamically 
irreversible. The electric and magnetic losses in a variable electromagnetic field therefore 
always occur to some extent, however slight. That is, the functions $\epsilon''(\omega)$ and $\mu''(\omega)$ are not
exactly zero for any frequency other than zero. We shall see in §81 that this statement is of
fundamental importance, although it does not exclude the possibility of only very small
losses in certain frequency ranges. Such ranges, in which $\epsilon''$ and $\mu''$ are very small in
comparison with $\epsilon'$ and $\mu'$, are called transparency ranges. It is possible to neglect the
absorption in these ranges and to introduce the concept of the internal energy of the body
in the electromagnetic field, in the same sense as in a static field. To determine this
quantity, it is not sufficient to consider a purely monochromatic field, since the strict
periodicity results in no steady accumulation of electromagnetic energy. Let us therefore
consider a field whose components have frequencies in a narrow range about some mean
value $\omega_0$. The field strengths can be written

$$\mathbf E = \mathbf E_0(t)e^{-i\omega_0 t}, \qquad \mathbf H = \mathbf H_0(t)e^{-i\omega_0 t}, \tag{80.8}$$

where $\mathbf E_0(t)$ and $\mathbf H_0(t)$ are functions of time which vary only slowly in comparison with the
factor $e^{-i\omega_0 t}$. The real parts of these expressions are to be substituted on the right-hand
side of (80.2), and we then average with respect to time over the period $2\pi/\omega_0$, which is
small compared with the time of variation of the factors $\mathbf E_0$ and $\mathbf H_0$.

The first term in (80.2), with $\mathbf E$ written in complex form, is

$$\tfrac14(\mathbf E + \mathbf E^*)\cdot(\dot{\mathbf D} + \dot{\mathbf D}^*)/4\pi,$$

and similarly for the second term. The products $\mathbf E\cdot\dot{\mathbf D}$ and $\mathbf E^*\cdot\dot{\mathbf D}^*$ vanish when averaged
over time, and can therefore be ignored, leaving

$$\frac{1}{16\pi}\left(\mathbf E\cdot\dot{\mathbf D}^* + \mathbf E^*\cdot\dot{\mathbf D}\right). \tag{80.9}$$

† This statement applies to bodies which, in the absence of the variable field, are in thermodynamic
equilibrium: we assume this condition to hold. If the body is not in thermal equilibrium, then $Q$ may in principle
be negative. The second law of thermodynamics requires only a net increase in entropy as a result of the effects of
the variable electromagnetic field and of the absence of thermodynamic equilibrium, the latter effect being
independent of the presence of the field. An example of such a body is one in which all the atoms have been excited
artificially (i.e. not by spontaneous thermal excitation, but by an external “pumping” field).
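We write the derivative $\partial\mathbf D/\partial t$ as $\hat f\mathbf E$, where $\hat f$ is the operator $\partial\hat\epsilon/\partial t$, and ascertain the effect
of this operator on a function of the form (80.8). If $\mathbf E_0$ were a constant, we should have
simply $\hat f\mathbf E = f(\omega)\mathbf E$, where $f(\omega) = -i\omega\epsilon(\omega)$. We expand the function $\mathbf E_0(t)$ as a series of
Fourier components $\mathbf E_{0\alpha}e^{-i\alpha t}$, with constant $\mathbf E_{0\alpha}$. Since $\mathbf E_0(t)$ varies only slowly, this series
will include only components with $\alpha \ll \omega_0$. We can therefore put

$$\hat f\,\mathbf E_{0\alpha}e^{-i(\omega_0+\alpha)t} = f(\omega_0 + \alpha)\mathbf E_{0\alpha}e^{-i(\omega_0+\alpha)t} \approx \left[f(\omega_0) + \alpha\frac{df(\omega_0)}{d\omega_0}\right]\mathbf E_{0\alpha}e^{-i(\omega_0+\alpha)t}.$$

Summing the Fourier components, we have

$$\hat f\,\mathbf E_0(t)e^{-i\omega_0 t} = f(\omega_0)\mathbf E_0e^{-i\omega_0 t} + i\frac{df(\omega_0)}{d\omega_0}\frac{\partial\mathbf E_0}{\partial t}e^{-i\omega_0 t}.$$

Omitting henceforward the suffix 0 to $\omega$, we thus obtain

$$\frac{\partial\mathbf D}{\partial t} = -i\omega\epsilon(\omega)\mathbf E + \frac{d(\omega\epsilon)}{d\omega}\frac{\partial\mathbf E_0}{\partial t}e^{-i\omega t}. \tag{80.10}$$

Substituting this expression in (80.9) and neglecting the imaginary part of $\epsilon(\omega)$ gives

$$\frac{1}{16\pi}\frac{d(\omega\epsilon)}{d\omega}\left(\mathbf E_0^*\cdot\frac{\partial\mathbf E_0}{\partial t} + \mathbf E_0\cdot\frac{\partial\mathbf E_0^*}{\partial t}\right) = \frac{1}{16\pi}\frac{d(\omega\epsilon)}{d\omega}\frac{\partial}{\partial t}(\mathbf E\cdot\mathbf E^*),$$

since $\mathbf E\cdot\mathbf E^* = \mathbf E_0\cdot\mathbf E_0^*$. Adding a similar expression involving the magnetic field, we
conclude that the steady rate of change of the energy in unit volume is given by $d\bar U/dt$,
where

$$\bar U = \frac{1}{16\pi}\left[\frac{d(\omega\epsilon)}{d\omega}\mathbf E\cdot\mathbf E^* + \frac{d(\omega\mu)}{d\omega}\mathbf H\cdot\mathbf H^*\right]. \tag{80.11}$$

In terms of the real fields $\mathbf E$ and $\mathbf H$ this expression can be written

$$\bar U = \frac{1}{8\pi}\left[\frac{d(\omega\epsilon)}{d\omega}\overline{\mathbf E^2} + \frac{d(\omega\mu)}{d\omega}\overline{\mathbf H^2}\right] \tag{80.12}$$

(L. Brillouin, 1921).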

This is the required result: $\bar U$ is the mean value of the electromagnetic part of the internal
energy per unit volume of a transparent medium. If there is no dispersion, $\epsilon$ and $\mu$ are
constants, and (80.12) becomes the mean value of (80.3), as it should.

If the external supply of electromagnetic energy to the body is cut off, the absorption
which is always present (even though very small) ultimately converts the energy $\bar U$ entirely
into heat. Since, by the law of increase of entropy, there must be evolution and not
absorption of heat, we must have $\bar U > 0$. It therefore follows, by (80.11), that the
inequalities

$$d(\omega\epsilon)/d\omega > 0, \qquad d(\omega\mu)/d\omega > 0 \tag{80.13}$$

must hold. In reality, these conditions are necessarily fulfilled, by virtue of more stringent
inequalities always satisfied by the functions $\epsilon(\omega)$ and $\mu(\omega)$ in transparency ranges (see the
footnote to §84).

It should be emphasized once more that the expression (80.12) has been derived in the
first approximation with respect to the frequencies $\alpha$ with which the amplitude $\mathbf E_0(t)$ varies.
It is therefore valid only for fields whose amplitude varies sufficiently slowly with time. This
comment applies also to the calculation of the stress tensor in §81.
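
To illustrate the role of the derivative $d(\omega\epsilon)/d\omega$ in (80.12), the sketch below evaluates the electric part of the mean energy density for a lossless model permittivity of the form $\epsilon(\omega) = 1 - \omega_p^2/\omega^2$. This model, the parameter $\omega_p$ and the function names are assumptions introduced only for the example. For $\omega < \omega_p$ the permittivity itself is negative, yet $d(\omega\epsilon)/d\omega = 1 + \omega_p^2/\omega^2$ and hence the energy remain positive.

```python
import math


def d_omega_eps(eps_func, omega, rel_step=1e-6):
    """Central-difference estimate of d(omega*eps)/d(omega)."""
    h = omega * rel_step
    upper = (omega + h) * eps_func(omega + h)
    lower = (omega - h) * eps_func(omega - h)
    return (upper - lower) / (2.0 * h)


def mean_electric_energy(eps_func, omega, mean_e_squared):
    """Electric part of (80.12): d(omega*eps)/d(omega) * mean(E^2) / (8*pi)."""
    return d_omega_eps(eps_func, omega) * mean_e_squared / (8.0 * math.pi)


def model_eps(omega, omega_p=1.0):
    """Hypothetical lossless permittivity, eps = 1 - omega_p^2/omega^2."""
    return 1.0 - (omega_p / omega) ** 2


if __name__ == "__main__":
    for omega in (0.5, 1.0, 2.0):
        print(omega, model_eps(omega), d_omega_eps(model_eps, omega),
              mean_electric_energy(model_eps, omega, 1.0))
```

Replacing the derivative by $\epsilon$ itself, as in the non-dispersive formula (80.3), would give a negative energy at $\omega = 0.5\,\omega_p$ in this model, which shows why the dispersion correction cannot be dropped.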

§81. The stress tensor in dispersive media 
Another topic of considerable interest is the determination of the (time) averaged stress
tensor giving the forces on matter in a variable electric field. We shall show that, even when
dispersion is present (but absorption is again absent), the expression for this tensor
contains no derivatives with respect to the frequency, unlike the energy equation (80.12). In
particular, for a transparent isotropic dispersive fluid in a monochromatic electric field, the
mean value of $\sigma_{ik}$ is derived from (15.9) by simply replacing $\epsilon$ by $\epsilon(\omega)$ and the products $E_iE_k$
and $E^2$ by their mean values $\overline{E_iE_k}$ and $\overline{E^2}$ (L. P. Pitaevskiĭ, 1960).

To prove this, we return to the proof given in §15 and reformulate it to some extent. We 
there considered a dielectric-filled plane capacitor and determined the stress tensor from 
the equality between the work done by the ponderomotive forces in a movement of the 
plates and the change in the appropriate thermodynamic potential. We now write this 
condition for total quantities (not those per unit area), in the form 

where $A$ is the area of the capacitor plates. The potential $\tilde{\mathscr F}$ is here replaced by the ordinary
energy $\mathscr U$, the change in which is taken for fixed values of the entropy $\mathscr S$ of the dielectric
and of the total charges $\pm e$ on the capacitor plates (instead of a fixed potential $\phi$), and the
result given by the theorem of small increments is used.

In the form (81.1), this condition has a particularly simple significance: a thermally 
insulated capacitor with fixed charges on the plates is an electrically closed system. If 
mechanical work is done on it (to move the plates) by an external source, all the work goes 
to increase the energy of the capacitor. The latter is 

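$$\mathscr U = \mathscr U_0 + e^2/2C, \tag{81.2}$$

where $\mathscr U_0$ is the energy of the dielectric in the absence of the field (for the same value of the
entropy $\mathscr S$) and $C$ is the capacitance. For a plane capacitor, $C = \epsilon A/4\pi h$, where $h$ is the
distance between the plates. Hence

$$(\delta\mathscr U)_{\mathscr S,e} = (\delta\mathscr U_0)_{\mathscr S} - \tfrac12(e^2/C^2)(\delta C)_{\mathscr S}. \tag{81.3}$$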

On expressing $\delta C$ in terms of the displacement $\xi$ of the plates (taking into account the
dependence of $\epsilon$ on the density of the dielectric, which is changed by the displacement), we
immediately find (15.9)†; since the result is obvious, it will not be given here.


† It is expressed in terms of different variables, with the adiabatic instead of the isothermal derivative $\partial\epsilon/\partial\rho$ and
function $P_0$. The two forms are, of course, equivalent.
When dispersion is present, the expression for $\mathscr U$ is different. We shall show that the
relation (81.3), however, remains valid for the time-averaged variation $\delta\bar{\mathscr U}$, and this will
complete the proof of the above statement concerning the averaged stress tensor.

Let the charge on the capacitor plates vary monochromatically with frequency $\omega$. Then
the capacitor itself is not an electrically closed system, since charge has to be supplied and
withdrawn. The oscillatory circuit with natural frequency $\omega$ consisting of the capacitor and
an appropriately chosen self-inductance is an electrically closed system, however†; the
relation (81.1) is therefore valid for its energy.

In the absence of resistance, the potential difference $\phi$ between the capacitor plates is
equal to the sum of the external e.m.f. $\mathscr E$ and the self-inductance e.m.f.:

$$\phi = \mathscr E - \frac{L}{c^2}\frac{dJ}{dt}, \tag{81.4}$$


and the current $J$ is related to the capacitor plate charge $e$ by $J = de/dt$. For quantities
varying monochromatically, the capacitance $C(\omega)$ is by definition such that $\phi = e/C(\omega)$.
Putting $\mathscr E = 0$, $J = -i\omega e$ in (81.4), we find first of all that even when there is dispersion of
the capacitance the circuit frequency again satisfies Thomson’s relation (62.5):

$$\omega = c/\sqrt{[LC(\omega)]}. \tag{81.5}$$

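Next, multiplying (81.4) by $J = de/dt$ and considering almost monochromatic quantities
as in the derivation of (80.12), we easily find

$$\overline{\mathscr E J} = \frac{d}{dt}\left\{\frac{1}{2c^2}L\overline{J^2} + \frac12\frac{d(\omega C)}{d\omega}\overline{\phi^2}\right\}.$$

The form of this equation shows that the expression in the braces is the energy of the
oscillatory circuit. The first term in this expression can be transformed by substituting
$J = -i\omega e$ and using (81.5):

$$\frac{1}{2c^2}L\overline{J^2} = \frac{L\omega^2}{2c^2}\overline{e^2} = \frac{\overline{e^2}}{2C} = \tfrac12C\overline{\phi^2}.$$

The final expression for the circuit energy is‡

$$\bar{\mathscr U} = \frac12\left[C + \frac{d(\omega C)}{d\omega}\right]\overline{\phi^2} = \frac{1}{2\omega}\frac{d(\omega^2C)}{d\omega}\overline{\phi^2}. \tag{81.6}$$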

We have to calculate the variation of this energy for a small displacement of the capacitor
plates, i.e., for a small change in the capacitance. In a variable field, this displacement is to
be regarded as taking place with infinite slowness. In such a change, the adiabatic invariant
given (as in any linear oscillatory system) by the ratio of the oscillation energy and the
frequency remains constant.§ We thus have $\delta(\bar{\mathscr U}/\omega) = 0$, i.e.

$$\delta\bar{\mathscr U} = \bar{\mathscr U}\,\delta\omega/\omega. \tag{81.7}$$


† To meet the conditions for a quasi-steady state, the circuit dimensions must be small in comparison with the wavelength
$c/\omega$. This limitation is not a fundamental one, and does not impair the generality of the proof given.

‡ From now on, to simplify the formulae, we omit the “non-electromagnetic” part of the energy.

§ Cf. Mechanics, §49. The invariance of this quantity is particularly clear from the quantum theory: the ratio
$\bar{\mathscr U}/\hbar\omega$ is the state quantum number, which remains unchanged when the conditions vary adiabatically.
From (81.5), for a small change in the capacitance,

$$\delta\omega/\omega = -\tfrac12\,\delta C/C. \tag{81.8}$$

This change consists of two parts:

$$\delta C = (\delta C)_{\mathrm{st}} + (dC/d\omega)\,\delta\omega. \tag{81.9}$$

The first term is the static part, related to the strain as in the static case; here it is important
that, in the presence of dispersion, the capacitance $C(\omega)$ is expressed in terms of $\epsilon(\omega)$ in the
same way as in the static case. The second term depends only on the frequency change.
From (81.8) and (81.9) we find as the static part

$$(\delta C)_{\mathrm{st}} = -\frac{1}{\omega^2}\frac{d(\omega^2C)}{d\omega}\,\delta\omega. \tag{81.10}$$

When (81.6) is substituted in (81.7), and (81.10) is used, $dC/d\omega$ disappears, and the energy
variation becomes

$$\delta\bar{\mathscr U} = -\tfrac12\overline{\phi^2}(\delta C)_{\mathrm{st}} = -\tfrac12(\overline{e^2}/C^2)(\delta C)_{\mathrm{st}}, \tag{81.11}$$

which is in fact the same as the averaged second term in (81.3).

The disappearance from $\delta\bar{\mathscr U}$ of the terms involving the derivative with respect to $\omega$ is
quite general and does not depend on a specific manner of change in the state of the body
(in this case, the capacitor). In particular, for a dispersive medium formula (14.1) (with $E^2$
replaced by $\overline{E^2}$) remains valid for the change in the free energy due to a small change in $\epsilon$:

$$\delta\bar{\mathscr F} = -\int\delta\epsilon(\omega)\,\overline{E^2}\,dV/8\pi. \tag{81.12}$$

Knowing the stress tensor, we can use (75.17) to find the force on unit volume of the
dielectric. The terms containing spatial derivatives coincide with the corresponding terms
in the time-averaged expression (75.18), in which we must put $\mu = 1$. The term in the time
derivative (the Abraham force) is not the same. This term arises as the difference

$$\frac{1}{4\pi c}\left[\frac{\partial}{\partial t}(\mathbf D\times\mathbf H) - \frac{\partial}{\partial t}(\mathbf E\times\mathbf H)\right],$$

which is now to be averaged over time. To do so, we express $\mathbf D$, $\mathbf E$ and $\mathbf H$ in complex form
(i.e. replace them by $\tfrac12(\mathbf D + \mathbf D^*)$ and so on), and then use (80.10) for $\partial\mathbf D/\partial t$. This gives the
Abraham force in the form

— (£-l)re-(ExH*) + 


(H. Washimi and V. I. Karpman, 1976).

The stress tensor in a variable field is significant for absorbing as well as transparent
media, unlike the internal energy, for which the problem can be formulated only by
neglecting the dissipation. There is, however, reason to suppose that in an absorbing
medium the stress tensor cannot be expressed in terms of the permittivity alone, and
therefore cannot be derived in a general form by macroscopic methods.
§82. The analytical properties of $\epsilon(\omega)$

The function $f(\tau)$ in (77.3) is finite for all values of $\tau$, including zero.† For dielectrics it
tends to zero as $\tau \to \infty$. This simply expresses the fact that the value of $\mathbf D(t)$ at any instant
cannot be appreciably affected by the values of $\mathbf E(t)$ at remote instants. The physical agency
underlying the integral relation (77.3) consists in the processes of the establishment of the
electric polarization. Hence the range of values in which the function $f(\tau)$ differs
appreciably from zero is of the order of the relaxation time which characterizes these
processes.

The above statements are true also of metals, the only difference being that the function
$f(\tau) - 4\pi\sigma$, rather than $f(\tau)$ itself, tends to zero as $\tau \to \infty$. This difference arises because the
passage of a steady conduction current, though it does not cause any actual change in the
physical state of the metal, in our equations leads formally to the presence of an induction
$\mathbf D$ such that $(1/c)\,\partial\mathbf D/\partial t = 4\pi\sigma\mathbf E/c$ or

$$\mathbf D(t) = 4\pi\sigma\int_{-\infty}^{t}\mathbf E(\tau)\,d\tau = 4\pi\sigma\int_0^{\infty}\mathbf E(t - \tau)\,d\tau.$$

We have defined the function $\epsilon(\omega)$ by (77.5):

$$\epsilon(\omega) = 1 + \int_0^{\infty}e^{i\omega\tau}f(\tau)\,d\tau. \tag{82.1}$$


It is possible to derive some very general relations concerning this function by regarding co 
as a complex variable (a; = a;' + ico”). These relations could be formulated immediately, 
since the dielectric susceptibility [£((«)— l]/47r is one of the generalized susceptibilities 
discussed in SP 1, §123. We shall nevertheless repeat here some of the arguments 
and results, both to assist the reader and to emphasize certain differences between 
fclectrics and metals. From the definition (82.1) and the above-mentioned properties of 
•he function T (z), it follows that £(cu) is a one-valued function which nowhere becomes 
■nfi^ite (i.e. has no singularities) in the upper half-plane. For, when cu" > 0, the integrand in 
2.1) includes the exponentially decreasing factor and, since the function/(z) is 

•'•te throughout the region of integration, the integral converges. The function e (w) has 
singularity on the real axis (co" = 0), except possibly at the origin (where, for metals, 
'“) has a simple pole). 

th^f the lower half-plane, the definition (82.1) is invalid, since the integral diverges. Hence 
'Unction £ (m) can be defined in the lower half-plane only as the analytical continuation 
ormula (82.1) from the upper half-plane, and in general has singularities. 

Pla ® ^ physical as well as a mathematical significance in the upper half- 

•h Si'^cs the relation between D and E for fields whose amplitude increases as e“ '. In 
lower half-plane, this physical interpretation is not possible, if only because the 
sence of a field which is damped as implies an infinite field for t - 00 . 

‘ is useful to notice that the conclusion that £( 0 ;) is regular in the upper half-plane is. 
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physically, a consequence of the causality principle. The integration in (77.3) is, on account 
of this principle, taken only over times previous to t, and the region of integration in 
formula (82.1) therefore extends from 0 to oo rather than from — oo to co. 

It is evident also from the definition (82.1) that

$$\varepsilon(-\omega^*) = \varepsilon^*(\omega). \tag{82.2}$$

This generalizes the relation (77.7) for real $\omega$. In particular, for purely imaginary $\omega$ we have

$$\varepsilon(i\omega'') = \varepsilon^*(i\omega''), \tag{82.3}$$

i.e. the function $\varepsilon(\omega)$ is real on the positive imaginary axis.†

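It should be emphasized that the property (82.2) merely expresses the fact that the operator relation $\mathbf{D} = \hat\varepsilon\mathbf{E}$ must give real values of D for real E. If the function $\mathbf{E}(t)$ is given by the real expression

$$\mathbf{E} = \mathbf{E}_0 e^{-i\omega t} + \mathbf{E}_0^* e^{i\omega^* t}, \tag{82.4}$$

then, applying the operator $\hat\varepsilon$ to each term, we have

$$\mathbf{D} = \varepsilon(\omega)\mathbf{E}_0 e^{-i\omega t} + \varepsilon(-\omega^*)\mathbf{E}_0^* e^{i\omega^* t},$$

and the condition for this to be real is just (82.2).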

According to the results of §80, the imaginary part of $\varepsilon(\omega)$ is positive for positive real $\omega = \omega'$, i.e. on the right-hand half of the real axis. Since, by (82.2), $\operatorname{im}\varepsilon(-\omega') = -\operatorname{im}\varepsilon(\omega')$, the imaginary part of $\varepsilon(\omega)$ is negative on the left-hand half of this axis. Thus

$$\operatorname{im}\varepsilon \gtrless 0 \quad\text{for}\quad \omega = \omega' \gtrless 0. \tag{82.5}$$

At $\omega = 0$, $\operatorname{im}\varepsilon$ changes sign, passing through zero for dielectrics and through infinity for metals. This is the only point on the real axis for which $\operatorname{im}\varepsilon(\omega)$ can vanish.

When $\omega$ tends to infinity in any manner in the upper half-plane, $\varepsilon(\omega)$ tends to unity. This has been shown in §78 for the case where $\omega$ tends to infinity along the real axis. The general result is seen from formula (82.1): if $\omega \to \infty$ in such a way that $\omega'' \to \infty$, the integral in (82.1) vanishes because of the factor $e^{-\omega''\tau}$ in the integrand, while if $\omega''$ remains finite but $|\omega'| \to \infty$ the integral vanishes because of the oscillating factor $e^{i\omega'\tau}$.

The above properties of the function $\varepsilon(\omega)$ are sufficient to prove the following theorem: the function $\varepsilon(\omega)$ does not take real values at any finite point in the upper half-plane except on the imaginary axis, where it decreases monotonically from $\varepsilon_0 > 1$ (for dielectrics) or from $+\infty$ (for metals) at $\omega = i0$ to 1 at $\omega = i\infty$. Hence, in particular, it follows that the function $\varepsilon(\omega)$ has no zeros in the upper half-plane. We shall not repeat here the proof of these results given in SP 1, §123; it need only be remembered that the generalized susceptibility is $\varepsilon(\omega) - 1$, not $\varepsilon(\omega)$.
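The theorem can be illustrated on a concrete function. The sketch below is not from the text: it assumes a single-resonance (Lorentz-type) permittivity with arbitrarily chosen parameters `wp`, `w0`, `gamma`, samples it on the positive imaginary axis, and checks that the values are real and decrease monotonically from the static value towards 1.

```python
def eps_lorentz(omega, wp=1.0, w0=1.0, gamma=0.2):
    """Single-resonance model permittivity (an assumed illustration, not from the text)."""
    return 1.0 + wp**2 / (w0**2 - omega**2 - 1j * gamma * omega)

# Sample the positive imaginary axis omega = i*s with s > 0.
s_values = [0.01 * k for k in range(1, 2001)]
on_axis = [eps_lorentz(1j * s) for s in s_values]

# Largest imaginary part found on the axis (the theorem says it should vanish).
max_imag = max(abs(e.imag) for e in on_axis)

# Real parts should decrease strictly as s grows.
real_parts = [e.real for e in on_axis]
is_decreasing = all(a > b for a, b in zip(real_parts, real_parts[1:]))

# Static value eps_0 = eps(0), the upper end of the monotonic decrease.
eps_static = eps_lorentz(0.0).real

print(max_imag, is_decreasing, eps_static, real_parts[0], real_parts[-1])
```

For this model the denominator on the imaginary axis is $\omega_0^2 + s^2 + \gamma s$, which is real and increasing in $s$, so the behaviour stated in the theorem is visible directly.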

† This is not in general true for the negative imaginary axis. Here the function $\varepsilon(\omega)$ may have branch points, and a cut along the negative axis may be necessary in order to define it as an analytic function in the lower half-plane. Equation (82.2) then signifies only that $\varepsilon(\omega)$ has complex conjugate values on the two sides of the cut.

We shall also not repeat the derivation of the relations between the imaginary and real parts of $\varepsilon(\omega)$, but give only the final formulae, with the notation appropriately modified. We write the function $\varepsilon(\omega)$ of the real variable $\omega$, as in §77, in the form $\varepsilon(\omega) = \varepsilon'(\omega) + i\varepsilon''(\omega)$. If $\varepsilon(\omega)$ relates to a dielectric, the relations in question are

$$\varepsilon'(\omega) - 1 = \frac{1}{\pi}\,P\!\int_{-\infty}^{\infty} \frac{\varepsilon''(x)}{x - \omega}\,dx, \tag{82.6}$$

$$\varepsilon''(\omega) = -\frac{1}{\pi}\,P\!\int_{-\infty}^{\infty} \frac{\varepsilon'(x) - 1}{x - \omega}\,dx \tag{82.7}$$

(H. A. Kramers and R. de L. Kronig, 1927). It should be emphasized that the only important property of the function $\varepsilon(\omega)$ used in the proof is that it is regular in the upper half-plane. Hence we can say that Kramers and Kronig's formulae, like this property of $\varepsilon(\omega)$, are a direct consequence of the causality principle.

Using the fact that $\varepsilon''(x)$ is an odd function, we can rewrite (82.6) as

$$\varepsilon'(\omega) - 1 = \frac{2}{\pi}\,P\!\int_{0}^{\infty} \frac{x\,\varepsilon''(x)}{x^2 - \omega^2}\,dx. \tag{82.8}$$

If a metal is concerned, the function $\varepsilon(\omega)$ has a pole at the point $\omega = 0$, near which $\varepsilon = 4\pi i\sigma/\omega$ (77.9). This gives an additional term in (82.7) (cf. SP 1, (123.18)):

$$\varepsilon''(\omega) = -\frac{1}{\pi}\,P\!\int_{-\infty}^{\infty} \frac{\varepsilon'(x) - 1}{x - \omega}\,dx + \frac{4\pi\sigma}{\omega}, \tag{82.9}$$


but (82.6) and (82.8) remain unchanged. A further remark is also necessary as regards metals. We have said at the end of §77 that there may be ranges of frequency for metals in which the function $\varepsilon(\omega)$ becomes physically meaningless on account of the spatial non-uniformity of the field. In the formulae given here, however, the integration must be taken over all frequencies. In such cases $\varepsilon(\omega)$ must be taken, in the frequency ranges concerned, as the function obtained by solving the formal problem of the behaviour of the body in a fictitious uniform periodic electric field (and not in the necessarily non-uniform field of the electromagnetic wave).

Formula (82.8) is of particular importance: it makes possible a calculation of the function $\varepsilon'(\omega)$ if the function $\varepsilon''(\omega)$ is known even approximately (for example, empirically) for a given body. It is important to note that, for any function $\varepsilon''(\omega)$ satisfying the physically necessary condition $\varepsilon'' > 0$ for $\omega > 0$, formula (82.8) gives a function $\varepsilon'(\omega)$ consistent with all physical requirements, i.e. one which is in principle possible (the sign and magnitude of $\varepsilon'$ are subject to no general physical restrictions). This makes it possible to use formula (82.8) even when the function $\varepsilon''(\omega)$ is approximate. Formula (82.7), on the other hand, does not give a physically possible function $\varepsilon''(\omega)$ for an arbitrary choice of the function $\varepsilon'(\omega)$, since the condition that $\varepsilon''(\omega) > 0$ is not necessarily fulfilled.
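The use of formula (82.8) to obtain $\varepsilon'$ from a known $\varepsilon''$ can be sketched numerically. The example below is an illustration under stated assumptions, not a method taken from the text: the test function `eps_lorentz` is an assumed single-resonance model, the integral is cut off at a hypothetical `x_max`, and the principal value is handled by subtracting the value of the numerator at the pole and restoring that piece analytically.

```python
import math

def eps_lorentz(omega, wp=1.0, w0=1.0, gamma=0.2):
    """Assumed single-resonance model, used only as a test function."""
    return 1.0 + wp**2 / (w0**2 - omega**2 - 1j * gamma * omega)

def eps_real_from_kk(omega, eps_imag, x_max=100.0, steps=200_000):
    """Evaluate (82.8) by the midpoint rule on [0, x_max].

    The pole at x = omega is removed by subtracting omega*eps''(omega) in the
    numerator; the subtracted piece is restored analytically, using
    P int_0^X dx/(x^2 - w^2) = ln((X - w)/(X + w)) / (2w).
    """
    h = x_max / steps
    pole_value = omega * eps_imag(omega)
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * h
        denom = x * x - omega * omega
        if denom == 0.0:  # midpoint landed exactly on the pole: skip this cell
            continue
        total += (x * eps_imag(x) - pole_value) / denom
    total *= h
    total += pole_value * math.log((x_max - omega) / (x_max + omega)) / (2.0 * omega)
    return 1.0 + 2.0 / math.pi * total

omega = 0.5
kk_value = eps_real_from_kk(omega, lambda x: eps_lorentz(x).imag)
direct_value = eps_lorentz(omega).real
print(kk_value, direct_value)
```

The two printed numbers should agree closely, since the model function is regular in the upper half-plane; the residual difference comes only from the finite cut-off and step.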

In dispersion theory the expression for $\varepsilon'(\omega)$ is customarily written in the form

$$\varepsilon'(\omega) - 1 = \frac{4\pi e^2}{m}\,P\!\int_0^\infty \frac{f(x)}{x^2 - \omega^2}\,dx, \tag{82.10}$$

where $e$ and $m$ are the charge and mass of the electron, and $f(\omega)\,d\omega$ is called the oscillator strength in the frequency range $d\omega$. According to (82.8), this quantity is related to $\varepsilon''(\omega)$ by

$$f(\omega) = \frac{m}{2\pi^2 e^2}\,\omega\,\varepsilon''(\omega). \tag{82.11}$$

For metals, $f(\omega)$ tends to a finite limit as $\omega \to 0$.

For sufficiently large $\omega$, $x$ can be neglected in comparison with $\omega$ in the integrand in (82.8). Then

$$\varepsilon'(\omega) - 1 = -\frac{2}{\pi\omega^2}\int_0^\infty x\,\varepsilon''(x)\,dx.$$

For the permittivity at high frequencies, on the other hand, formula (78.1) holds, and a comparison shows that we have the sum rule

$$\frac{m}{2\pi^2 e^2}\int_0^\infty \omega\,\varepsilon''(\omega)\,d\omega = \int_0^\infty f(\omega)\,d\omega = N, \tag{82.12}$$

where $N$ is the total number of electrons per unit volume.
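The sum rule (82.12) can be checked on a model function. The sketch below assumes a single-resonance $\varepsilon''$ with $\omega_p^2 = 4\pi N e^2/m$ (the model and its parameters are illustrative assumptions); the units are chosen so that $e = m = 1$ and $\omega_p = 1$, i.e. $N = 1/4\pi$. The slowly decaying tail beyond the hypothetical cut-off `x_max` is added from its leading asymptotic term.

```python
import math

def eps_imag_lorentz(x, wp=1.0, w0=1.0, gamma=0.2):
    """Imaginary part of an assumed single-resonance permittivity."""
    return gamma * x * wp**2 / ((w0**2 - x**2) ** 2 + (gamma * x) ** 2)

def first_moment(wp=1.0, w0=1.0, gamma=0.2, x_max=400.0, steps=400_000):
    """Midpoint-rule estimate of the integral of x*eps''(x) over (0, infinity)."""
    h = x_max / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * h
        total += x * eps_imag_lorentz(x, wp, w0, gamma)
    # Beyond x_max the integrand behaves as gamma*wp^2/x^2; add that tail.
    return total * h + gamma * wp**2 / x_max

moment = first_moment()
expected_moment = 0.5 * math.pi          # (pi/2) * wp^2 with wp = 1

# With e = m = 1 the left-hand side of (82.12) is moment / (2 pi^2).
n_recovered = moment / (2.0 * math.pi**2)
n_assumed = 1.0 / (4.0 * math.pi)        # from wp^2 = 4 pi N e^2 / m = 1
print(moment, expected_moment, n_recovered, n_assumed)
```

The result does not depend on the assumed damping constant or resonance frequency, which is the content of the sum rule for this model.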

If $\varepsilon''(\omega)$ is regular at $\omega = 0$, we can take the limit $\omega \to 0$ in formula (82.8), obtaining

$$\varepsilon'(0) - 1 = \frac{2}{\pi}\int_0^\infty \frac{\varepsilon''(x)}{x}\,dx. \tag{82.13}$$

If the point $\omega = 0$ is a singularity of $\varepsilon''(\omega)$ (as in metals), the limit of the integral (82.8) as $\omega \to 0$ is not what is obtained by simply deleting the term in $\omega$. To calculate the limit we must first replace $\varepsilon''(x)$ in the integrand by $\varepsilon''(x) - 4\pi\sigma/x$; the value of the integral is unchanged, because

$$P\!\int_0^\infty \frac{dx}{x^2 - \omega^2} = 0.$$

For a dielectric, formula (82.13) can be rewritten as

$$\varepsilon_0 - 1 = \frac{4\pi e^2}{m}\,N\,\overline{\omega^{-2}}, \tag{82.14}$$

where the bar denotes averaging with respect to the oscillator strength.

Finally, we can derive a formula which relates the values of $\varepsilon(\omega)$ on the upper half of the imaginary axis to those of $\varepsilon''(\omega)$ on the real axis:

$$\varepsilon(i\omega) - 1 = \frac{2}{\pi}\int_0^\infty \frac{x\,\varepsilon''(x)}{x^2 + \omega^2}\,dx. \tag{82.15}$$

Integrating this relation over all $\omega$, we obtain

$$\int_0^\infty [\varepsilon(i\omega) - 1]\,d\omega = \int_0^\infty \varepsilon''(\omega)\,d\omega. \tag{82.16}$$
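The step from (82.15) to (82.16) can be written out as follows. This is a short worked computation, assuming (82.15) in the form $\varepsilon(i\omega) - 1 = (2/\pi)\int_0^\infty x\,\varepsilon''(x)\,dx/(x^2 + \omega^2)$ and assuming that the order of the two integrations may be interchanged:

```latex
\int_0^\infty [\varepsilon(i\omega) - 1]\,d\omega
  = \frac{2}{\pi}\int_0^\infty x\,\varepsilon''(x)
      \left[\int_0^\infty \frac{d\omega}{x^2 + \omega^2}\right] dx
  = \frac{2}{\pi}\int_0^\infty x\,\varepsilon''(x)\,\frac{\pi}{2x}\,dx
  = \int_0^\infty \varepsilon''(x)\,dx .
```

The inner integral is elementary, $\int_0^\infty d\omega/(x^2 + \omega^2) = \pi/2x$ for $x > 0$, so the factor $x$ in the numerator cancels.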

All the above results are applicable, apart from slight changes, to the magnetic permeability $\mu(\omega)$. The differences are due principally to the fact that the function $\mu(\omega)$ ceases to be physically meaningful at relatively low frequencies. Hence, for example, Kramers and Kronig's formulae must be applied to $\mu(\omega)$ as follows. We consider not an infinite but a finite range of $\omega$ (from 0 to $\omega_1$),† which extends only to frequencies where $\mu$ is still meaningful but no longer variable, so that its imaginary part may be taken as zero; let the real quantity $\mu(\omega_1)$ be denoted by $\mu_1$. Then formula (82.8) must be written as

$$\mu'(\omega) - \mu_1 = \frac{2}{\pi}\,P\!\int_0^{\omega_1} \frac{x\,\mu''(x)}{x^2 - \omega^2}\,dx. \tag{82.17}$$

Unlike $\varepsilon_0$, the value $\mu_0$ of $\mu(0)$ may be either less than or greater than unity. The variation of $\mu(\omega)$ along the imaginary axis is again a monotonic decrease, from $\mu_0$ to $\mu_1 < \mu_0$.

Lastly, it may be noted that the analytical properties of $\varepsilon(\omega)$ derived in this section are also possessed by $\eta(\omega) = 1/\varepsilon(\omega)$. For example, $\eta(\omega)$ is analytic in the upper half-plane, because $\varepsilon(\omega)$ is analytic and has no zeros in that half-plane. The same Kramers-Kronig relations (82.6) and (82.7) apply to $\eta(\omega)$ as to $\varepsilon(\omega)$.

† In fact, $\omega_1$ must be such that $\omega_1\tau \gg 1$, where $\tau$ is the shortest relaxation time for ferromagnetic and paramagnetic processes in a magnetic material.


§83. A plane monochromatic wave

Maxwell's equations (77.2) for a monochromatic field are

$$i\omega\mu(\omega)\mathbf{H} = c\,\operatorname{curl}\mathbf{E}, \qquad i\omega\varepsilon(\omega)\mathbf{E} = -c\,\operatorname{curl}\mathbf{H}. \tag{83.1}$$

These equations as they stand are complete, since equations (77.1) follow from (83.1) and do not require separate consideration. Assuming the medium homogeneous, and eliminating H from equations (83.1), we obtain the second-order equation

$$\Delta\mathbf{E} + \varepsilon\mu\,(\omega^2/c^2)\,\mathbf{E} = 0; \tag{83.2}$$

elimination of E gives a similar equation for H.

Let us consider a plane electromagnetic wave propagated in an infinite homogeneous medium. In a plane wave in a vacuum, the spatial dependence of the field is given by a factor $e^{i\mathbf{k}\cdot\mathbf{r}}$, with a real wave vector k. In considering wave propagation in matter, however, it is in general necessary to take k complex: $\mathbf{k} = \mathbf{k}' + i\mathbf{k}''$, where the vectors $\mathbf{k}'$ and $\mathbf{k}''$ are real.

Taking E and H as proportional to $e^{i\mathbf{k}\cdot\mathbf{r}}$ and carrying out the differentiation with respect to the coordinates in equations (83.1), we obtain

$$\omega\mu\mathbf{H} = c\,\mathbf{k}\times\mathbf{E}, \qquad \omega\varepsilon\mathbf{E} = -c\,\mathbf{k}\times\mathbf{H}. \tag{83.3}$$

Eliminating E and H from these two equations, we obtain for the square of the wave vector

$$\mathbf{k}^2 \equiv k'^2 - k''^2 + 2i\,\mathbf{k}'\cdot\mathbf{k}'' = \varepsilon\mu\,\omega^2/c^2. \tag{83.4}$$

We see that k can be real only if $\varepsilon$ and $\mu$ are real and positive. Even then, however, k may still be complex if $\mathbf{k}'\cdot\mathbf{k}'' = 0$; we shall meet with such a case in discussing total reflection in §86.

It must be borne in mind that, in the general case of complex k, the term plane wave is purely conventional. Putting $e^{i\mathbf{k}\cdot\mathbf{r}} = e^{i\mathbf{k}'\cdot\mathbf{r}}\,e^{-\mathbf{k}''\cdot\mathbf{r}}$, we see that the planes perpendicular to the vector $\mathbf{k}'$ are planes of constant phase. The planes of constant amplitude, however, are those perpendicular to $\mathbf{k}''$, the direction in which the wave is damped. The surfaces on which the field itself is constant are in general not planes at all. Such waves are called inhomogeneous plane waves, in contradistinction to ordinary homogeneous plane waves.

The general relation between the electric and magnetic field components is given by formulae (83.3). In particular, taking the scalar product of these formulae with k, we obtain

$$\mathbf{k}\cdot\mathbf{E} = 0, \qquad \mathbf{k}\cdot\mathbf{H} = 0, \tag{83.5}$$

and, squaring either and using (83.4),

$$\mathbf{E}^2 = \mu\mathbf{H}^2/\varepsilon. \tag{83.6}$$

It must be remembered, however, that because all three vectors k, E and H are complex these formulae do not in general have the same evident significance as when the vectors are real.

We shall not give the cumbersome relations valid in the general case, but consider only the most important particular cases. Especially simple results are obtained for a wave propagated without damping in a non-absorbing (transparent) homogeneous medium. The wave vector is real, and its magnitude is

$$k = \sqrt{\varepsilon\mu}\;\omega/c = n\omega/c, \tag{83.7}$$

where $n = \sqrt{\varepsilon\mu}$ is called the refractive index of the medium. The electric and magnetic fields are both in a plane perpendicular to the vector k (a pure transverse wave); they are mutually perpendicular, and are related by

$$\mathbf{H} = \sqrt{\varepsilon/\mu}\;\mathbf{l}\times\mathbf{E}, \tag{83.8}$$

where $\mathbf{l}$ is a unit vector in the direction of k. Hence it follows that $\varepsilon\,\mathbf{E}\cdot\mathbf{E}^* = \mu\,\mathbf{H}\cdot\mathbf{H}^*$; this does not mean (as it would in the absence of dispersion) that the electric and magnetic energies in the wave are equal, since these energies are given by different expressions (namely, the two terms in formula (80.11)). The total electromagnetic energy density in this case may be written

$$\bar U = \frac{1}{16\pi\mu\omega}\,\frac{d(\omega^2\varepsilon\mu)}{d\omega}\,\mathbf{E}\cdot\mathbf{E}^*. \tag{83.9}$$

The velocity $u$ with which the wave is propagated in the medium is given by the familiar expression for the group velocity:†

$$u = \frac{d\omega}{dk} = \frac{c}{d(n\omega)/d\omega}. \tag{83.10}$$

Here $u = \bar S/\bar U$, in accordance with its significance as the velocity of transfer of energy in the wave packet; $\bar U$ is the energy density given by formula (83.9), and

$$\bar S = \frac{c}{8\pi}\sqrt{\frac{\varepsilon}{\mu}}\;\mathbf{E}\cdot\mathbf{E}^* \tag{83.11}$$

is the mean value of the Poynting vector. In the absence of dispersion, when the refractive index is independent of frequency, the expression (83.10) becomes simply $c/n$; cf. (75.13).
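The equality of the energy-transfer velocity and the group velocity can be verified in one line. The computation below is a sketch which assumes the mean energy density in the form $\bar U = (16\pi\mu\omega)^{-1}\,[d(\omega^2\varepsilon\mu)/d\omega]\,\mathbf{E}\cdot\mathbf{E}^*$ and the mean Poynting vector in the form $\bar S = (c/8\pi)\sqrt{\varepsilon/\mu}\,\mathbf{E}\cdot\mathbf{E}^*$, with $n = \sqrt{\varepsilon\mu}$:

```latex
\frac{\bar S}{\bar U}
  = \frac{(c/8\pi)\sqrt{\varepsilon/\mu}}
         {(16\pi\mu\omega)^{-1}\, d(\omega^2\varepsilon\mu)/d\omega}
  = \frac{2c\,\omega\sqrt{\varepsilon\mu}}{d(\omega^2 n^2)/d\omega}
  = \frac{2c\,\omega n}{2\omega n\; d(n\omega)/d\omega}
  = \frac{c}{d(n\omega)/d\omega} .
```

The last expression is the group velocity $d\omega/dk$ obtained by differentiating $k = n\omega/c$.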

Next, let us consider a more general case, the propagation of an electromagnetic wave in an absorbing medium, the wave vector having a definite direction (i.e. $\mathbf{k}'$ and $\mathbf{k}''$ being parallel). Then the wave is literally plane, since the surfaces of constant field in it are planes perpendicular to the direction of propagation (a homogeneous plane wave).

In this case we can introduce the complex "length" $k$ of the wave vector, given by $\mathbf{k} = k\mathbf{l}$ ($\mathbf{l}$ being a unit vector in the direction of $\mathbf{k}'$ and $\mathbf{k}''$), and from (83.4) we have $k = \sqrt{\varepsilon\mu}\;\omega/c$. The complex quantity $\sqrt{\varepsilon\mu}$ is usually written in the form $n + i\kappa$, with real $n$ and $\kappa$, so that

$$k = \sqrt{\varepsilon\mu}\;\omega/c = (n + i\kappa)\,\omega/c. \tag{83.12}$$

The quantity $n$ is called the refractive index of the medium, and $\kappa$ the absorption coefficient: the latter gives the rate of damping of the wave during its propagation. It should be emphasized, however, that the damping of the wave need not be due to true absorption; dissipation of energy occurs only when $\varepsilon$ or $\mu$ is complex, but $\kappa$ may be different from zero if $\varepsilon$ and $\mu$ are real and negative.

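We may express $n$ and $\kappa$ in terms of the real and imaginary parts of the permittivity (taking $\mu = 1$). From the equation

$$n^2 - \kappa^2 + 2in\kappa = \varepsilon' + i\varepsilon''$$

we have $n^2 - \kappa^2 = \varepsilon'$, $2n\kappa = \varepsilon''$. Solving these equations for $n$ and $\kappa$, we have‡

$$n = \sqrt{\tfrac{1}{2}\left[\varepsilon' + \sqrt{\varepsilon'^2 + \varepsilon''^2}\right]}, \qquad \kappa = \sqrt{\tfrac{1}{2}\left[-\varepsilon' + \sqrt{\varepsilon'^2 + \varepsilon''^2}\right]}. \tag{83.13}$$

In particular, for metals and in the frequency range where formula (77.9) is valid, the imaginary part of $\varepsilon$ is large compared with its real part, and is related to the conductivity by $\varepsilon'' = 4\pi\sigma/\omega$; neglecting $\varepsilon'$ in comparison with $\varepsilon''$, we find that $n$ and $\kappa$ are equal, in agreement with (59.4):

$$n = \kappa = \sqrt{2\pi\sigma/\omega}. \tag{83.14}$$

† When considerable absorption occurs, the group velocity cannot be used, since wave packets are not propagated but rapidly "ironed out".

‡ Since $\varepsilon'' > 0$, the signs of $n$ and $\kappa$ must be the same, in accordance with the fact that the wave is damped in its direction of propagation. The choice of positive signs in (83.13) corresponds to propagation in the direction of $\mathbf{l}$.

The relation between the fields E and H in this homogeneous plane wave is again given by formula (83.8), but $\varepsilon$ and $\mu$ are now complex. The formula again shows that the two fields and the direction of propagation are mutually perpendicular. If $\mu = 1$, we write $\sqrt{\varepsilon} = \sqrt{n^2 + \kappa^2}\,\exp[i\tan^{-1}(\kappa/n)]$, which shows that the magnetic field is $\sqrt{n^2 + \kappa^2}$ times the electric field in magnitude and differs by $\tan^{-1}(\kappa/n)$ from it in phase; in particular, when (83.14) holds, the phase difference is $\tfrac{1}{4}\pi$.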
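The solution of $n^2 - \kappa^2 = \varepsilon'$, $2n\kappa = \varepsilon''$ with positive signs can be checked directly. The sketch below (with $\mu = 1$; the function name `n_kappa` and all numerical values are illustrative assumptions) compares the result with the complex square root and then looks at the metal-like limit $\varepsilon'' = 4\pi\sigma/\omega \gg |\varepsilon'|$.

```python
import cmath
import math

def n_kappa(eps1, eps2):
    """Return (n, kappa) for eps = eps1 + i*eps2 with eps2 >= 0, taking mu = 1."""
    modulus = math.hypot(eps1, eps2)
    n = math.sqrt(0.5 * (modulus + eps1))
    kappa = math.sqrt(0.5 * (modulus - eps1))
    return n, kappa

# Generic absorbing medium: compare with the principal complex square root.
n, kappa = n_kappa(2.0, 1.5)
root = cmath.sqrt(complex(2.0, 1.5))

# Metal-like limit: eps'' = 4*pi*sigma/omega much larger than eps'.
sigma_over_omega = 1.0e4                      # illustrative value
n_m, kappa_m = n_kappa(1.0, 4.0 * math.pi * sigma_over_omega)
limit = math.sqrt(2.0 * math.pi * sigma_over_omega)

# Phase shift between H and E, tan^-1(kappa/n), which tends to pi/4.
phase_m = math.atan2(kappa_m, n_m)
print(n, kappa, root, n_m / limit, kappa_m / limit, phase_m)
```

In the second case both $n$ and $\kappa$ approach $\sqrt{2\pi\sigma/\omega}$ and the phase difference approaches $\pi/4$, as stated above.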


PROBLEM 

At a given instant t = 0 an electromagnetic perturbation occurs in some region of space. The perturbation is 
not maintained by external agencies, and is therefore damped in time. Find the damping decrement. 

Solution. We expand the initial perturbation as a Fourier integral with respect to the coordinates, and consider a component having a (real) wave vector k. The time dependence of this component is given, for sufficiently large $t$, by a factor $e^{-i\omega t}$ with a complex frequency $\omega$, which is to be determined; the damping decrement is $-\operatorname{im}\omega$.

From the equations $-\dot{\mathbf{H}}/c = \operatorname{curl}\mathbf{E} = i\mathbf{k}\times\mathbf{E}$, $\dot{\mathbf{D}}/c = \operatorname{curl}\mathbf{H} = i\mathbf{k}\times\mathbf{H}$ we have, eliminating H,

$$\ddot{\mathbf{D}}/c^2 = \mathbf{k}\times(\mathbf{k}\times\mathbf{E}). \tag{1}$$

We take the direction of k as the $x$-axis. The longitudinal part of the perturbation therefore satisfies $\ddot D_x = 0$, whence $D_x = 0$.

The relation between $D_x$ and $E_x$ is given by an integral operator:

$$E_x(t) = \hat\varepsilon^{-1} D_x = \int_{-\infty}^{t} F(t - \tau)\,D_x(\tau)\,d\tau. \tag{2}$$

Since we have $D_x(t) = 0$ for $t > 0$ (there are no field sources when $t > 0$), it follows that

$$E_x(t) = \int_{-\infty}^{0} F(t - \tau)\,D_x(\tau)\,d\tau.$$

Hence we see that, for large $t$, the time dependence of $E_x$ is given essentially by that of the function $F(t)$.

For a monochromatic field, (2) gives

$$\frac{1}{\varepsilon(\omega)} = \int_0^\infty F(\tau)\,e^{i\omega\tau}\,d\tau, \qquad F(t) = \frac{1}{2\pi}\int_{-\infty}^{\infty} \frac{e^{-i\omega t}}{\varepsilon(\omega)}\,d\omega.$$

To estimate this integral for large $t$, we displace the path of integration into the lower half-plane of $\omega$, where the integrand decreases rapidly. The singularities of the function $1/\varepsilon(\omega)$, i.e. the zeros and branch points of $\varepsilon(\omega)$, must be excluded from the contour. The integral is then essentially proportional to $e^{-i\omega_0 t}$, where $\omega_0$ is the singularity nearest the real axis. This gives the solution for the longitudinal part of the perturbation.

For the transverse components, we have from (1) $\ddot D_{y,z}/c^2 + k^2 E_{y,z} = 0$. A similar analysis shows that the required "frequency" $\omega_0$ is in this case the zero or branch point of the function $\omega^2\varepsilon(\omega) - c^2k^2$ nearest the real axis.


§84. Transparent media

Let us apply the general formulae derived in §82 to media which absorb only slightly in a given range of frequencies, i.e. assuming that for these frequencies the imaginary part of the permittivity may be neglected.

In such a case there is no need to take the principal value in formula (82.8), since the point $x = \omega$ does not in practice lie in the region of integration. The integral can then be differentiated in the usual way with respect to the parameter $\omega$, giving

$$\frac{d\varepsilon}{d\omega} = \frac{4\omega}{\pi}\int_0^\infty \frac{x\,\varepsilon''(x)}{(x^2 - \omega^2)^2}\,dx.$$

Since the integrand is positive throughout the region of integration, we conclude that

$$d\varepsilon(\omega)/d\omega > 0, \tag{84.1}$$

i.e. if absorption is absent the permittivity is a monotonically increasing function of the frequency.

Similarly, in the same frequency range we obtain another inequality:

$$\frac{d}{d\omega}\left[\omega^2(\varepsilon - 1)\right] = \frac{4\omega}{\pi}\int_0^\infty \frac{x^3\,\varepsilon''(x)}{(x^2 - \omega^2)^2}\,dx > 0,$$

or

$$d\varepsilon/d\omega > 2(1 - \varepsilon)/\omega. \tag{84.2}$$

If $\varepsilon < 1$, this inequality is more stringent than (84.1).

It may be noted that the inequalities (84.1) and (84.2) (together with the corresponding ones for $\mu(\omega)$) ensure that the inequality $u < c$ is satisfied by the velocity of propagation of waves. For example, if $\mu = 1$ we have $n = \sqrt{\varepsilon}$ and, replacing $\varepsilon$ by $n^2$ in (84.1) and (84.2),

$$d(n\omega)/d\omega > n, \qquad d(n\omega)/d\omega > 1/n. \tag{84.3}$$

Thus we obtain two inequalities for the velocity $u$ (83.10): $u < c/n$ and $u < cn$, whence $u < c$ whether $n < 1$ or $n > 1$. These inequalities also show that $u > 0$, i.e. the group velocity is in the same direction as the wave vector. This is quite natural, even if not logically necessary.
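The inequalities $u < c/n$ and $u < cn$ can be tried on a simple transparent model. The sketch below is an illustration only: it assumes $\varepsilon = 1 + \omega_p^2/(\omega_0^2 - \omega^2)$ (absorption confined to an infinitely narrow line at $\omega_0$), takes $\mu = 1$ and $c = 1$, and evaluates $u = c/[d(n\omega)/d\omega]$ by a central difference at frequencies on both sides of the line.

```python
import math

C = 1.0  # speed of light in units where c = 1

def n_transparent(omega, wp=1.0, w0=1.0):
    """Refractive index for eps = 1 + wp^2/(w0^2 - omega^2), an assumed lossless model."""
    eps = 1.0 + wp**2 / (w0**2 - omega**2)
    if eps <= 0.0:
        raise ValueError("eps <= 0: the wave is damped, not propagated")
    return math.sqrt(eps)

def group_velocity(omega, dw=1e-6):
    """u = c / [d(n*omega)/d(omega)], with the derivative taken numerically."""
    d_nw = ((omega + dw) * n_transparent(omega + dw)
            - (omega - dw) * n_transparent(omega - dw)) / (2.0 * dw)
    return C / d_nw

checks = []
# Frequencies below the line (n > 1) and above the opaque band (0 < n < 1).
for omega in (0.2, 0.5, 0.8, 1.6, 2.0, 5.0):
    n = n_transparent(omega)
    u = group_velocity(omega)
    checks.append((omega, n, u, 0.0 < u < min(C / n, C * n)))

for row in checks:
    print(row)
```

For this model the band $\omega_0 < \omega < \sqrt{\omega_0^2 + \omega_p^2}$ has $\varepsilon < 0$ and is deliberately excluded, since no undamped wave exists there.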

Let us suppose that the weak absorption extends over a wide range of frequencies, from $\omega_1$ to $\omega_2 \gg \omega_1$, and consider frequencies $\omega$ such that $\omega_1 \ll \omega \ll \omega_2$. The region of integration in (82.8) divides into two parts, $x < \omega_1$ and $x > \omega_2$. In the former region we can neglect $x$ in comparison with $\omega$, and in the latter region $\omega$ in comparison with $x$, in the denominator of the integrand:

$$\varepsilon(\omega) = 1 + \frac{2}{\pi}\int_{\omega_2}^{\infty} \varepsilon''(x)\,\frac{dx}{x} - \frac{2}{\pi\omega^2}\int_0^{\omega_1} x\,\varepsilon''(x)\,dx, \tag{84.4}$$

i.e. the function $\varepsilon(\omega)$ in this range is of the form $a - b/\omega^2$, where $a$ and $b$ are positive constants. The constant $b$ can be expressed in terms of the oscillator strength $N_1$ responsible for the absorption in the range from 0 to $\omega_1$ (cf. (82.12)):

$$\varepsilon(\omega) = a - 4\pi N_1 e^2/m\omega^2. \tag{84.5}$$

From this expression it follows, in particular, that, when the region of weak absorption is sufficiently wide, the permittivity in general passes through zero. In this connection it should be recalled that a literally transparent medium is one in which $\varepsilon(\omega)$ is not only real but also positive; if $\varepsilon$ is negative, the wave is damped inside the medium, even though no true dissipation of energy occurs.

For the frequency at which $\varepsilon = 0$ the induction D is zero identically, and Maxwell's equations admit a variable electric field satisfying the single equation $\operatorname{curl}\mathbf{E} = 0$, with zero magnetic field. In other words, longitudinal electric waves can occur. To determine their velocity of propagation, we must take into account the dispersion of the permittivity not only in frequency, but also with respect to the wave vector; we shall return to this topic later.

Lastly, suppose that within the broad transparency region there is a narrow absorption line near some frequency $\omega_0$. Let us consider a neighbourhood of this frequency for which

$$\gamma \ll |\omega - \omega_0| \ll \omega_0, \tag{84.6}$$

where $\gamma$ is the line width. In this range, $x$ may be replaced by $\omega_0$ in the integrand in (82.6) everywhere apart from the rapidly varying function $\varepsilon''(x)$. Then

$$\varepsilon(\omega) \cong \varepsilon'(\omega) = a + \frac{1}{\pi(\omega_0 - \omega)}\int \varepsilon''(x)\,dx, \tag{84.7}$$

the integration being taken over the absorption line, and $a$ being the slowly varying contribution of the absorption regions far from the line.


PROBLEM

A plane electromagnetic wave with a sharply defined forward front is incident normally on the boundary of a half-space ($x > 0$) occupied by a transparent medium with $\mu = 1$. Determine the structure of the front of the transmitted wave (A. Sommerfeld and L. Brillouin, 1914).

Solution. Let the wave be incident on the boundary of the medium at time $t = 0$, so that at $x = 0$ the field (E or H) of the incident wave is $E = 0$ for $t < 0$, $E \propto e^{-i\omega_0 t}$ for $t > 0$. Expanding this field as a Fourier integral with respect to time, we reduce the problem to that of waves of various frequencies and infinite extent incident on the boundary. The amplitude of the Fourier component with frequency $\omega$ is proportional to

$$\int_0^\infty e^{i(\omega - \omega_0)\tau}\,d\tau.$$

When a wave of frequency $\omega$ is incident, the transmitted wave is of the form $a(\omega)\,e^{i(kx - \omega t)}$, where the amplitude $a(\omega)$ is a slowly varying function of frequency. Hence the transmitted wave in the medium in the present problem is

$$E \propto \int_{-\infty}^{\infty} d\omega\; a(\omega)\,e^{i(kx - \omega t)} \int_0^\infty e^{i(\omega - \omega_0)\tau}\,d\tau.$$

In the region near the wave front, the important values of $\omega$ in this integral are those near $\omega_0$. Using the variable $\xi = \omega - \omega_0$, we replace $a(\omega)$ by $a(\omega_0)$, and expand the exponent in powers of $\xi$. Omitting unimportant constants and phase factors, we have

$$E \propto \int\!\!\int \exp\left\{ i\xi\left(\tau - t + \frac{x}{u}\right) - \tfrac{1}{2}\,i\,\xi^2\,\frac{u' x}{u^2} \right\} d\xi\,d\tau,$$

where $u = u(\omega_0)$ is the velocity of propagation (83.10), and $u' = [du/d\omega]_{\omega = \omega_0}$. Effecting the integration over $\xi$, we easily bring $E$ to the form

$$E \propto \int_w^\infty e^{\pm i\eta^2}\,d\eta, \qquad w = (x - ut)/\sqrt{2x|u'|},$$

the sign in the exponent depending on that of $u'$. The intensity distribution near the wave front is given by

$$I \propto \left| \int_w^\infty e^{i\eta^2}\,d\eta \right|^2.$$

This expression is of the same form as that which gives the intensity distribution near the edge of the shadow in Fresnel diffraction (see Fields, §60). For $w > 0$ the intensity decreases monotonically with increasing $w$, but for $w < 0$ it oscillates with decreasing amplitude about a constant value to which it tends as $w \to -\infty$.

At large distances preceding the front here considered there are found "precursors" propagated with velocity $c$. These correspond to the high-frequency Fourier components, for which $\varepsilon \to 1$.


THE PROPAGATION OF 
ELECTROMAGNETIC WAVES 


§85. Geometrical optics 

The condition for geometrical optics to be applicable is that the wavelength $\lambda$ should be small in comparison with the characteristic dimension $l$ of the problem (see Fields, §53). The relation between geometrical and wave optics is that, for $\lambda \ll l$, any quantity $\phi$ which describes the wave field (i.e. any component of E or H) is given by a formula of the type $\phi = a e^{i\psi}$, where the amplitude $a$ is a slowly varying function of the coordinates and time, and the phase $\psi$ is a large quantity which is "almost linear" in the coordinates and the time; it is called the eikonal, and is of great importance in geometrical optics. The time derivative of $\psi$ gives the frequency of the wave:

$$\partial\psi/\partial t = -\omega, \tag{85.1}$$

and the space derivatives give the wave vector

$$\operatorname{grad}\psi = \mathbf{k}, \tag{85.2}$$

and consequently the direction of the ray through any point in space.

For a steady monochromatic wave, the frequency is a constant and the time dependence of the eikonal is given by a term $-\omega t$. We then introduce a function $\psi_1$ (also called the eikonal), such that

$$\psi = -\omega t + (\omega/c)\,\psi_1(x, y, z). \tag{85.3}$$

Then $\psi_1$ is a function of the coordinates only, and its gradient is

$$\operatorname{grad}\psi_1 = \mathbf{n}, \tag{85.4}$$

where $\mathbf{n}$ is a vector such that

$$\mathbf{k} = \omega\mathbf{n}/c. \tag{85.5}$$

The magnitude of $\mathbf{n}$ is equal to the refractive index $n$ of the medium. Hence the equation for the eikonal in ray propagation in a medium with refractive index $n(x, y, z)$ (a given function of the coordinates) is

$$|\operatorname{grad}\psi_1|^2 = n^2. \tag{85.6}$$

The equation of ray propagation in a steady state can also be derived from Fermat's principle, according to which the integral

$$\psi_1 = \int_A^B \mathbf{n}\cdot d\mathbf{l} = \int_A^B n\,dl$$

along the path of the ray between two given points $A$ and $B$ has a value less than for any other path between $A$ and $B$. Equating to zero the variation of this integral, we have

$$\delta\psi_1 = \int_A^B (\delta n\,dl + n\,\delta dl) = 0.$$

Let $\delta\mathbf{r}$ be a displacement of the ray path under the variation. Then $\delta n = \delta\mathbf{r}\cdot\operatorname{grad} n$, $\delta dl = \mathbf{l}\cdot d\delta\mathbf{r}$, where $\mathbf{l}$ is a unit vector tangential to the ray. Substituting in $\delta\psi_1$ and integrating by parts in the second term (using the fact that $\delta\mathbf{r} = 0$ at $A$ and $B$), we have

$$\delta\psi_1 = \int_A^B \delta\mathbf{r}\cdot\operatorname{grad} n\,dl + \int_A^B n\,\mathbf{l}\cdot d\delta\mathbf{r} = \int_A^B \delta\mathbf{r}\cdot\left[\operatorname{grad} n - \frac{d(n\mathbf{l})}{dl}\right] dl = 0.$$

Hence

$$d(n\mathbf{l})/dl = \operatorname{grad} n. \tag{85.7}$$

Expanding the derivative and putting $dn/dl = \mathbf{l}\cdot\operatorname{grad} n$, we obtain

$$\frac{d\mathbf{l}}{dl} = \frac{1}{n}\left[\operatorname{grad} n - \mathbf{l}\,(\mathbf{l}\cdot\operatorname{grad} n)\right]. \tag{85.8}$$

This is the equation giving the form of the rays.

We know from differential geometry that the derivative $d\mathbf{l}/dl$ along the ray is equal to $\mathbf{N}/R$, where $\mathbf{N}$ is the unit vector along the principal normal and $R$ the radius of curvature. Taking the scalar product of both sides of (85.8) with $\mathbf{N}$, and using the fact that $\mathbf{N}$ and $\mathbf{l}$ are perpendicular, we have

$$\frac{1}{R} = \mathbf{N}\cdot\frac{\operatorname{grad} n}{n}; \tag{85.9}$$

the rays are therefore bent in the direction of increasing refractive index.
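The ray equation can be integrated numerically for a simple stratified medium. The sketch below is an illustration under assumed conditions: the index profile $n = 1 + \alpha y$ and the constant `ALPHA` are hypothetical, the motion is restricted to the $xy$-plane, and the equation $d\mathbf{l}/dl = n^{-1}[\operatorname{grad} n - \mathbf{l}(\mathbf{l}\cdot\operatorname{grad} n)]$ is advanced by a second-order midpoint scheme. Since $\operatorname{grad} n$ has no $x$-component, $d(n\mathbf{l})/dl = \operatorname{grad} n$ implies that $n l_x$ is constant along the ray, which gives a convenient check.

```python
import math

ALPHA = 0.5  # assumed gradient of the index: n = 1 + ALPHA * y

def index(x, y):
    return 1.0 + ALPHA * y

def grad_index(x, y):
    return (0.0, ALPHA)

def ray_rhs(state):
    """Derivatives along the arc length for state (x, y, lx, ly)."""
    x, y, lx, ly = state
    gx, gy = grad_index(x, y)
    along = lx * gx + ly * gy            # l . grad n
    n = index(x, y)
    return (lx, ly, (gx - lx * along) / n, (gy - ly * along) / n)

def trace(state, step=1e-3, n_steps=2000):
    """Midpoint (second-order Runge-Kutta) integration along the ray."""
    path = [state]
    for _ in range(n_steps):
        k1 = ray_rhs(state)
        mid = tuple(s + 0.5 * step * k for s, k in zip(state, k1))
        k2 = ray_rhs(mid)
        x, y, lx, ly = (s + step * k for s, k in zip(state, k2))
        norm = math.hypot(lx, ly)         # keep l a unit vector
        state = (x, y, lx / norm, ly / norm)
        path.append(state)
    return path

path = trace((0.0, 0.0, 1.0, 0.0))        # launched along x, perpendicular to grad n
x0, y0, lx0, ly0 = path[0]
x1, y1, lx1, ly1 = path[-1]
invariant_start = index(x0, y0) * lx0
invariant_end = index(x1, y1) * lx1
print(y1, ly1, invariant_start, invariant_end)
```

The ray starts perpendicular to the gradient and curves towards larger $y$, i.e. towards increasing refractive index. For this profile the exact path is $y = (\cosh\alpha x - 1)/\alpha$, so after an arc length of 2 the height should be close to $(\sqrt{2} - 1)/\alpha$.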

The velocity of propagation of rays in geometrical optics is in the direction of $\mathbf{l}$, and is given by the derivative

$$\mathbf{u} = \partial\omega/\partial\mathbf{k}. \tag{85.10}$$

This is also called the group velocity, the ratio $\omega/k$ being called the phase velocity. However, the latter is not the velocity of physical propagation of any quantity.

It is easy to derive also the equation which gives the rate of change of the radiation intensity along a ray. The intensity $I$ is the magnitude of the (time) averaged Poynting vector. This vector, like the group velocity, is in the direction of $\mathbf{l}$: $\bar{\mathbf{S}} = I\mathbf{l}$. In a steady state, the mean field energy density is constant at any given point in space. The equation of conservation of energy is therefore $\operatorname{div}\bar{\mathbf{S}} = 0$, or

$$\operatorname{div}(I\mathbf{l}) = 0. \tag{85.11}$$

This is the required equation.

Finally, let us consider how the direction of polarization of linearly polarized radiation 
varies along a ray (S. M. Rytov, 1938). As we know from differential geometry, a curve in 
space (in this case, the ray) is characterized at every point by three mutually perpendicular 
unit vectors along the tangent (1), the principal normal (N) and the binormal (b), which 
form the natural trihedral. Since the electromagnetic waves are transverse, the vectors E 
and H are always coplanar with N and b. 

Let the direction of E at some point on the ray be the same as that of $\mathbf{N}$, i.e. let E lie in the osculating plane (that of $\mathbf{N}$ and $\mathbf{l}$). The deviation of the curve from the osculating plane over a length $dl$ is of the third order of smallness with respect to $dl$. We can therefore say that, over a length $dl$ of the ray, the vector E remains in the original osculating plane. The osculating plane at the other end of $dl$ is inclined to the original one at an angle $d\phi = dl/T$, where $T$ is the radius of torsion. This is therefore the angle turned through by the vector E relative to $\mathbf{N}$ in the normal plane. Thus, over a distance $dl$ along the ray, the direction of polarization rotates in the normal plane, its angle to the principal normal varying in accordance with the equation

$$d\phi/dl = 1/T. \tag{85.12}$$

In particular, when the torsion is zero, i.e. the ray is a plane curve, the direction of the 
vector E in the normal plane is constant, as is in any case evident from symmetry. 


PROBLEMS 

Problem 1. Find the law of transformation for the velocity of light (the group velocity) in a medium when 
the frame of reference is changed. 

Solution. By the definition of the group velocity $\mathbf{u}$, $d\omega = \mathbf{u}\cdot d\mathbf{k}$ and $d\omega' = \mathbf{u}'\cdot d\mathbf{k}'$, the unprimed and primed quantities referring to the frames $K$ and $K'$; $K'$ moves with velocity $\mathbf{v}$ relative to $K$. According to the Lorentz transformation formulae for a wave four-vector,

$$k'_x = \gamma(k_x - v\omega/c^2), \qquad k'_y = k_y, \qquad k'_z = k_z, \qquad \omega = \gamma(\omega' + vk'_x),$$

where $\gamma = 1/\sqrt{1 - v^2/c^2}$; the $x$ and $x'$ axes are parallel to $\mathbf{v}$. The last of these expressions gives

$$d\omega = \gamma(d\omega' + v\,dk'_x) = \gamma(\mathbf{u}'\cdot d\mathbf{k}' + v\,dk'_x).$$

Substituting $d\mathbf{k}'$ in terms of $d\mathbf{k}$ and $d\omega$ from the other expressions, and collecting terms in $d\omega$, gives

$$\gamma(1 + vu'_x/c^2)\,d\omega = \gamma(u'_x + v)\,dk_x + u'_y\,dk_y + u'_z\,dk_z.$$

Comparison with $d\omega = \mathbf{u}\cdot d\mathbf{k}$ shows that the velocities $\mathbf{u}'$ and $\mathbf{v}$ give $\mathbf{u}$ by the usual relativistic formulae for the addition of velocities, as expected.

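Problem 2. Determine the velocity of propagation of light in a medium moving relative to the observer.

Solution. Let $\omega$ and $\mathbf{k}$ be the frequency and wave vector of the light wave in a fixed frame of reference $K$, and $\omega'$, $\mathbf{k}'$ the corresponding quantities in a frame $K'$ moving with the medium at velocity $\mathbf{v}$ relative to $K$. In the frame $K'$ the medium is at rest, and $\omega'$ and $k'$ are therefore related by

$$ck' = \omega' n(\omega'). \tag{1}$$

According to the Lorentz transformation formulae for a wave four-vector we have, as far as terms of the first order in $v/c$, $\omega' = \omega - \mathbf{k}\cdot\mathbf{v}$, $\mathbf{k}' = \mathbf{k} - \omega\mathbf{v}/c^2$, $k' = k - \omega\,\mathbf{v}\cdot\mathbf{l}/c^2$, where $\mathbf{l} = \mathbf{k}/k$. Substituting these expressions in (1) and expanding the function $n(\omega')$ in powers of $v$, we obtain to the same accuracy†

$$k = \frac{n\omega}{c} + \frac{\omega}{c^2}\,(\mathbf{v}\cdot\mathbf{l})\left[1 - n\,\frac{d(n\omega)}{d\omega}\right]. \tag{2}$$

The velocity of propagation (the group velocity) in a medium at rest is found by differentiating the relation $ck = \omega n(\omega)$:

$$u_0 = \frac{d\omega}{dk} = \frac{c}{d(n\omega)/d\omega}.$$

In a moving medium, it is found by differentiating the relation (2), which we first rewrite as

$$k = \frac{n\omega}{c} + \mathbf{v}\cdot\mathbf{k}\left(\frac{1}{nc} - \frac{1}{u_0}\right). \tag{3}$$

Again as far as the first-order terms,

$$\mathbf{u} = \frac{\partial\omega}{\partial\mathbf{k}} = u_0\mathbf{l} + \mathbf{v}\left(1 - \frac{u_0}{nc}\right) + \mathbf{l}\,(\mathbf{v}\cdot\mathbf{l})\left[\frac{u_0}{nc} - \frac{u_0^2}{c^2} - \frac{n\omega}{c}\,\frac{du_0}{d\omega}\right]. \tag{4}$$

For propagation in the direction of motion ($\mathbf{v} \parallel \mathbf{l}$), we therefore have‡

$$u = u_0 + v\,(1 - u_0^2/c^2) - (vn\omega/c)\,du_0/d\omega. \tag{5}$$

The first two terms can be derived by simply applying the relativistic expression for the addition of velocities. If $\mathbf{v}$ and $\mathbf{l}$ are mutually perpendicular, then

$$\mathbf{u} = u_0\mathbf{l} + \mathbf{v}\,(1 - u_0/cn). \tag{6}$$

The phase velocity of the wave is given by (2):

$$\frac{\omega}{k} = \frac{c}{n} + (\mathbf{v}\cdot\mathbf{l})\left[1 - \frac{1}{n^2} + \frac{\omega}{n}\,\frac{dn}{d\omega}\right];$$

when $\mathbf{v} \perp \mathbf{l}$, there is no first-order effect.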
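The first-order drag term can be compared with the exact relativistic addition of velocities in the simplest case. The sketch below assumes a non-dispersive medium ($dn/d\omega = 0$) moving along the direction of propagation, so that the first-order phase velocity is $c/n + v(1 - 1/n^2)$; the index and flow speed are illustrative values, not data from the text.

```python
C = 299_792_458.0  # speed of light, m/s

def phase_velocity_first_order(n, v, omega_dn_domega=0.0):
    """c/n + v*(1 - 1/n^2 + (omega/n) dn/domega), for v parallel to l."""
    return C / n + v * (1.0 - 1.0 / n**2 + omega_dn_domega / n)

def relativistic_addition(w, v):
    """Exact composition of two collinear velocities w and v."""
    return (w + v) / (1.0 + w * v / C**2)

n_medium = 1.333   # illustrative refractive index
v_flow = 7.0       # illustrative speed of the medium, m/s

approx = phase_velocity_first_order(n_medium, v_flow)
exact = relativistic_addition(C / n_medium, v_flow)
drag_coefficient = 1.0 - 1.0 / n_medium**2

print(approx, exact, exact - approx, drag_coefficient)
```

The two values differ only in terms of order $v^2/c$, far below the first-order shift $v(1 - 1/n^2)$ itself. With dispersion, the extra term $(\omega/n)\,dn/d\omega$ can be supplied through the hypothetical argument `omega_dn_domega`.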


§86. Reflection and refraction of electromagnetic waves 

Let us consider the reflection and refraction of a plane monochromatic electromagnetic wave at a plane boundary between two homogeneous media. Medium 1, from which the wave is incident, is assumed transparent, but not medium 2 (for the present). Quantities pertaining to the incident and reflected waves will be distinguished by the suffixes 0 and 1 respectively, and those for the refracted wave by the suffix 2 (Fig. 46, p. 294). The direction of the normal from the boundary plane into medium 2 is taken as the $z$-axis.

Since there is complete homogeneity in the $xy$-plane, the dependence of the solution of the field equations on $x$ and $y$ must be the same in all space. The components $k_x$, $k_y$ of the wave vector must therefore be the same for all three waves. Consequently, the directions of propagation of the three waves lie in one plane, which we take as the $xz$-plane.

From the equations

$$k_{0x} = k_{1x} = k_{2x}, \qquad k_0^2 = k_1^2 = \varepsilon_1\omega^2/c^2, \qquad k_2^2 = \varepsilon_2\omega^2/c^2 \tag{86.1}$$

we find

$$k_{1z} = -k_{0z} = -(\omega/c)\sqrt{\varepsilon_1}\cos\theta_0, \qquad k_{2z} = \sqrt{(\omega/c)^2\varepsilon_2 - k_{0x}^2} = (\omega/c)\sqrt{\varepsilon_2 - \varepsilon_1\sin^2\theta_0}; \tag{86.2}$$

we take $\mu = 1$ in both media. The vector $\mathbf{k}_0$ is, by definition, real, and so is $\mathbf{k}_1$. The quantity $k_{2z}$, however, is complex in an absorbing medium, and the sign of the root must be taken so that $\operatorname{im} k_{2z} > 0$, the refracted wave being damped towards the interior of medium 2.

† The second term in (2), and therefore all first-order effects, vanish identically when $n^2 = \varepsilon = 1 - \text{constant}/\omega^2$.

‡ This formula represents the Fizeau effect, first predicted by A. Fresnel (1818). The influence of dispersion on this effect was discussed by H. A. Lorentz (1895).

If both media are transparent, equations (86.1) give the familiar laws of reflection and 
refraction: 

e. = e„, = (86.3) 

‘ Sin 00 v ”2 


To determine the amplitudes of the reflected and refracted waves, we must use the
boundary conditions at the surface of separation (z = 0), and we shall consider separately
the two cases where the electric field $\mathbf{E}_0$ is in the plane of incidence and perpendicular to
that plane; from the results we can obtain the solution for the general case, where $\mathbf{E}_0$ can be
resolved into components in these two directions.

Let us first suppose that $\mathbf{E}_0$ is perpendicular to the plane of incidence. It is evident from
symmetry that the same will be true of the fields $\mathbf{E}_1$ and $\mathbf{E}_2$ in the reflected and refracted
waves. The vector $\mathbf{H}$ is in the xz-plane. The boundary conditions require the continuity of
$E_y = E$ and $H_x$; by (83.3), $H_x = -ck_zE_y/\omega$.

The field in medium 1 is the sum of the fields in the incident and reflected waves, and so
we obtain the two equations $E_0 + E_1 = E_2$, $k_{0z}(E_0 - E_1) = k_{2z}E_2$. The exponential
factors in $E$ cancel because $k_x$ (and therefore $\omega$) is the same for all three waves. In what
follows, $E$ signifies the complex amplitude of a wave.

The solution of the above equations gives Fresnel's formulae:

$$E_1 = \frac{k_{0z} - k_{2z}}{k_{0z} + k_{2z}}E_0 = \frac{\sqrt{\varepsilon_1}\cos\theta_0 - \sqrt{\varepsilon_2 - \varepsilon_1\sin^2\theta_0}}{\sqrt{\varepsilon_1}\cos\theta_0 + \sqrt{\varepsilon_2 - \varepsilon_1\sin^2\theta_0}}E_0,$$

$$E_2 = \frac{2k_{0z}}{k_{0z} + k_{2z}}E_0 = \frac{2\sqrt{\varepsilon_1}\cos\theta_0}{\sqrt{\varepsilon_1}\cos\theta_0 + \sqrt{\varepsilon_2 - \varepsilon_1\sin^2\theta_0}}E_0. \tag{86.4}$$

If both media are transparent, these formulae become, by (86.3),

$$E_1 = \frac{\sin(\theta_2 - \theta_0)}{\sin(\theta_2 + \theta_0)}E_0, \qquad E_2 = \frac{2\cos\theta_0\sin\theta_2}{\sin(\theta_2 + \theta_0)}E_0. \tag{86.5}$$


The case where E lies in the plane of incidence can be discussed similarly. Here it is more 
convenient to carry out the calculations for the magnetic field, which is perpendicular to 
the plane of incidence. A further two Fresnel’s formulae are obtained: 


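$$H_1 = \frac{\varepsilon_2 k_{0z} - \varepsilon_1 k_{2z}}{\varepsilon_2 k_{0z} + \varepsilon_1 k_{2z}}H_0 = \frac{\varepsilon_2\cos\theta_0 - \sqrt{\varepsilon_1\varepsilon_2 - \varepsilon_1^2\sin^2\theta_0}}{\varepsilon_2\cos\theta_0 + \sqrt{\varepsilon_1\varepsilon_2 - \varepsilon_1^2\sin^2\theta_0}}H_0,$$

$$H_2 = \frac{2\varepsilon_2 k_{0z}}{\varepsilon_2 k_{0z} + \varepsilon_1 k_{2z}}H_0 = \frac{2\varepsilon_2\cos\theta_0}{\varepsilon_2\cos\theta_0 + \sqrt{\varepsilon_1\varepsilon_2 - \varepsilon_1^2\sin^2\theta_0}}H_0. \tag{86.6}$$

If both media are transparent, these formulae may be written

$$H_1 = \frac{\tan(\theta_0 - \theta_2)}{\tan(\theta_0 + \theta_2)}H_0, \qquad H_2 = \frac{\sin 2\theta_0}{\sin(\theta_0 + \theta_2)\cos(\theta_0 - \theta_2)}H_0. \tag{86.7}$$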
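
The general formulae (86.4) and (86.6) and their transparent-media forms (86.5) and (86.7) can be compared numerically. The following is a minimal sketch, assuming $\mu = 1$ and measuring wave vectors in units of $\omega/c$; the function name `fresnel_amplitudes` and the sample values are illustrative and do not come from the text.

```python
import cmath
import math


def fresnel_amplitudes(eps1, eps2, theta0):
    """Amplitude ratios of (86.4) and (86.6), assuming mu = 1.

    Returns (E1/E0, E2/E0, H1/H0, H2/H0): the first pair is for E
    perpendicular to the plane of incidence, the second pair for E in
    the plane of incidence (written for the magnetic field).
    """
    k0z = cmath.sqrt(eps1) * math.cos(theta0)             # in units of omega/c
    k2z = cmath.sqrt(eps2 - eps1 * math.sin(theta0) ** 2)
    if k2z.imag < 0:
        # choose the root with im k2z > 0 (wave damped inside medium 2)
        k2z = -k2z
    r_perp = (k0z - k2z) / (k0z + k2z)
    t_perp = 2 * k0z / (k0z + k2z)
    r_par = (eps2 * k0z - eps1 * k2z) / (eps2 * k0z + eps1 * k2z)
    t_par = 2 * eps2 * k0z / (eps2 * k0z + eps1 * k2z)
    return r_perp, t_perp, r_par, t_par


# Illustrative values: vacuum to a transparent medium with eps2 = 2.25.
eps1, eps2, theta0 = 1.0, 2.25, 0.6
theta2 = math.asin(math.sqrt(eps1 / eps2) * math.sin(theta0))  # law of refraction (86.3)
r_perp, t_perp, r_par, t_par = fresnel_amplitudes(eps1, eps2, theta0)
print(r_perp, math.sin(theta2 - theta0) / math.sin(theta2 + theta0))
print(r_par, math.tan(theta0 - theta2) / math.tan(theta0 + theta2))
```

For real permittivities the two numbers printed on each line should agree, since (86.5) and (86.7) are only (86.4) and (86.6) rewritten by means of the law of refraction.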


The reflection coefficient R is defined as the ratio of the (time) averaged energy flux
reflected from the surface to the incident flux. Each of these fluxes is given by the averaged
z-component of the Poynting vector (83.11) for the wave in question,

$$R = \frac{\sqrt{\varepsilon_1}\cos\theta_1\,|E_1|^2}{\sqrt{\varepsilon_1}\cos\theta_0\,|E_0|^2} = \frac{|E_1|^2}{|E_0|^2}.$$

For normal incidence ($\theta_0 = 0$) the two modes of polarization are equivalent, and the
reflection coefficient is given by

$$R = \left|\frac{\sqrt{\varepsilon_1} - \sqrt{\varepsilon_2}}{\sqrt{\varepsilon_1} + \sqrt{\varepsilon_2}}\right|^2. \tag{86.8}$$

This formula is valid whether the reflecting medium is transparent or not. If we put
$\sqrt{\varepsilon_2} = n_2 + i\kappa_2$, and if medium 1 is a vacuum ($\varepsilon_1 = 1$), then

$$R = \frac{(n_2 - 1)^2 + \kappa_2^2}{(n_2 + 1)^2 + \kappa_2^2}. \tag{86.9}$$
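
As a numerical illustration of (86.8) and (86.9), the sketch below evaluates the normal-incidence reflection coefficient both from the complex permittivities and from $n_2$, $\kappa_2$. The function names and sample values are hypothetical, chosen only for this example.

```python
import cmath


def normal_incidence_R(eps1, eps2):
    """Reflection coefficient (86.8); eps2 may be complex."""
    s1, s2 = cmath.sqrt(eps1), cmath.sqrt(eps2)
    return abs((s1 - s2) / (s1 + s2)) ** 2


def R_from_n_kappa(n2, kappa2):
    """Reflection coefficient (86.9) for incidence from a vacuum."""
    return ((n2 - 1) ** 2 + kappa2 ** 2) / ((n2 + 1) ** 2 + kappa2 ** 2)


# Illustrative absorbing medium with sqrt(eps2) = 1.5 + 2i.
n2, kappa2 = 1.5, 2.0
print(normal_incidence_R(1.0, (n2 + 1j * kappa2) ** 2), R_from_n_kappa(n2, kappa2))
```

The two printed values should coincide, because (86.9) is (86.8) with $\varepsilon_1 = 1$ written out in terms of the real and imaginary parts of $\sqrt{\varepsilon_2}$.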


The remaining discussion assumes that both media are transparent. The following 
general remark should be made first of all. The boundary between two different media is in 
reality not a geometrical surface but a thin transition layer. The validity of the formulae 
(86.1) does not rest on any assumptions concerning the nature of this layer. The derivation 
of Fresnel’s formulae, on the other hand, is based on the use of the boundary conditions 
and assumes that the thickness $\delta$ of the transition layer is small compared with the
wavelength $\lambda$. The thickness $\delta$ is usually comparable with the distances between the atoms,
which are always small compared with $\lambda$ if the macroscopic description of the field is
legitimate, and so the condition $\lambda \gg \delta$ is usually fulfilled. In the opposite limiting case the
phenomenon of refraction is entirely different in character. For $\delta \gg \lambda$, geometrical optics is
valid ($\lambda$ being small compared with the dimensions of the inhomogeneities in the medium).
In this case, therefore, the propagation of the wave can be regarded as the propagation of 
rays which undergo refraction in the transition layer but are not reflected from it. The 
reflection coefficient, therefore, would be zero. 

Let us return now to Fresnel’s formulae. In reflection from a transparent medium, the 
coefficients of proportionality between $E_1$, $E_2$ and $E_0$ in these formulae are real.† This
means that the wave phase either remains unchanged or changes by $\pi$, depending on the
sign of the coefficients. In particular, the phase of the refracted wave is always the same as
that of the incident wave. The reflection, on the other hand, may be accompanied by a
change in phase.‡ For example, with normal incidence the phase of the wave is unchanged
if $\varepsilon_1 > \varepsilon_2$, but if $\varepsilon_1 < \varepsilon_2$ the vectors $\mathbf{E}_1$ and $\mathbf{E}_0$ are in opposite directions, i.e. the wave phase
changes by $\pi$.

The reflection coefficients for oblique incidence are, by (86.5) and (86.7), 


$$R_\perp = \frac{\sin^2(\theta_2 - \theta_0)}{\sin^2(\theta_2 + \theta_0)}, \qquad R_\parallel = \frac{\tan^2(\theta_2 - \theta_0)}{\tan^2(\theta_2 + \theta_0)}. \tag{86.10}$$


Here, and in what follows, the suffixes $\perp$ and $\parallel$ refer to the cases where the field E is
respectively perpendicular and parallel to the plane of incidence. The expressions (86.10)
are unaltered when $\theta_2$ and $\theta_0$ are interchanged (but the phase of the reflected wave changes
by $\pi$, as is seen from formulae (86.5) and (86.7)). The reflection coefficient for a wave
incident from medium 1 at an angle $\theta_0$ is therefore equal to that for a wave incident from
medium 2 at an angle $\theta_2$.

An interesting case is the reflection of light incident at an angle $\theta_0$ such that $\theta_0 + \theta_2 = \tfrac12\pi$
(the reflected and refracted rays being thus perpendicular). Let this angle be $\theta_p$; then $\sin\theta_2 =
\sin(\tfrac12\pi - \theta_p) = \cos\theta_p$, and the law of refraction (86.3) gives

$$\tan\theta_p = \sqrt{\varepsilon_2/\varepsilon_1}. \tag{86.11}$$


† We ignore for the moment the possibility of total reflection (see below).

‡ Reflection from an absorbing medium leads in general to the appearance of elliptical polarization. The
explicit expressions for the amplitude and phase relations between the three waves are then extremely involved;
they are given by J. A. Stratton, Electromagnetic Theory, Chapter IX, McGraw-Hill, New York, 1941.
For $\theta_0 = \theta_p$ we have $\tan(\theta_0 + \theta_2) = \infty$ and $R_\parallel = 0$. Hence, whatever the direction of
polarization of light incident at this angle, the reflected light will be polarized so that the
electric field is perpendicular to the plane of incidence. The reflected light is polarized in
this way even when the incident light is natural: no component with any other polarization
is reflected. The angle $\theta_p$ is called the angle of total polarization or the Brewster angle. It
should be noticed that, whereas natural light can be totally polarized by reflection, this
effect cannot be produced by refraction, whatever the angle of incidence.
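
The vanishing of $R_\parallel$ at the Brewster angle can be checked directly from (86.10) and (86.11). The following is a short sketch for transparent media; the helper names and the value $\varepsilon_2 = 2.25$ are assumptions made for the example only.

```python
import math


def brewster_angle(eps1, eps2):
    """Angle of total polarization from (86.11)."""
    return math.atan(math.sqrt(eps2 / eps1))


def reflection_coefficients(eps1, eps2, theta0):
    """R_perp and R_par from (86.10); valid for transparent media, 0 < theta0 < pi/2."""
    theta2 = math.asin(math.sqrt(eps1 / eps2) * math.sin(theta0))  # (86.3)
    r_perp = math.sin(theta2 - theta0) ** 2 / math.sin(theta2 + theta0) ** 2
    r_par = math.tan(theta2 - theta0) ** 2 / math.tan(theta2 + theta0) ** 2
    return r_perp, r_par


theta_p = brewster_angle(1.0, 2.25)
r_perp, r_par = reflection_coefficients(1.0, 2.25, theta_p)
print(math.degrees(theta_p), r_perp, r_par)
```

In floating-point arithmetic $\theta_0 + \theta_2$ is only approximately $\tfrac12\pi$, so `r_par` comes out as a very small number rather than exactly zero, while `r_perp` stays finite.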

The reflection and refraction of plane-polarized light always results in plane-polarized 
light, but the direction of polarization is in general not the same as in the incident light. Let 
$\gamma_0$ be the angle between the direction of $\mathbf{E}_0$ and the plane of incidence, and $\gamma_1$, $\gamma_2$ the
corresponding angles for the reflected and refracted waves. Using formulae (86.5) and
(86.7), we easily obtain the relations

$$\tan\gamma_1 = -\frac{\cos(\theta_0 - \theta_2)}{\cos(\theta_0 + \theta_2)}\tan\gamma_0, \qquad \tan\gamma_2 = \cos(\theta_0 - \theta_2)\tan\gamma_0. \tag{86.12}$$

The angles $\gamma_0$, $\gamma_1$ and $\gamma_2$ are equal for all angles of incidence only in the obvious cases
$\gamma_0 = 0$ and $\gamma_0 = \tfrac12\pi$; they are also equal for normal incidence ($\theta_0 = \theta_2 = 0$) and for grazing
incidence ($\theta_0 = \tfrac12\pi$, in which case there is no refracted wave). In all other cases the formulae
(86.12) give (by virtue of the inequalities $0 < \theta_0, \theta_2 < \tfrac12\pi$ and, as we shall assume, $0 < \gamma_0
< \tfrac12\pi$, $0 < \gamma_1, \gamma_2 < \pi$) the inequalities $\gamma_1 > \gamma_0$, $\gamma_2 < \gamma_0$. Thus the direction of E is turned
away from the plane of incidence on reflection, but towards it on refraction.

A comparison of the two formulae (86.10) shows that, at all angles of incidence except
$\theta_0 = 0$ or $\tfrac12\pi$, $R_\parallel < R_\perp$. Hence, for example, when the incident light is natural the reflected
light is partly polarized, and the predominant direction of the electric field is perpendicular
to the plane of incidence. The refracted light is partly polarized, with the predominant
direction of E lying in the plane of incidence.

The quantities $R_\perp$ and $R_\parallel$ depend quite differently on the angle of incidence. The
coefficient $R_\perp$ increases monotonically with the angle $\theta_0$ from the value (86.8) for $\theta_0 = 0$.
The coefficient $R_\parallel$ takes the same value (86.8) for $\theta_0 = 0$, but as $\theta_0$ increases $R_\parallel$ decreases to
zero at $\theta_0 = \theta_p$ before monotonically increasing.

Here two distinct cases occur. If the reflection is from the “optically denser” medium, i.e.
$\varepsilon_2 > \varepsilon_1$, then $R_\parallel$ and $R_\perp$ increase to the common value of unity at $\theta_0 = \tfrac12\pi$ (grazing
incidence). If, on the other hand, the reflecting medium is “optically less dense” ($\varepsilon_2 < \varepsilon_1$),
both coefficients become equal to unity for $\theta_0 = \theta_r$, where

$$\sin\theta_r = \sqrt{\varepsilon_2/\varepsilon_1} = n_2/n_1; \tag{86.13}$$

$\theta_r$ is called the angle of total reflection. When $\theta_0 = \theta_r$ the angle of refraction $\theta_2 = \tfrac12\pi$, i.e. the
refracted wave is propagated along the surface separating the media.

Reflection from an optically less dense medium at angles $\theta_0 > \theta_r$ requires special
consideration. In this case $k_{2z}$ is purely imaginary (see (86.2)), i.e. the field is damped in
medium 2. The damping of the wave without true absorption (i.e. dissipation of energy)
signifies that the average energy flux from medium 1 into medium 2 is zero (by simple
calculation it can easily be seen that the vector S giving the average energy flux in medium 2
is in the x-direction). That is, all the energy incident on the boundary is reflected back into
medium 1, so that the reflection coefficients are $R_\perp = R_\parallel = 1$. This phenomenon is called
total reflection.† The equality of $R_\perp$ and $R_\parallel$ to unity can, of course, be obtained directly
from Fresnel's formulae (86.4) and (86.6).

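For $\theta_0 > \theta_r$ the proportionality coefficients between $E_1$ and $E_0$ become complex
quantities, of the form $(a - ib)/(a + ib)$. The quantities $R_\perp$ and $R_\parallel$ are given by the squared
moduli of these coefficients, which are equal to unity. The formulae give, besides the ratio
of the magnitudes of the fields in the reflected and incident waves, the difference in their
phases. For this purpose we write $E_{1\perp} = e^{-i\delta_\perp}E_{0\perp}$, $E_{1\parallel} = e^{-i\delta_\parallel}E_{0\parallel}$. Then‡

$$\tan\tfrac12\delta_\perp = \frac{\sqrt{\varepsilon_1\sin^2\theta_0 - \varepsilon_2}}{\sqrt{\varepsilon_1}\cos\theta_0}, \qquad \tan\tfrac12\delta_\parallel = \frac{\sqrt{\varepsilon_1^2\sin^2\theta_0 - \varepsilon_1\varepsilon_2}}{\varepsilon_2\cos\theta_0}. \tag{86.14}$$

Thus total reflection involves a change in the wave phase which is in general different for
the field components parallel and perpendicular to the plane of incidence. Hence, on
reflection of a wave polarized in a plane inclined to the plane of incidence, the reflected
wave will be elliptically polarized. The phase difference $\delta = \delta_\parallel - \delta_\perp$ is easily found to be
such that

$$\tan\tfrac12\delta = \frac{\cos\theta_0\sqrt{\varepsilon_1\sin^2\theta_0 - \varepsilon_2}}{\sqrt{\varepsilon_1}\sin^2\theta_0}. \tag{86.15}$$

The difference is zero only for $\theta_0 = \theta_r$ or $\theta_0 = \tfrac12\pi$.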
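
The phase shifts in total reflection can be evaluated numerically and compared with the complex Fresnel ratios. In the sketch below (function name and sample values are illustrative), the two phases are taken from (86.14), and the difference of the parallel and perpendicular phases is compared with the right-hand side of (86.15); the ordering of that difference is an assumption chosen so that the result is positive for $\varepsilon_1 > \varepsilon_2$.

```python
import cmath
import math


def total_reflection_phases(eps1, eps2, theta0):
    """Phase shifts (delta_perp, delta_par) from (86.14).

    Assumes transparent media with eps1 > eps2 and theta0 beyond the
    angle of total reflection, so that the square root is real.
    """
    s = math.sqrt(eps1 * math.sin(theta0) ** 2 - eps2)
    c = math.cos(theta0)
    delta_perp = 2 * math.atan(s / (math.sqrt(eps1) * c))
    delta_par = 2 * math.atan(math.sqrt(eps1) * s / (eps2 * c))
    return delta_perp, delta_par


eps1, eps2, theta0 = 2.25, 1.0, 1.0      # illustrative glass-to-vacuum case
d_perp, d_par = total_reflection_phases(eps1, eps2, theta0)

# Direct check against the ratio (a - ib)/(a + ib) = exp(-i delta).
a = math.sqrt(eps1) * math.cos(theta0)
b = math.sqrt(eps1 * math.sin(theta0) ** 2 - eps2)
print(d_perp, -cmath.phase((a - 1j * b) / (a + 1j * b)))
print(math.tan(0.5 * (d_par - d_perp)))
```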


PROBLEMS 

Problem 1. Find the manner in which the reflection coefficient approaches unity near the angle of total 
reflection. 

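Solution. We put $\theta_0 = \theta_r - \delta$, where $\delta$ is a small quantity, and expand $\sin\theta_0$ and $\cos\theta_0$ in formulae (86.10)
in powers of $\delta$. The result is

$$R_\perp = 1 - 4\sqrt{2\delta}\,(n^2 - 1)^{-1/4},$$

$$R_\parallel = 1 - 4\sqrt{2\delta}\,n^2(n^2 - 1)^{-1/4},$$

where $n^2 = \varepsilon_1/\varepsilon_2 > 1$. The derivatives $dR/d\delta$ become infinite as $\delta^{-1/2}$ when $\delta \to 0$.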

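Problem 2. Find the reflection coefficient for almost grazing incidence of light from a vacuum on the surface
of a body for which $\varepsilon$ is almost unity.

Solution. Formulae (86.10) give the same reflection coefficient:

$$R_\perp \cong R_\parallel \cong \frac{[\varphi_0 - \sqrt{\varphi_0^2 + \varepsilon - 1}]^4}{(\varepsilon - 1)^2},$$

where $\varphi_0 = \tfrac12\pi - \theta_0$.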


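Problem 3. Determine the reflection coefficient for a wave incident from a vacuum on a medium for which
both $\varepsilon$ and $\mu$ are different from unity.

Solution. Calculations entirely analogous to those given above furnish the result

$$R_\perp = \left|\frac{\mu\cos\theta_0 - \sqrt{\varepsilon\mu - \sin^2\theta_0}}{\mu\cos\theta_0 + \sqrt{\varepsilon\mu - \sin^2\theta_0}}\right|^2, \qquad R_\parallel = \left|\frac{\varepsilon\cos\theta_0 - \sqrt{\varepsilon\mu - \sin^2\theta_0}}{\varepsilon\cos\theta_0 + \sqrt{\varepsilon\mu - \sin^2\theta_0}}\right|^2.$$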

† The reflection coefficient is always equal to unity in reflection from a medium with $\varepsilon$ real and negative. In such
a medium there is again no true absorption, but the wave cannot penetrate into it.
‡ If $(a - ib)/(a + ib) = e^{-i\delta}$, then $\tan\tfrac12\delta = b/a$.
Problem 4. A plane-parallel layer (region 2) lies between a vacuum (region 1) and an arbitrary medium
(region 3). Light polarized parallel or perpendicular to the plane of incidence falls on the layer from the vacuum.
Express the reflection coefficient R in terms of those for semi-infinite media of the substances in regions 2 and 3.

Solution. We denote by $A_0$ and $A_1$ the amplitudes of the field (E or H, whichever is parallel to the layer) in
the incident and reflected waves. The field in the layer consists of the refracted wave (amplitude $A_2$) and the wave
reflected from region 3 (amplitude $A_2'$). The boundary condition between regions 1 and 2 gives

$$A_2' = a(A_1 - r_{12}A_0), \tag{1}$$

where $a$ and $r_{12}$ are constants. In reflection from a semi-infinite medium of the substance in region 2, $A_2'$ is zero,
and so from (1) we have $r_{12} = A_1/A_0$, i.e. $r_{12}$ is the amplitude of reflection in that case. Another equation is
obtained from (1) by interchanging $A_0$ and $A_1$ and replacing $A_2'$ by $A_2$, which corresponds simply to a reversal of
the sign of the z-component of the wave vector:

$$A_2 = a(A_0 - r_{12}A_1). \tag{2}$$

In region 3 there is only the transmitted wave, whose amplitude $A_3$ satisfies the conditions

$$A_2e^{i\psi} = aA_3, \qquad A_2'e^{-i\psi} = -ar_{32}A_3, \tag{3}$$

analogous to (1), (2) with $A_1 = 0$. The exponential factors take account of the change in the wave phase over the
thickness h of the layer, with

$$\psi = (\omega h/c)\sqrt{\varepsilon_2 - \sin^2\theta_0}. \tag{4}$$

Eliminating $A_3$ from equations (3), we obtain

$$A_2'e^{-i\psi} = r_{23}A_2e^{i\psi}, \tag{5}$$

where $r_{23} = -r_{32}$.

From equations (1), (2) and (5) we find the amplitude of reflection from the layer:

$$r = \frac{A_1}{A_0} = \frac{r_{12}e^{-2i\psi} + r_{23}}{e^{-2i\psi} + r_{12}r_{23}}, \tag{6}$$

and the reflection coefficient $R = |r|^2$. The significance of $r_{23}$ is found from the fact that, for h = 0, r must be the
amplitude of reflection $r_{13}$ from a semi-infinite medium of the substance in region 3. Hence

$$r_{23} = \frac{r_{12} - r_{13}}{r_{12}r_{13} - 1}. \tag{7}$$

Formulae (6) and (7) give the required solution. It should be emphasized that their derivation involves no
assumptions concerning the properties of regions 2 and 3, which may be either transparent or not.

If regions 2 and 3 are transparent, then $\psi$, $r_{12}$ and $r_{23}$ are all real, and $r_{23}$ is the amplitude of reflection at the
boundary between semi-infinite media of the substances in regions 2 and 3. From (6) we have

$$R = \frac{(r_{12} + r_{23})^2 - 4r_{12}r_{23}\sin^2\psi}{(r_{12}r_{23} + 1)^2 - 4r_{12}r_{23}\sin^2\psi}. \tag{8}$$

As $\psi$ varies, R varies between the limits $[(r_{12} + r_{23})/(r_{12}r_{23} + 1)]^2$ and $[(r_{12} - r_{23})/(r_{12}r_{23} - 1)]^2$. For normal
incidence, $r_{12} = (n_1 - n_2)/(n_1 + n_2)$, and $r_{23}$ and $r_{13}$ are given by similar relations. If $n_2^2 = n_1n_3$, then $r_{12} = r_{23}$,
and R may be zero for some value of the thickness of the layer. If region 3 is a vacuum, then $r_{13} = 0$, $r_{23} = -r_{12}$,
and (6) gives

$$r = -\frac{\sinh i\psi}{\sinh[i\psi + \log(-r_{12})]}. \tag{9}$$

If also region 2 is transparent,

$$R = \frac{4R_{12}\sin^2\psi}{(1 - R_{12})^2 + 4R_{12}\sin^2\psi},$$

where $R_{12} = r_{12}^2$.

The transmission coefficient D for the layer (between vacua) is 1 − R only if region 2 is transparent. Otherwise
D must be calculated from equations (1)–(3), putting $r_{32} = r_{12}$. The amplitude of transmission d is

$$d = \frac{A_3}{A_0} = \frac{1 - r_{12}^2}{e^{-i\psi} - r_{12}^2e^{i\psi}}, \tag{10}$$

and the transmission coefficient $D = |d|^2$.
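
The result of Problem 4 lends itself to a quick numerical check. The sketch below assumes that formula (6) has the standard single-layer form $r = (r_{12}e^{-2i\psi} + r_{23})/(e^{-2i\psi} + r_{12}r_{23})$ and uses normal incidence on transparent media; the refractive indices and function names are hypothetical values chosen for illustration.

```python
import cmath
import math


def r_normal(n_a, n_b):
    """Amplitude of reflection at normal incidence from medium a onto medium b."""
    return (n_a - n_b) / (n_a + n_b)


def layer_reflection(r12, r23, psi):
    """Amplitude of reflection from the layer, in the assumed form of formula (6)."""
    e = cmath.exp(-2j * psi)
    return (r12 * e + r23) / (e + r12 * r23)


n1, n2, n3 = 1.0, 1.5, 2.25          # chosen so that n2**2 == n1 * n3
r12, r23 = r_normal(n1, n2), r_normal(n2, n3)

for psi in (0.0, math.pi / 4, math.pi / 2):
    print(psi, abs(layer_reflection(r12, r23, psi)) ** 2)
```

With $n_2^2 = n_1n_3$ the two boundary amplitudes are equal, and the reflection coefficient falls to zero at $\psi = \tfrac12\pi$, which for normal incidence corresponds to a layer one quarter of a wavelength thick (the wavelength being that in the layer). For $\psi = 0$ the layer disappears and the value for a direct boundary between regions 1 and 3 is recovered.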
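Problem 5. Determine the reflection and transmission coefficients for light incident normally on a slab with
very large complex permittivity $\varepsilon$.

Solution. In this case $r_{12} = (1 - \sqrt\varepsilon)/(1 + \sqrt\varepsilon) \cong -(1 - 2/\sqrt\varepsilon)$, and formula (9) of Problem 4
gives $r = -[1 - (2/\sqrt\varepsilon)\coth i\psi]^{-1}$, $\psi = (\omega h/c)\sqrt\varepsilon$. If the slab is so thin that $\omega h/c \ll 1/\sqrt{|\varepsilon|}$, then we can
write $r = -[1 + 2ic/\varepsilon\omega h]^{-1}$, and distinguish two cases:

$$R = 1 - 4c\varepsilon''/\omega h|\varepsilon|^2 \quad \text{for } 1/|\varepsilon| \ll \omega h/c \ll 1/\sqrt{|\varepsilon|},$$

$$R = \omega^2h^2|\varepsilon|^2/4c^2 \quad \text{for } \omega h/c \ll 1/|\varepsilon|.$$

The transmission coefficient is, by formula (10),

$$d = -\frac{2}{\sqrt\varepsilon\,\sinh i\psi} \quad \text{for } \omega h/c \sim 1/\sqrt{|\varepsilon|},$$

$$d = (1 - i\varepsilon\omega h/2c)^{-1} \quad \text{for } \omega h/c \ll 1/\sqrt{|\varepsilon|}.$$

Again two cases can be distinguished:

$$D = 4c^2/\omega^2h^2|\varepsilon|^2 \quad \text{for } 1/|\varepsilon| \ll \omega h/c \ll 1/\sqrt{|\varepsilon|},$$

$$D = 1 - \varepsilon''\omega h/c \quad \text{for } \omega h/c \ll 1/|\varepsilon|.$$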


§87. The surface impedance of metals 


The permittivity of metals is, in magnitude, large compared with unity at low frequencies 
(as $\omega \to 0$, it tends to infinity as $1/\omega$). The wavelength $\delta \sim c/\omega\sqrt{|\varepsilon|}$ in metals† is then small
compared with the wavelength $\lambda \sim c/\omega$ in vacuum. If $\delta$ (but not necessarily $\lambda$) is also small
compared with the radii of curvature of the metal surface, the problem of the reflection of 
arbitrary electromagnetic waves from the metal can be considerably simplified. 

The smallness of $\delta$ implies that the derivatives of the field components inside the metal
along the normal to the surface are large compared with the tangential derivatives. The
field inside the metal near the surface can therefore be regarded as the field of a plane wave,
and hence the fields $\mathbf{E}_t$ and $\mathbf{H}_t$ are related by

$$\mathbf{E}_t = \sqrt{\mu/\varepsilon}\,\mathbf{H}_t \times \mathbf{n}, \tag{87.1}$$

where n is a unit vector along the inward normal to the surface. Since $\mathbf{E}_t$ and $\mathbf{H}_t$ are
continuous, their values outside the metal near the surface must be related in the same way.
As M. A. Leontovich (1948) has pointed out, the equation (87.1) may be used as a boundary
condition in determining the field outside the metal. Thus the problem of determining the
external electromagnetic field can be solved without considering the field inside the metal.
The quantity $\sqrt{\mu/\varepsilon}$ is called the surface impedance‡ of the metal, and we denote it
by $\zeta = \zeta' + i\zeta''$:

$$\zeta = \sqrt{\mu/\varepsilon}. \tag{87.2}$$

In the frequency range where $\varepsilon$ can be expressed in terms of the ordinary conductivity of the
metal, we have

$$\zeta = (1 - i)\sqrt{\mu\omega/8\pi\sigma}; \tag{87.3}$$

this formula, with $\mu = 1$, has already been given in §59.

The (time) averaged energy flux through the surface of the metal is

$$\bar{\mathbf{S}} = (c/8\pi)\operatorname{re}(\mathbf{E}_t \times \mathbf{H}_t^*) = c\zeta'|\mathbf{H}_t|^2\mathbf{n}/8\pi. \tag{87.4}$$


† Large values of $\sqrt{\varepsilon(\omega)}$ are almost always complex. The electromagnetic field is damped inside the body, so
that the wavelength in the body is also the depth of penetration of the field. If $\varepsilon(\omega)$ is expressed in terms of the
conductivity by (77.9), the quantity $\delta$ is the same as the penetration depth used in §59.

‡ This name is also given to the quantity $c\zeta/4\pi$.
This is the energy which enters the metal and is dissipated therein. Hence we see, in
particular, that

$$\zeta' > 0. \tag{87.5}$$


This inequality determines the sign of the root in (87.2). 
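
The choice of root fixed by (87.5) can be illustrated numerically. The sketch below assumes that (77.9) is the usual low-frequency relation $\varepsilon = 4\pi i\sigma/\omega$ (Gaussian units) and compares the root of (87.2) having $\zeta' > 0$ with the closed form (87.3); the function names and the sample conductivity and frequency are illustrative only.

```python
import cmath
import math


def surface_impedance(eps, mu=1.0):
    """Root of (87.2) selected by the condition zeta' > 0 of (87.5)."""
    zeta = cmath.sqrt(mu / eps)
    # cmath.sqrt returns the root with non-negative real part; the flip
    # below only matters if that real part is slightly negative or zero.
    return zeta if zeta.real > 0 else -zeta


def impedance_from_conductivity(sigma, omega, mu=1.0):
    """Closed form (87.3)."""
    return (1 - 1j) * math.sqrt(mu * omega / (8 * math.pi * sigma))


sigma, omega = 5.0e17, 1.0e10            # illustrative values, in s^-1
eps = 4j * math.pi * sigma / omega       # assumed form of (77.9)
print(surface_impedance(eps), impedance_from_conductivity(sigma, omega))
```

Both expressions give $\zeta' = -\zeta'' > 0$, with $|\zeta| \ll 1$ as required for the boundary condition to be useful.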

As the frequency increases, the depth of penetration $\delta$ becomes of the same order as the
mean free path $l$ of the conduction electrons.† In this case the spatial non-uniformity of the
field renders impossible a macroscopic description of it in terms of the permittivity $\varepsilon$.
However, a boundary condition of the form

$$\mathbf{E}_t = \zeta\,\mathbf{H}_t \times \mathbf{n} \tag{87.6}$$

still holds at such frequencies. The field inside the metal near the surface may again be
regarded as a plane wave, although it is no longer described by the usual macroscopic
Maxwell's equations. In such a wave the fields E and H must be linearly related, and the
only possible linear relation between the axial vector H and the polar vector E is (87.6). The
coefficient $\zeta$ in this formula is the only quantity characterizing the metal which must be
known in order to find the external electromagnetic field. Its calculation, however,
requires the use of kinetic theory (see PK, §86).

When the frequency increases further (usually into the infra-red region), the macroscopic
description of the field again becomes possible, and $\varepsilon$ is again meaningful. The
reason is that, on absorbing a large quantum $\hbar\omega$, a conduction electron acquires a large
amount of energy, and its mean free path is therefore reduced, the inequality $l \ll \delta$ being
consequently again fulfilled. The impedance $\zeta$ is again inversely proportional to $\sqrt\varepsilon$.‡ In
this frequency range the real part of $\varepsilon(\omega)$ is large and negative, and its imaginary part is
small. The inequality $l \ll \delta$ is the condition for both $\varepsilon'$ and $\varepsilon''$ to be macroscopically
significant. The macroscopic significance of the large quantity $\varepsilon'$ alone, however, can be
ensured by the fulfilment of the less stringent condition $v/\omega \ll \delta$, where $v$ is the velocity of
the conduction electrons in the metal. If this condition holds, the spatial non-uniformity of
the field may be neglected in considering the motion of the electrons.§

The inequality $\zeta' > 0$ is always satisfied by the real part of the impedance. If formula
(87.2) holds, we can also draw certain conclusions concerning the sign of $\zeta''$. For example, if
the dispersion of $\varepsilon$ is more important than that of $\mu$ (i.e. if $\mu$ may be taken as real), the
condition $\varepsilon'' > 0$ gives $\zeta'\zeta'' < 0$ and, since $\zeta' > 0$, $\zeta'' < 0$. This is the most usual case. If the
dispersion of $\zeta$ is determined by that of $\mu$, however, a similar argument shows that $\zeta'' > 0$.

The concept of impedance can also be applied to superconductors. A characteristic
property of superconductors is that the penetration depth $\delta$ is small even in the static case
($\omega = 0$). At fairly low frequencies the magnetic field distribution can be taken to be the
same as the static distribution. To determine the electric field we use the equation
curl E = $i\omega$H/c, taking the z-axis along the outward normal to the surface of the


† The mean free path depends considerably on the temperature of the metal. In practice, the temperatures
considered are usually very low, in the helium range, and the phenomena under consideration occur in the range
of very short radio waves.

‡ It should be borne in mind, however, that equation (87.6) can be used as a boundary condition only while $|\varepsilon|$
is large (i.e. $\zeta$ is small), and certainly does not hold at optical frequencies. We assume that $\mu \sim 1$, and so small $\zeta$
correspond to large $|\varepsilon|$. If $\mu \neq 1$, the inequality $\delta \ll \lambda$ must be fulfilled if the boundary condition (87.6) is valid, and
therefore we must have $\sqrt{\mu\varepsilon} \gg 1$, so that $\zeta = \sqrt{\mu/\varepsilon}$ may not be small.

§ This situation is more fully discussed in PK, §87.
superconductor. Neglecting the tangential derivatives in comparison with the large
z-derivatives, we have $\partial E_x/\partial z = i\omega H_y/c$, and similarly for $E_y$. Integrating with respect to z
through the body gives

$$E_x(0) = \frac{i\omega}{c}\int_{-\infty}^{0}H_y\,dz,$$

$E_x(0)$ being the value of $E_x$ for z = 0, i.e. at the surface of the body. We quantitatively define
the penetration depth by the relation

$$\int_{-\infty}^{0}H_y\,dz = H_y(0)\,\delta. \tag{87.7}$$

Then $E_x(0) = i\omega H_y(0)\delta/c$. Comparing this with the boundary condition (87.6), we find that
the impedance of a superconductor (in the frequency range considered, which in practice
extends to about the centimetre radio wave-band) is given by

$$\zeta = -i\omega\delta/c. \tag{87.8}$$

This expression is the first term in an expansion of $\zeta(\omega)$ in powers of the frequency, and the
expansion for superconductors thus begins with a term in $\omega$. The next term, which is
proportional to $\omega^2$ and is real, is the first term in the expansion of $\zeta'$.†
The impedance $\zeta(\omega)$, regarded as a function of the complex variable $\omega$, has many
properties analogous to those of the function $\varepsilon(\omega)$ (V. L. Ginzburg, 1954). The boundary
condition, which for a monochromatic wave has the form (87.6), must in general be taken
as the operator relation

$$\mathbf{E}_t = \hat\zeta\,\mathbf{H}_t \times \mathbf{n}, \tag{87.9}$$

expressing the value of $\mathbf{E}_t$ at any instant in terms of the values of $\mathbf{H}_t$ at all previous instants
(cf. §77). As in §82, it therefore follows that the function $\zeta(\omega)$ is regular in the upper half-plane
of $\omega$, including the real axis except for the point $\omega = 0$. The condition that $\mathbf{E}_t$ be real
when $\mathbf{H}_t$ is real gives $\zeta(-\omega^*) = \zeta^*(\omega)$. Finally, since the energy dissipation is determined
by the real part of $\zeta(\omega)$ (and not by the imaginary part as for $\varepsilon(\omega)$), it follows that $\zeta'(\omega)$ is
positive, and does not vanish for any real $\omega$ except $\omega = 0$. Arguments similar to those given
in §82 then lead to the conclusion that re $\zeta(\omega) > 0$ throughout the upper half-plane. Hence,
in particular, $\zeta(\omega)$ has no zeros in the upper half-plane.

The regularity of $\zeta(\omega)$ in the upper half-plane again leads to Kramers and Kronig's
formulae. A particularly important formula is

$$\zeta''(\omega) = -\frac{1}{\pi}\,P\!\int_{-\infty}^{\infty}\frac{\zeta'(x) - 1}{x - \omega}\,dx.$$

Using the fact that $\zeta'(x)$ is even, we can also write

$$\zeta''(\omega) = -\frac{2\omega}{\pi}\,P\!\int_{0}^{\infty}\frac{\zeta'(x)}{x^2 - \omega^2}\,dx; \tag{87.10}$$

the term −1 in the numerator of the integrand may be omitted, since the principal value of
the integral of $1/(x^2 - \omega^2)$ is zero.

† The microscopic theory shows that the $\omega^2$ term in the impedance also contains a factor varying
logarithmically with $\omega$; see PK, §§96, 97.

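The above statements concerning the function $\zeta(\omega)$ are equally applicable to the
reciprocal function $1/\zeta(\omega)$; the operator $\hat\zeta^{-1}$ converts $\mathbf{E}_t$ into $\mathbf{H}_t \times \mathbf{n}$. In particular, (87.10)
becomes

$$[\zeta^{-1}(\omega)]'' = -\frac{2\omega}{\pi}\,P\!\int_{0}^{\infty}\frac{[\zeta^{-1}(x)]'}{x^2 - \omega^2}\,dx. \tag{87.11}$$

For small $\zeta$ this formula may be more useful than (87.10). In the form (87.11), however, it is
not applicable to superconductors, for which $\zeta^{-1}$, according to (87.8), has a pole at $\omega = 0$.
A simple modification in the derivation, analogous to that which changes (82.7) into (82.9),
gives

$$[\zeta^{-1}(\omega)]'' = -\frac{2\omega}{\pi}\,P\!\int_{0}^{\infty}\frac{[\zeta^{-1}(x)]'}{x^2 - \omega^2}\,dx + \frac{c}{\omega\delta}. \tag{87.12}$$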

To conclude this section we shall discuss, as an example of the use of the impedance, the 
reflection of a plane electromagnetic wave incident from a vacuum on the plane surface of a 
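metal with surface impedance $\zeta$. If the vector E is polarized perpendicular to the plane of
incidence, the boundary condition (87.6) gives

$$E_0 + E_1 = \zeta(H_0 - H_1)\cos\theta_0 = \zeta(E_0 - E_1)\cos\theta_0,$$

the notation being the same as in §86. Hence, since $\zeta$ is small, we have $E_1/E_0
= -(1 - 2\zeta\cos\theta_0)$, and the reflection coefficient is

$$R_\perp = 1 - 4\zeta'\cos\theta_0. \tag{87.13}$$

If, on the other hand, $\mathbf{E}_0$ lies in the plane of incidence, the boundary condition in the
form $\zeta\mathbf{H}_t = \mathbf{n} \times \mathbf{E}_t$ gives

$$\zeta(H_0 + H_1) = (E_0 - E_1)\cos\theta_0 = (H_0 - H_1)\cos\theta_0,$$

whence the reflection coefficient is

$$R_\parallel = \left|\frac{\cos\theta_0 - \zeta}{\cos\theta_0 + \zeta}\right|^2. \tag{87.14}$$

For angles of incidence not close to $\tfrac12\pi$,

$$R_\parallel = 1 - 4\zeta'\sec\theta_0. \tag{87.15}$$

If, on the other hand, $\varphi_0 = \tfrac12\pi - \theta_0 \ll 1$, then

$$R_\parallel = \left|\frac{\varphi_0 - \zeta}{\varphi_0 + \zeta}\right|^2. \tag{87.16}$$

This expression takes a minimum value $(|\zeta| - \zeta')/(|\zeta| + \zeta')$ for $\varphi_0 = |\zeta|$.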
(87.16) 
Except for the special case (87.16), the reflection coefficient for a surface with small $\zeta$ is
close to unity. A surface with $\zeta = 0$ is perfectly conducting and also perfectly reflecting. The
boundary condition at such a surface is simply $\mathbf{E}_t = 0$, analogously to that for the
electrostatic field at the surface of a conductor. In a variable field, however, the fulfilment of
this condition necessarily implies the fulfilment of a certain condition on the magnetic field:
the equations $i\omega\mathbf{H}/c$ = curl E and $\mathbf{E}_t = 0$ on the surface imply that $H_n = 0$ there. Thus the
normal component of the magnetic field must be zero on a perfectly conducting surface in a 
variable electromagnetic field. In this respect such a surface resembles the surface of a 
superconductor in a static magnetic field. 


PROBLEM 

Determine the intensity of thermal radiation (of a given frequency) from a plane surface of small impedance. 

Solution. According to Kirchhoff's law, the intensity $dI$ of thermal radiation into an element of solid angle
$do$ from an arbitrary surface is related to the intensity $dI_0$ of radiation from the surface of a black body by
$dI = (1 - R)\,dI_0$, where R is the reflection coefficient for natural light incident on the surface concerned.
Calculating $R = \tfrac12(R_\perp + R_\parallel)$ from formulae (87.13) and (87.14) and using the isotropy of radiation from a black
surface ($dI_0 = I_0\,do/2\pi$), we have




Effecting the integration and omitting terms of higher order in $\zeta$ we find

In particular, for a metal whose impedance is given by formula (87.3) ($\mu = 1$), we have


§88. The propagation of waves in an inhomogeneous medium 

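Let us consider the propagation of electromagnetic waves in a medium which is
electrically inhomogeneous but isotropic. In Maxwell's equations curl E = $i\omega$H/c,
curl H = $-i\varepsilon\omega$E/c (we put everywhere $\mu = 1$), $\varepsilon$ is a function of the coordinates.
Substituting for H from the first equation in the second, we obtain for E the equation

$$\Delta\mathbf{E} + (\varepsilon\omega^2/c^2)\mathbf{E} - \operatorname{grad}\operatorname{div}\mathbf{E} = 0. \tag{88.1}$$

Elimination of E gives for H the equation

$$\Delta\mathbf{H} + (\varepsilon\omega^2/c^2)\mathbf{H} + (1/\varepsilon)\operatorname{grad}\varepsilon \times \operatorname{curl}\mathbf{H} = 0. \tag{88.2}$$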

These equations are considerably simplified in the one-dimensional case, where $\varepsilon$ varies
only in one direction in space. We take this direction as the z-axis, and consider a wave
whose direction of propagation lies in the xz-plane. In such a wave all quantities are
independent of y, and the uniformity of the medium in the x-direction means that the
dependence on x can be taken as being through a factor $e^{i\kappa x}$, with $\kappa$ a constant. For $\kappa = 0$
the field depends only on z, i.e. we have a wave which is said to pass normally through a
layer of matter in which $\varepsilon = \varepsilon(z)$. If $\kappa \neq 0$, the wave is said to pass obliquely.

For $\kappa \neq 0$ two independent cases of polarization must be distinguished. In one, the
vector E is perpendicular to the plane of propagation of the wave (i.e. it is in the y-direction),
and the magnetic field H accordingly lies in that plane. Equation (88.1) becomes

$$\frac{d^2E}{dz^2} + \left(\frac{\varepsilon\omega^2}{c^2} - \kappa^2\right)E = 0. \tag{88.3}$$

In the other case, the field H is in the y-direction, and E lies in the plane of propagation.
Here it is more convenient to start from equation (88.2), which gives

$$\frac{d}{dz}\left(\frac{1}{\varepsilon}\frac{dH}{dz}\right) + \left(\frac{\omega^2}{c^2} - \frac{\kappa^2}{\varepsilon}\right)H = 0. \tag{88.4}$$


We shall call these two types of wave E waves and H waves respectively. 

The equations can be solved in a general form in the important case where the
conditions of propagation approximate to those of geometrical optics. In what follows
we shall assume that the function $\varepsilon(z)$ is real.† In equation (88.3) the quantity $2\pi/\sqrt f$, where
$f(z) = \varepsilon\omega^2/c^2 - \kappa^2$, plays the part of a wavelength in the z-direction. The approximation of
geometrical optics corresponds to the inequality

$$\left|\frac{d}{dz}\frac{1}{\sqrt f}\right| \ll 1, \tag{88.5}$$

and the two independent solutions of equation (88.3) are of the form

$$E = \frac{\text{constant}}{f^{1/4}}\exp\left(\pm i\int\sqrt f\,dz\right). \tag{88.6}$$


The condition (88.5) is certainly not fulfilled near any reflection point, where $f = 0$. Let
z = 0 be such a point, with $f > 0$ for z < 0 and $f < 0$ for z > 0. At sufficiently great
distances on either side of z = 0, the solution of equation (88.3) is of the form (88.6), but to
establish the relation between the coefficients in the solutions for z > 0 and z < 0 it is
necessary to examine the exact solution of equation (88.3) near z = 0. In the neighbourhood
of this point, $f(z)$ can be expanded as a power series in z: $f = -\alpha z$. The solution of the
equation $d^2E/dz^2 - \alpha zE = 0$ which is finite for all z is

$$E = (A/\alpha^{1/6})\,\Phi(\alpha^{1/3}z), \tag{88.7}$$

where

$$\Phi(\xi) = \frac{1}{\sqrt\pi}\int_0^\infty\cos(\tfrac13u^3 + u\xi)\,du$$

is the Airy function; we everywhere omit the factor $e^{-i\omega t + i\kappa x}$ in E.‡ The asymptotic form of
the solution of equation (88.3) for large $|z|$ is

$$E = \frac{A}{f^{1/4}}\cos\left(\int_0^z\sqrt f\,dz + \tfrac14\pi\right) \quad \text{for } z < 0,$$

$$E = \frac{A}{2|f|^{1/4}}\exp\left(-\int_0^z\sqrt{|f|}\,dz\right) \quad \text{for } z > 0, \tag{88.8}$$

† Equation (88.3) bears a formal resemblance to Schrödinger's equation for one-dimensional motion of a
particle in quantum mechanics, and the approximation of geometrical optics corresponds to the quasi-classical
case. Here we shall give the final results; their derivation may be found in QM, Chapter VII.

‡ Here we make use of the Airy function as defined in other volumes of the Course of Theoretical Physics. It is
now more usually defined as $\operatorname{Ai}(\xi) = \Phi(\xi)/\sqrt\pi$.

with the same coefficient A as in (88.7). The first of these expressions represents the 
stationary wave obtained by superposing the wave incident in the positive z-direction and 
the wave reflected from the plane z = 0. The amplitudes of these waves are both equal to 
$\tfrac12A/f^{1/4}$, i.e. the reflection coefficient is unity. Only an exponentially damped field penetrates
into the region z > 0.

As the reflection point is approached, the wave amplitude increases, as is shown by the
factor $f^{1/4}$ in the denominators in (88.8). To determine the field in the immediate vicinity of
that point, the expression (88.7) must be used. This function decreases monotonically into
the region z > 0 and oscillates in the region z < 0, the successive maxima of $|E|$ continually
decreasing. The first and highest maximum is reached at $\alpha^{1/3}z = -1.02$, and its value is
$E = 0.949\,A\alpha^{-1/6}$.
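
The position and height of the first maximum can be reproduced with a few lines of code. The sketch below evaluates $\Phi(\xi) = \sqrt\pi\,\operatorname{Ai}(\xi)$ from the Maclaurin series of the Airy function; the constants `AI_0` and `AI_PRIME_0` are the standard values of $\operatorname{Ai}(0)$ and $\operatorname{Ai}'(0)$, and the function names and the simple grid search are choices made for this illustration, not part of the text.

```python
import math

AI_0 = 0.3550280538878172         # Ai(0)
AI_PRIME_0 = -0.2588194037928068  # Ai'(0)


def airy_ai(x, terms=60):
    """Ai(x) from its power series; adequate for moderate |x| (here |x| <= 2)."""
    f_term, g_term = 1.0, x        # series solutions of y'' = x y with y(0)=1 and y'(0)=1
    f_sum, g_sum = f_term, g_term
    for k in range(terms):
        f_term *= x ** 3 / ((3 * k + 2) * (3 * k + 3))
        g_term *= x ** 3 / ((3 * k + 3) * (3 * k + 4))
        f_sum += f_term
        g_sum += g_term
    return AI_0 * f_sum + AI_PRIME_0 * g_sum


def phi(xi):
    """Airy function in the normalization used here: Phi = sqrt(pi) * Ai."""
    return math.sqrt(math.pi) * airy_ai(xi)


def first_maximum(lo=-2.0, hi=0.0, steps=2000):
    """Grid search for the largest value of Phi on [lo, hi]."""
    best_xi, best_val = lo, phi(lo)
    for i in range(1, steps + 1):
        xi = lo + (hi - lo) * i / steps
        val = phi(xi)
        if val > best_val:
            best_xi, best_val = xi, val
    return best_xi, best_val


print(first_maximum())
```

The search returns the argument and value of the first maximum, to be compared with the figures −1.02 and 0.949 quoted above (for $A\alpha^{-1/6} = 1$).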

So far we have spoken of solutions for E waves. It is easy to see that, in the 
approximation of geometrical optics, entirely similar formulae are valid for H waves. If we 
substitute in equation (88.4) $H = u\sqrt\varepsilon$, the derivatives of $\varepsilon$ appear as products with $u$, but
not with $u'$; neglecting therefore the terms containing these derivatives, which are small by
(88.5), we obtain for the function $u(z)$ the equation

$$\frac{d^2u}{dz^2} + \left(\frac{\varepsilon\omega^2}{c^2} - \kappa^2\right)u = 0,$$

which is of the same form as (88.3). Hence the formulae for H differ from (88.6)–(88.8) only
by a factor $\sqrt\varepsilon$.

A curious difference in the behaviour of the two types of wave occurs when an obliquely
incident wave ($\kappa \neq 0$) is reflected from a layer in which $\varepsilon(z)$ passes through zero. The
reflection takes place from the plane on which $f(z) = \varepsilon\omega^2/c^2 - \kappa^2 = 0$, i.e. the wave does
not “reach” the plane where $\varepsilon = 0$. The E wave penetrates beyond the plane only as an
exponentially damped field. When an H wave is reflected, however, there is superposed on a
similar damped field a strong local field near the plane on which $\varepsilon = 0$ (K. Försterling,
1949).† This may be shown as follows.

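Let $\varepsilon = 0$ at the point $z = 0$. Near this point, we write

$$\varepsilon = -az, \tag{88.9}$$

and equation (88.4) takes the form

$$\frac{d^2H}{dz^2} - \frac{1}{z}\frac{dH}{dz} - \left(\frac{\omega^2a}{c^2}z + \kappa^2\right)H = 0. \tag{88.10}$$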


† This is a singularity of (88.4), and geometrical optics is therefore invalid near it, although $f(z)$ does not vanish
and (88.5) may be valid.
According to the general theory of linear differential equations, one of the solutions of this
equation, which we call $H_1(z)$, has no singularity at $z = 0$, and its expansion in powers of $z$
begins with $z^2$:

$$H_1(z) = z^2 + \ldots.$$

The other independent solution has a logarithmic singularity, and its expansion is

$$H_2(z) = H_1(z)\log\kappa z + \frac{2}{\kappa^2} + \ldots.$$

The parameter $a$ occurs only in the subsequent terms of the expansions. To find the field
near $z = 0$, it is not necessary to discuss the choice of a linear combination of $H_1$ and $H_2$
that satisfies the conditions at infinity. We need only note that as $z \to 0$ this combination
tends to a constant ($H_0$, say) and has a logarithmic singularity

$$H \approx H_0(1 + \tfrac{1}{2}\kappa^2z^2\log\kappa z);$$

here, the leading singularity term has been written together with the constant. The electric
field is determined from $H_y = H$ by Maxwell's equations $E_x = -(ic/\varepsilon\omega)\,\partial H/\partial z$,
$E_z = (ic/\varepsilon\omega)\,\partial H/\partial x$. The dependence of $H$ on $x$ is given by the factor $e^{i\kappa x}$, and the leading
terms in $E_x$ and $E_z$ are therefore

$$E_x = H_0\,(i\kappa^2c/a\omega)\log\kappa z, \qquad E_z = H_0\,(\kappa c/a\omega)(1/z). \tag{88.11}$$

These become infinite as $z \to 0$.

In reality, of course, the absorption which must be present in the medium, even though
slight, means that the field attains large but not infinite values compared with the weak
field in the adjoining region. It is noteworthy, however, that even an infinitesimal
imaginary part of $\varepsilon$ causes a finite dissipation of energy. Let $\varepsilon = -az + i\delta$ with $\delta \to +0$.
Then the analytical continuation of the logarithm in (88.11) from the right-hand to the left-
hand half of the z axis must be carried out from below in the complex z-plane, and for z < 0
we have

$$E_x = H_0\,(i\kappa^2c/a\omega)(\log|\kappa z| - i\pi).$$

The (time) averaged energy flux along the z-axis,

$$\bar{S}_z = (c/8\pi)\,\mathrm{re}\,(E_xH_y^*)$$

(see (59.9a)), is zero for z > 0; for z < 0, the presence of a real part of $E_x$ gives a non-zero
energy flux towards the plane z = 0, where this energy is dissipated:†

$$\bar{S}_z = \kappa^2c^2H_0^2/8\omega a \tag{88.12}$$

(V. B. Gil'denburg, 1963).


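† This result can also be derived from the expression (80.4) for the energy dissipated in unit volume:

$$Q = \frac{\omega\varepsilon''|\mathbf{E}|^2}{8\pi} = \frac{\kappa^2c^2H_0^2}{8\pi\omega}\,\frac{\delta}{a^2z^2 + \delta^2} = \kappa^2c^2H_0^2\,\delta(z)/8\omega a,$$

where $\delta(z)$ in the last expression is the delta function. Integration with respect to z gives (88.12).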
PROBLEM


A surface H wave can be propagated along a plane boundary between two media whose permittivities $\varepsilon_1$ and
$-|\varepsilon_2|$ are of opposite signs. The wave is damped in both media. Determine the relation between the frequency and
the wave number.

Solution. We take the boundary surface as the xy-plane, the wave being propagated in the x-direction and
the field H being in the y-direction. Let the half-space z > 0 contain the medium with the positive permittivity $\varepsilon_1$,
and the half-space z < 0 that with the negative permittivity $\varepsilon_2$. We seek the field in the wave damped as $z \to \pm\infty$ in
the form

$$H_1 = H_0e^{ikx - \kappa_1z}, \qquad \kappa_1 = \sqrt{k^2 - \omega^2\varepsilon_1/c^2} \quad \text{for } z > 0,$$

$$H_2 = H_0e^{ikx + \kappa_2z}, \qquad \kappa_2 = \sqrt{k^2 + \omega^2|\varepsilon_2|/c^2} \quad \text{for } z < 0,$$

where $k$, $\kappa_1$ and $\kappa_2$ are real. The boundary condition that $H_y = H$ be continuous is already satisfied, and the
continuity condition on $E_x$ gives $(1/\varepsilon_1)\,\partial H_1/\partial z = (1/\varepsilon_2)\,\partial H_2/\partial z$ for $z = 0$, or $\kappa_1/\varepsilon_1 = \kappa_2/|\varepsilon_2|$. This equation can
be satisfied if $\varepsilon_1 < |\varepsilon_2|$ (and if $\varepsilon_1\varepsilon_2 < 0$, as has been assumed). The relation between $k$ and $\omega$ is

$$k^2 = \omega^2\varepsilon_1|\varepsilon_2|/c^2(|\varepsilon_2| - \varepsilon_1).$$

It is easily seen that surface E waves cannot be propagated. 
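
As a numerical illustration of this solution, the following sketch evaluates the wave number and the two damping constants for given permittivities and checks the boundary condition $\kappa_1/\varepsilon_1 = \kappa_2/|\varepsilon_2|$. The function name `surface_wave` and the sample values are hypothetical; Gaussian units are assumed.

```python
import math

C = 2.99792458e10  # speed of light in vacuum, cm/s (Gaussian units assumed)

def surface_wave(omega, eps1, eps2_abs):
    """Return (k, kappa1, kappa2) for a surface H wave on the boundary
    between media with permittivities eps1 > 0 and -eps2_abs < 0."""
    if not 0 < eps1 < eps2_abs:
        raise ValueError("a surface wave requires 0 < eps1 < |eps2|")
    k = (omega / C) * math.sqrt(eps1 * eps2_abs / (eps2_abs - eps1))
    kappa1 = math.sqrt(k**2 - omega**2 * eps1 / C**2)      # damping for z > 0
    kappa2 = math.sqrt(k**2 + omega**2 * eps2_abs / C**2)  # damping for z < 0
    return k, kappa1, kappa2

k, kappa1, kappa2 = surface_wave(omega=1.0e15, eps1=1.0, eps2_abs=4.0)
# The continuity condition on E_x should hold to rounding accuracy.
print(k, kappa1 / 1.0, kappa2 / 4.0)
```

The guard on `eps1 < eps2_abs` mirrors the condition stated in the solution: without it the expression under the square root for $k$ is negative and no real wave number exists.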


§89. The reciprocity principle 

The emission of monochromatic electromagnetic waves from a source consisting of a 
thin wire in an arbitrary medium is described by the equations 

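$$\mathrm{curl}\,\mathbf{E} = i\omega\mathbf{B}/c, \qquad \mathrm{curl}\,\mathbf{H} = -i\omega\mathbf{D}/c + 4\pi\mathbf{j}_{\mathrm{ex}}/c, \tag{89.1}$$

where $\mathbf{j}_{\mathrm{ex}}$ is the density of periodic currents flowing in the wire which are extraneous to the
medium.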

Let two different sources (of the same frequency) be placed in the medium; we denote by 
the suffixes 1 and 2 the fields due to these sources separately. The medium may be 
inhomogeneous and anisotropic. The only assumption which we shall make concerning 
the properties of the medium is that the linear relations $D_i = \varepsilon_{ik}E_k$, $B_i = \mu_{ik}H_k$ hold, the
tensors $\varepsilon_{ik}$ and $\mu_{ik}$ being symmetrical. Under these conditions it is possible to derive a
relation between the fields of the two sources and the extraneous currents in them. 

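We take the scalar products of the two equations $\mathrm{curl}\,\mathbf{E}_1 = i\omega\mathbf{B}_1/c$, $\mathrm{curl}\,\mathbf{H}_1 = -i\omega\mathbf{D}_1/c + 4\pi\mathbf{j}_{\mathrm{ex},1}/c$
with $\mathbf{H}_2$ and $\mathbf{E}_2$ respectively, and of the corresponding equations for $\mathbf{E}_2$ and
$\mathbf{H}_2$ with $-\mathbf{H}_1$ and $-\mathbf{E}_1$. Adding all four together, we obtain

$$(\mathbf{H}_2\cdot\mathrm{curl}\,\mathbf{E}_1 - \mathbf{E}_1\cdot\mathrm{curl}\,\mathbf{H}_2) + (\mathbf{E}_2\cdot\mathrm{curl}\,\mathbf{H}_1 - \mathbf{H}_1\cdot\mathrm{curl}\,\mathbf{E}_2)$$

$$= (i\omega/c)(\mathbf{B}_1\cdot\mathbf{H}_2 - \mathbf{H}_1\cdot\mathbf{B}_2) + (i\omega/c)(\mathbf{E}_1\cdot\mathbf{D}_2 - \mathbf{D}_1\cdot\mathbf{E}_2) + (4\pi/c)(\mathbf{j}_{\mathrm{ex},1}\cdot\mathbf{E}_2 - \mathbf{j}_{\mathrm{ex},2}\cdot\mathbf{E}_1).$$

But $\mathbf{B}_1\cdot\mathbf{H}_2 = \mu_{ik}H_{1k}H_{2i} = \mathbf{H}_1\cdot\mathbf{B}_2$, and $\mathbf{E}_1\cdot\mathbf{D}_2 = \mathbf{D}_1\cdot\mathbf{E}_2$, so that the first two terms on
the right-hand side are zero. The left-hand side can be transformed by a formula of vector
analysis, and the result is

$$\mathrm{div}\,[\mathbf{E}_1\times\mathbf{H}_2 - \mathbf{E}_2\times\mathbf{H}_1] = (4\pi/c)(\mathbf{j}_{\mathrm{ex},1}\cdot\mathbf{E}_2 - \mathbf{j}_{\mathrm{ex},2}\cdot\mathbf{E}_1).$$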

We integrate this equation over all space; the integral on the left-hand side can be 
transformed into one over an infinitely remote surface, and is zero. Thus we have 

$$\int\mathbf{j}_{\mathrm{ex},1}\cdot\mathbf{E}_2\,dV_1 = \int\mathbf{j}_{\mathrm{ex},2}\cdot\mathbf{E}_1\,dV_2. \tag{89.2}$$

The integrals are taken only over the volumes of sources 1 and 2 respectively, since the
currents $\mathbf{j}_{\mathrm{ex},1}$ and $\mathbf{j}_{\mathrm{ex},2}$ are zero elsewhere. Since the wires are thin, the effect of each on the
field of the other may be neglected, and therefore $\mathbf{E}_1$ and $\mathbf{E}_2$ in formula (89.2) are the fields
due to each of the two sources at the position of the other. Formula (89.2) is the required
relation; it is called the reciprocity theorem.

If the dimensions of the sources are small compared with the wavelength and with the 
distance between them, this formula can be simplified. The field of each source varies only 
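slightly over the dimensions of the other, and in (89.2) we can take $\mathbf{E}_1$ and $\mathbf{E}_2$ outside the
integrals and replace them by $\mathbf{E}_1(2)$ and $\mathbf{E}_2(1)$, 1 and 2 signifying the positions of the two
sources:

$$\mathbf{E}_2(1)\cdot\int\mathbf{j}_{\mathrm{ex},1}\,dV_1 = \mathbf{E}_1(2)\cdot\int\mathbf{j}_{\mathrm{ex},2}\,dV_2.$$

The integral $\int\mathbf{j}_{\mathrm{ex}}\,dV$ is just the time derivative of the total dipole moment $\mathscr{P}$ of the source.
Since $\dot{\mathscr{P}} = -i\omega\mathscr{P}$, we have finally

$$\mathbf{E}_2(1)\cdot\mathscr{P}_1 = \mathbf{E}_1(2)\cdot\mathscr{P}_2. \tag{89.3}$$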

This form of the reciprocity theorem applies, of course, only to dipole emission. If the 
dipole moment of the source is zero, or very small, the approximation made in going from 
the general formula (89.2) to (89.3) is inadequate; see Problem 1. 
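
The dipole form (89.3) can be checked numerically in the simplest case of two point dipoles in a vacuum. The sketch below uses the standard expression for the complex field amplitude of an oscillating dipole (quasi-static plus radiative parts); that expression, the helper names and the sample vectors are assumptions introduced for illustration, not part of the derivation above.

```python
import cmath
import math

def dot(a, b):
    """Plain (unconjugated) scalar product of two 3-vectors."""
    return sum(x * y for x, y in zip(a, b))

def dipole_field(p, r_src, r_obs, k):
    """Complex amplitude E(r_obs) of a point dipole p located at r_src,
    in vacuum, for wave number k (Gaussian units, factor exp(-i omega t) omitted)."""
    sep = [o - s for o, s in zip(r_obs, r_src)]
    dist = math.sqrt(dot(sep, sep))
    n = [x / dist for x in sep]
    n_p = dot(n, p)
    wave = cmath.exp(1j * k * dist) / dist
    # Radiative part k^2 (n x p) x n = k^2 [p - n (n.p)]
    radiative = [k * k * (p_i - n_i * n_p) for p_i, n_i in zip(p, n)]
    # Quasi-static and induction parts [3 n (n.p) - p] (1/r^2 - ik/r)
    near = [(3 * n_i * n_p - p_i) * (1 / dist**2 - 1j * k / dist)
            for p_i, n_i in zip(p, n)]
    return [wave * (a + b) for a, b in zip(radiative, near)]

r1, r2 = (0.0, 0.0, 0.0), (1.3, -0.4, 2.1)
p1 = (1.0 + 0.5j, 0.2, -0.7j)
p2 = (0.3, -1.1 + 0.2j, 0.8)
k = 2.0

lhs = dot(dipole_field(p2, r2, r1, k), p1)  # E_2(1) . P_1
rhs = dot(dipole_field(p1, r1, r2, k), p2)  # E_1(2) . P_2
print(lhs, rhs)
```

The two printed numbers should agree to rounding accuracy, because the field expression is quadratic in the unit vector n and therefore unchanged when the roles of source and observation point are exchanged.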


PROBLEMS 


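Problem 1. Derive the reciprocity theorem for quadrupole emitters and for magnetic dipole emitters.
Solution. If $\mathscr{P} = 0$, the next terms in the expansion must be taken in the integrals (89.2):

$$\int\mathbf{j}\cdot\mathbf{E}\,dV = \frac{\partial E_i}{\partial x_k}\int x_kj_i\,dV;$$

we omit the suffix ex for brevity. The quadrupole moment tensor and the magnetic moment vector are defined by

$$\dot{D}_{ik} = -i\omega D_{ik} = \int\{3(x_ij_k + x_kj_i) - 2\delta_{ik}\,\mathbf{r}\cdot\mathbf{j}\}\,dV, \qquad \mathscr{M} = \frac{1}{2c}\int\mathbf{r}\times\mathbf{j}\,dV.$$

Using the equation curl E = iωB/c and assuming that ε = constant near the sources (and so div E = 0), we obtain

$$\int\mathbf{j}\cdot\mathbf{E}\,dV = -\frac{i\omega}{12}D_{ik}\left(\frac{\partial E_i}{\partial x_k} + \frac{\partial E_k}{\partial x_i}\right) + i\omega\,\mathscr{M}\cdot\mathbf{B}.$$

Hence we see that for quadrupole emitters the reciprocity theorem is

$$D^{(1)}_{ik}\left(\frac{\partial E_{2i}(1)}{\partial x_k} + \frac{\partial E_{2k}(1)}{\partial x_i}\right) = D^{(2)}_{ik}\left(\frac{\partial E_{1i}(2)}{\partial x_k} + \frac{\partial E_{1k}(2)}{\partial x_i}\right),$$

and for magnetic dipole emitters

$$\mathbf{B}_2(1)\cdot\mathscr{M}_1 = \mathbf{B}_1(2)\cdot\mathscr{M}_2.$$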

Problem 2. Determine the intensity of emission from a dipole source immersed in a homogeneous isotropic
medium as a function of the permittivity $\varepsilon$ and permeability $\mu$ of the medium.

Solution. By substituting $\mathbf{E} = \sqrt{\mu/\varepsilon}\,\mathbf{E}'$, $\mathbf{H} = \mathbf{H}'$, $\omega = \omega'/\sqrt{\varepsilon\mu}$, equations (89.1) are brought to the form
$\mathrm{curl}\,\mathbf{E}' = i\omega'\mathbf{H}'/c$, $\mathrm{curl}\,\mathbf{H}' = -i\omega'\mathbf{E}'/c + 4\pi\mathbf{j}_{\mathrm{ex}}/c$, which do not involve $\varepsilon$ or $\mu$. The solution of these equations for
dipole emission gives a vector potential of the field in the wave region (see Fields, §67) $\mathbf{A}' = (1/cR_0)\int\mathbf{j}_{\mathrm{ex}}\,dV$, where $R_0$ is
the distance from the source; the phase factors are omitted, since they do not affect the calculation of the intensity.
Hence we see that, for given $\mathbf{j}_{\mathrm{ex}}$, we can put $\mathbf{A}' = \mathbf{A}_0$, where the suffix 0 signifies the value for the source field in a
vacuum. The values of $\mathbf{H}'$ and $\mathbf{E}'$ are

$$\mathbf{H}' = i\mathbf{k}'\times\mathbf{A}' = \sqrt{\varepsilon\mu}\,i\mathbf{k}\times\mathbf{A}_0 = \sqrt{\varepsilon\mu}\,\mathbf{H}_0, \qquad E' = H'.$$

Hence $H = \sqrt{\varepsilon\mu}\,H_0$, $E = \mu E_0$ and $I = \mu\sqrt{\varepsilon\mu}\,I_0$. This is the required solution.

§90. Electromagnetic oscillations in hollow resonators 

Let us consider the electric field in a hollow evacuated resonator with perfectly
conducting walls. The equations of the monochromatic field in the vacuum are

$$\mathrm{curl}\,\mathbf{E} = i\omega\mathbf{H}/c, \qquad \mathrm{curl}\,\mathbf{H} = -i\omega\mathbf{E}/c. \tag{90.1}$$

The boundary conditions on the surface of a perfect conductor (i.e. one whose impedance
$\zeta = 0$) are

$$\mathbf{E}_t = 0, \qquad H_n = 0. \tag{90.2}$$

To solve the problem, it suffices to consider either E or H. For instance, eliminating H from 
equations (90.1), we find that E satisfies the wave equation 

$$\triangle\mathbf{E} + (\omega^2/c^2)\mathbf{E} = 0, \tag{90.3}$$

together with the equation

$$\mathrm{div}\,\mathbf{E} = 0, \tag{90.4}$$

which does not follow from (90.3). Solving these equations with the boundary condition
$\mathbf{E}_t = 0$, we find the field E, and then H can be derived from the first of (90.1). The boundary
condition $H_n = 0$ is automatically satisfied.

When the shape and size of the cavity are given, equations (90.3) and (90.4) have
solutions only for certain values of $\omega$, called the eigenfrequencies of the electromagnetic
oscillations of the resonator concerned. For $\zeta = 0$ the electromagnetic field does not
penetrate into the metal, and no loss occurs there. All the characteristic oscillations are
therefore undamped, and all the eigenfrequencies are real. The latter are infinite in number,
and the order of magnitude of the lowest eigenfrequency $\omega_1$ is $c/l$, where $l$ is the linear
dimension of the cavity. This follows immediately from dimensional considerations, since $l$
is the only dimensional parameter characterizing the problem if the shape of the resonator
is given. The high eigenfrequencies ($\omega \gg c/l$) lie very close together, and the number of
them per unit range of $\omega$ is $V\omega^2/2\pi^2c^3$, which depends on the volume $V$ of the resonator
but not on its shape (see Fields, §52).

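The (time) averaged values of the electric and magnetic field energies in the resonator are
respectively $\tfrac{1}{2}\int(|\mathbf{E}|^2/8\pi)\,dV$ and $\tfrac{1}{2}\int(|\mathbf{H}|^2/8\pi)\,dV$. We shall show that they are equal. Using
the first equation (90.1), we write $\int\mathbf{H}\cdot\mathbf{H}^*\,dV = (c^2/\omega^2)\int\mathrm{curl}\,\mathbf{E}\cdot\mathrm{curl}\,\mathbf{E}^*\,dV$. The second
integral can be integrated by parts:

$$\int\mathrm{curl}\,\mathbf{E}\cdot\mathrm{curl}\,\mathbf{E}^*\,dV = \oint\mathrm{curl}\,\mathbf{E}^*\cdot d\mathbf{f}\times\mathbf{E} + \int\mathbf{E}\cdot\mathrm{curl}\,\mathrm{curl}\,\mathbf{E}^*\,dV.$$

Since $\mathbf{E}_t = 0$ on the boundary of the volume considered, the surface integral is zero, leaving

$$\int|\mathbf{H}|^2\,dV = \frac{c^2}{\omega^2}\int\mathbf{E}\cdot\mathrm{curl}\,\mathrm{curl}\,\mathbf{E}^*\,dV = -(c^2/\omega^2)\int\mathbf{E}\cdot\triangle\mathbf{E}^*\,dV$$

or, by (90.3),

$$\int|\mathbf{H}|^2\,dV = \int|\mathbf{E}|^2\,dV. \tag{90.5}$$

This completes the proof.†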

Undamped oscillations in a resonator are obtained if the impedance of its walls is
assumed to be zero. Let us now ascertain the effect on the eigenfrequencies if the
impedance of the walls is small but not zero.

The (time) averaged energy dissipated in the walls of the resonator in unit time can be
calculated as the flux of energy into the walls from the electromagnetic field in the cavity.
Using the boundary condition (87.6) on the surface of a body with impedance $\zeta$, we write
the normal component of the energy flux density as $\bar{S}_n = (c/8\pi)\,\mathrm{re}\,(\mathbf{E}_t\times\mathbf{H}_t^*) = c\zeta'|\mathbf{H}_t|^2/8\pi$,
where $\zeta'$ is the real part of $\zeta$. In this expression, which already contains the small factor $\zeta'$,
we can as a first approximation take H to be the field obtained by solving the problem with
$\zeta = 0$. The total energy dissipated is given by the integral

$$\frac{c}{8\pi}\,\zeta'\oint|\mathbf{H}|^2\,df, \tag{90.6}$$

taken over the internal surface of the resonator. The field amplitude is damped in time with
a decrement obtained by dividing (90.6) by twice the total energy of the field, namely
$\tfrac{1}{2}\int(|\mathbf{E}|^2 + |\mathbf{H}|^2)\,dV/8\pi = \int|\mathbf{H}|^2\,dV/8\pi$.

The damping decrement is determined by the imaginary part $|\omega''|$ of the complex
frequency $\omega = \omega' + i\omega''$.‡ Writing the formula in the complex form

$$\omega - \omega_0 = -\tfrac{1}{2}ic\zeta\,\frac{\oint|\mathbf{H}|^2\,df}{\int|\mathbf{H}|^2\,dV}, \tag{90.7}$$

$\omega$ and $\omega_0$ being the frequencies with and without allowance for $\zeta$, we can determine not
only the damping decrement but also the change in the eigenfrequencies themselves. The
latter is seen to be determined by the imaginary part of $\zeta$. We have mentioned in §87 that
usually $\zeta'' < 0$, and the eigenfrequencies are then reduced.

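In actual calculations, it may be more convenient to transform the volume integral in the
denominator of (90.7) into one over the surface. Since the vector H is tangential to the
surface, we have identically

$$\oint(\mathbf{H}\cdot\mathbf{H}^*)(\mathbf{r}\cdot d\mathbf{f}) = \oint(\mathbf{H}\cdot\mathbf{H}^*)(\mathbf{r}\cdot d\mathbf{f}) - \oint(\mathbf{H}\cdot\mathbf{r})(\mathbf{H}^*\cdot d\mathbf{f}) - \oint(\mathbf{H}^*\cdot\mathbf{r})(\mathbf{H}\cdot d\mathbf{f}).$$

The integrals on the right are transformed by putting $d\mathbf{f} \to dV\,\mathrm{grad}$, and using (90.1) we
obtain

$$\oint(\mathbf{H}\cdot\mathbf{H}^*)(\mathbf{r}\cdot d\mathbf{f}) = \frac{i\omega}{c}\int\mathbf{r}\cdot(\mathbf{H}\times\mathbf{E}^* - \mathbf{H}^*\times\mathbf{E})\,dV + \int\mathbf{H}\cdot\mathbf{H}^*\,dV.$$

Similarly, using the identity $\mathbf{r}\times(\mathbf{E}\times d\mathbf{f}) = \mathbf{E}(\mathbf{r}\cdot d\mathbf{f}) - (\mathbf{r}\cdot\mathbf{E})\,d\mathbf{f} = 0$ (by the boundary
condition $\mathbf{E}_t = 0$), we have

$$\oint(\mathbf{E}\cdot\mathbf{E}^*)(\mathbf{r}\cdot d\mathbf{f}) = -\oint(\mathbf{E}\cdot\mathbf{E}^*)(\mathbf{r}\cdot d\mathbf{f}) + \oint(\mathbf{E}\cdot\mathbf{r})(\mathbf{E}^*\cdot d\mathbf{f}) + \oint(\mathbf{E}^*\cdot\mathbf{r})(\mathbf{E}\cdot d\mathbf{f})$$

$$= \frac{i\omega}{c}\int\mathbf{r}\cdot(\mathbf{H}\times\mathbf{E}^* - \mathbf{H}^*\times\mathbf{E})\,dV - \int\mathbf{E}\cdot\mathbf{E}^*\,dV.$$

Subtracting and using (90.5), we obtain

$$\int|\mathbf{H}|^2\,dV = \tfrac{1}{2}\oint(|\mathbf{H}|^2 - |\mathbf{E}|^2)\,\mathbf{r}\cdot d\mathbf{f}. \tag{90.8}$$

† E and H always mean the fields corresponding to a particular eigenfrequency. It is not difficult to show
that the fields corresponding to two different eigenfrequencies $\omega_a$ and $\omega_b$ are orthogonal:

$$\int\mathbf{E}_a\cdot\mathbf{E}_b^*\,dV = \int\mathbf{H}_a\cdot\mathbf{H}_b^*\,dV = 0.$$

‡ In engineering the Q factor or quality of the resonator, defined as the ratio $\omega'/2|\omega''|$, is generally used
instead of the damping decrement.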

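The formulae for a resonator filled with a non-absorbing dielectric for which $\varepsilon$ and $\mu$
differ from unity are obtained from those for an evacuated resonator by replacing

$$\omega,\ \mathbf{E}\ \text{and}\ \mathbf{H}\ \text{by}\ \omega\sqrt{\varepsilon\mu},\ \sqrt{\varepsilon}\,\mathbf{E}\ \text{and}\ \sqrt{\mu}\,\mathbf{H}. \tag{90.9}$$

This is seen from the fact that the transformation just given converts equations (90.1) into
the correct Maxwell's equations for the medium:

$$\mathrm{curl}\,\mathbf{E} = i\omega\mu\mathbf{H}/c, \qquad \mathrm{curl}\,\mathbf{H} = -i\omega\varepsilon\mathbf{E}/c.$$

In particular, the presence of the medium reduces each eigenfrequency by a factor $\sqrt{\varepsilon\mu}$.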


PROBLEMS 

Problem 1. Determine the eigenfrequencies of a cuboidal resonator with perfectly conducting walls.

Solution. We take the axes of x, y, z along three concurrent edges of the cuboid; let the lengths of these edges
be $a_1$, $a_2$, $a_3$. The solutions of equations (90.3) and (90.4) which satisfy the boundary condition $\mathbf{E}_t = 0$ are

$$E_x = A_1\cos k_xx\,\sin k_yy\,\sin k_zz \tag{1}$$

and similarly for $E_y$, $E_z$, where

$$k_x = n_1\pi/a_1, \qquad k_y = n_2\pi/a_2, \qquad k_z = n_3\pi/a_3 \tag{2}$$

($n_1$, $n_2$, $n_3$ being positive integers). The constants $A_1$, $A_2$, $A_3$ are related by

$$A_1k_x + A_2k_y + A_3k_z = 0, \tag{3}$$

and the eigenfrequencies are $\omega^2 = c^2(k_x^2 + k_y^2 + k_z^2)$.
The magnetic field is calculated from (1):

$$H_x = -(ic/\omega)(A_3k_y - A_2k_z)\sin k_xx\,\cos k_yy\,\cos k_zz,$$

and similarly for $H_y$, $H_z$.

If two or all of the numbers $n_1$, $n_2$, $n_3$ are zero, E = 0. Hence the lowest frequency corresponds to an oscillation
in which one of these numbers is 0 and the other two are 1.

Since the relation (3) holds, the solution (1) (with given non-zero $n_1$, $n_2$, $n_3$) involves only two independent
arbitrary constants, i.e. each eigenfrequency is doubly degenerate. The frequencies for which one of $n_1$, $n_2$, $n_3$ is
zero are not degenerate.
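
The counting rules of this solution can be illustrated with a short enumeration. The sketch below lists the lowest modes of a cuboid from $\omega^2 = c^2(k_x^2 + k_y^2 + k_z^2)$, discards index sets with two or three zeros, and attaches the polarization degeneracy stated above; the function name `cuboid_modes` and the cut-off `n_max` are hypothetical choices for the example.

```python
import math
from itertools import product

C = 2.99792458e10  # speed of light in vacuum, cm/s

def cuboid_modes(a1, a2, a3, n_max=3):
    """Return a sorted list of (omega, (n1, n2, n3), degeneracy) for a cuboid
    with edges a1, a2, a3 (cm) and perfectly conducting walls."""
    modes = []
    for n in product(range(n_max + 1), repeat=3):
        zeros = n.count(0)
        if zeros >= 2:
            continue  # E vanishes identically for these index sets
        k_squared = sum((n_i * math.pi / a_i) ** 2
                        for n_i, a_i in zip(n, (a1, a2, a3)))
        omega = C * math.sqrt(k_squared)
        # Two independent amplitudes when all indices are non-zero, one otherwise.
        degeneracy = 1 if zeros == 1 else 2
        modes.append((omega, n, degeneracy))
    return sorted(modes)

for omega, n, g in cuboid_modes(3.0, 2.0, 1.0)[:4]:
    print(f"n = {n}, omega = {omega:.4e} rad/s, polarization degeneracy = {g}")
```

For a cube the three index sets with one zero and two ones share the same frequency, so additional degeneracy arising from the symmetry of the shape is not captured by the polarization count returned here.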

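Problem 2. Determine the frequencies of electric dipole and magnetic dipole oscillations in a spherical
resonator with radius a.

Solution. In a stationary spherical electric dipole wave, the fields E and H are of the form

$$\mathbf{E} = e^{-i\omega t}\,\mathrm{curl}\,\mathrm{curl}\left(\mathbf{b}\,\frac{\sin kr}{r}\right), \qquad \mathbf{H} = -ike^{-i\omega t}\,\mathrm{curl}\left(\mathbf{b}\,\frac{\sin kr}{r}\right),$$

where b is a constant vector and $k = \omega/c$ (see Fields, §72). The boundary condition $\mathbf{n}\times\mathbf{E} = 0$ at $r = a$ gives
$\cot ka = (ka)^{-1} - ka$. The smallest root of this equation is $ka = 2.74$. The frequency $\omega = 2.74\,c/a$ is the lowest
eigenfrequency of a spherical resonator.
In a stationary spherical magnetic dipole wave, we have

$$\mathbf{E} = ike^{-i\omega t}\,\mathrm{curl}\left(\mathbf{b}\,\frac{\sin kr}{r}\right), \qquad \mathbf{H} = e^{-i\omega t}\,\mathrm{curl}\,\mathrm{curl}\left(\mathbf{b}\,\frac{\sin kr}{r}\right).$$

The boundary condition on E gives the equation $\tan ka = ka$, whose smallest root is $ka = 4.49$.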
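
The two roots quoted in this solution can be reproduced with a few lines of bisection. In the sketch below each transcendental equation is first multiplied through by a factor that removes the poles of cot and tan; the function names and the bracketing intervals are choices made for this illustration.

```python
import math

def bisect(func, lo, hi, tol=1e-12):
    """Root of func in [lo, hi] by bisection; func must change sign on the interval."""
    f_lo = func(lo)
    if f_lo * func(hi) > 0:
        raise ValueError("root is not bracketed")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = func(mid)
        if f_lo * f_mid <= 0:
            hi = mid
        else:
            lo, f_lo = mid, f_mid
    return 0.5 * (lo + hi)

def electric_dipole(x):
    # cot x = 1/x - x, multiplied by x sin x:  x cos x - (1 - x^2) sin x = 0
    return x * math.cos(x) - (1 - x * x) * math.sin(x)

def magnetic_dipole(x):
    # tan x = x, multiplied by cos x:  sin x - x cos x = 0
    return math.sin(x) - x * math.cos(x)

ka_electric = bisect(electric_dipole, 2.0, 3.0)
ka_magnetic = bisect(magnetic_dipole, 4.0, 4.6)
print(f"electric dipole: ka = {ka_electric:.4f}")
print(f"magnetic dipole: ka = {ka_magnetic:.4f}")
```

The brackets exclude the trivial root at $ka = 0$, which both rewritten equations share.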

Problem 3. A small sphere with electric and magnetic polarizabilities $\alpha_e$ and $\alpha_m$ is placed in a resonator.
Determine the resulting shift in the resonator eigenfrequency.

Solution. Let E and H be the fields in the resonator without the sphere, $\mathbf{E}_1$ and $\mathbf{H}_1$ those when the sphere is
present. The fields E and H satisfy the equations (90.1); $\mathbf{E}_1$ and $\mathbf{H}_1$ satisfy

$$\mathrm{curl}\,\mathbf{E}_1 = i\omega_1\mathbf{H}_1/c, \qquad \mathrm{curl}\,\mathbf{H}_1 = 4\pi\mathbf{j}_1/c - i\omega_1\mathbf{E}_1/c, \tag{1}$$

where $\mathbf{j}_1$ is the current density in the sphere. We multiply the first of these equations by $\mathbf{H}^*$, the second by $-\mathbf{E}^*$,
take the complex conjugates of equations (90.1), and multiply them by $\mathbf{H}_1$ and $-\mathbf{E}_1$ respectively. Addition of all
four equations then gives

$$\mathrm{div}\,[\mathbf{E}_1\times\mathbf{H}^* + \mathbf{E}^*\times\mathbf{H}_1] = i\,\delta\omega\,(\mathbf{H}_1\cdot\mathbf{H}^* + \mathbf{E}_1\cdot\mathbf{E}^*)/c - 4\pi\mathbf{j}_1\cdot\mathbf{E}^*/c,$$

where $\delta\omega = \omega_1 - \omega$ is the required frequency shift. This equation is next integrated over the volume of the
resonator. The left-hand side is zero, by Gauss's theorem, since at the wall $\mathbf{E}_t = \mathbf{E}_{1t} = 0$. Since the sphere is small,
the main contribution to the integral of the first term on the right arises at large distances from the sphere; at such
distances, however, the field perturbation due to the sphere is small, and so we can put $\mathbf{E}_1 \approx \mathbf{E}$, $\mathbf{H}_1 \approx \mathbf{H}$. The
integral of the second term is transformed as in §89 (and Problem 1 there), giving

$$\int\mathbf{j}_1\cdot\mathbf{E}^*\,dV = -i\omega(\mathscr{P}\cdot\mathbf{E}_0^* + \mathscr{M}\cdot\mathbf{H}_0^*) = -i\omega V_0(\alpha_e|\mathbf{E}_0|^2 + \alpha_m|\mathbf{H}_0|^2),$$

where $\mathbf{E}_0 = \mathbf{E}(\mathbf{r}_0)$, $\mathbf{H}_0 = \mathbf{H}(\mathbf{r}_0)$, $\mathbf{r}_0$ is the position of the sphere, and $V_0$ is its volume; the sphere is supposed to be
so small that the fields E and H do not vary appreciably across it.

Thus, with (90.5), we find the frequency shift

$$\frac{\omega_1 - \omega}{\omega} = -V_0\,\frac{\alpha_e|\mathbf{E}_0|^2 + \alpha_m|\mathbf{H}_0|^2}{\int|\mathbf{H}|^2\,dV/2\pi}.$$

If the polarizabilities are complex, this formula gives both the frequency shift and the damping of the
characteristic oscillations.

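Problem 4. A resonator is filled with a dispersionless transparent dielectric having permittivity $\varepsilon_0$.
Determine the change in the eigenfrequency due to a small change $\delta\varepsilon(\mathbf{r})$ in the permittivity.

Solution. The unperturbed field $\mathbf{E}_0$, $\mathbf{H}_0$ in the resonator satisfies the equations $\mathrm{curl}\,\mathbf{E}_0 = i\omega_0\mathbf{H}_0/c$, $\mathrm{curl}\,\mathbf{H}_0
= -i\omega_0\varepsilon_0\mathbf{E}_0/c$, and the perturbed field E, H satisfies $\mathrm{curl}\,\mathbf{E} = i(\omega_0 + \delta\omega)\mathbf{H}/c$, $\mathrm{curl}\,\mathbf{H} = -i(\omega_0\varepsilon_0 + \omega_0\,\delta\varepsilon
+ \varepsilon_0\,\delta\omega)\mathbf{E}/c$, the term in $\delta\omega\,\delta\varepsilon$ being neglected. Using these four equations as in Problem 3, we find

$$\mathrm{div}\,[(\mathbf{E}\times\mathbf{H}_0^*) + (\mathbf{E}_0^*\times\mathbf{H})] = i(\omega_0\,\delta\varepsilon + \varepsilon_0\,\delta\omega)\,\mathbf{E}\cdot\mathbf{E}_0^*/c + i\,\delta\omega\,\mathbf{H}\cdot\mathbf{H}_0^*/c$$

$$\approx i(\omega_0\,\delta\varepsilon + \varepsilon_0\,\delta\omega)\,\mathbf{E}_0\cdot\mathbf{E}_0^*/c + i\,\delta\omega\,\mathbf{H}_0\cdot\mathbf{H}_0^*/c,$$

whence

$$\delta\omega/\omega_0 = -\int|\mathbf{E}_0|^2\,\delta\varepsilon\,dV\Big/2\varepsilon_0\int|\mathbf{E}_0|^2\,dV.$$

We have used the fact that for a dielectric-filled resonator the relation (90.5) becomes

$$\int|\mathbf{H}_0|^2\,dV = \varepsilon_0\int|\mathbf{E}_0|^2\,dV,$$

as is clear from (90.9).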


§91. The propagation of electromagnetic waves in waveguides

A waveguide is a hollow pipe† of infinite length, i.e. a cavity infinite in one direction,
whereas the resonators discussed in §90 are of finite volume. The characteristic oscillations
in a resonator are stationary waves, but those in a waveguide are "stationary" only in the
transverse directions; waves travelling in the direction along the pipe can be propagated.

† The formulae below hold for an evacuated waveguide. Those for a waveguide filled with a non-absorbing
dielectric are obtained by means of the transformation (90.9).

Let us consider a straight waveguide with any (simply-connected) cross-section uniform
along its length. We shall first suppose that the walls of the waveguide are perfectly
conducting, and take the z-axis along the waveguide. In a travelling wave propagated in the
z-direction, all quantities depend on z through a factor $\exp(ik_zz)$, with a constant $k_z$.
The electromagnetic waves possible in such a waveguide can be divided into two types: in
one, $H_z = 0$, and in the other $E_z = 0$ (Rayleigh, 1897). The former type, in which the
magnetic field is purely transverse, are called electric-type waves or E waves. The latter, in
which the electric field is purely transverse, are called magnetic-type waves or H waves.
Let us first consider E waves. The x and y components of equations (90.1) give

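$$\frac{\partial E_z}{\partial y} - ik_zE_y = i\frac{\omega}{c}H_x, \qquad -\frac{\partial E_z}{\partial x} + ik_zE_x = i\frac{\omega}{c}H_y,$$

$$ik_zH_y = i\frac{\omega}{c}E_x, \qquad ik_zH_x = -i\frac{\omega}{c}E_y.$$

Hence

$$E_x = \frac{ik_z}{\kappa^2}\frac{\partial E_z}{\partial x}, \quad E_y = \frac{ik_z}{\kappa^2}\frac{\partial E_z}{\partial y}, \quad H_x = -\frac{i\omega}{c\kappa^2}\frac{\partial E_z}{\partial y}, \quad H_y = \frac{i\omega}{c\kappa^2}\frac{\partial E_z}{\partial x}, \tag{91.1}$$

where $\kappa^2 = (\omega^2/c^2) - k_z^2$. Thus, in an E wave, all the transverse components of E and H
can be expressed in terms of the longitudinal component of the electric field. This
component must be determined by solving the wave equation, which takes the two-
dimensional form

$$\triangle_2E_z + \kappa^2E_z = 0 \tag{91.2}$$

($\triangle_2$ being the two-dimensional Laplacian). The boundary conditions for this equation are
that the tangential components of E should vanish on the walls of the waveguide, and can
be satisfied by putting

$$E_z = 0 \text{ on the circumference of the cross-section.} \tag{91.3}$$

According to formulae (91.1), the two-dimensional vector whose components are $E_x$, $E_y$ is
proportional to the two-dimensional gradient of $E_z$. When the condition (91.3) holds,
therefore, the tangential component of E in the xy-plane is also zero.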

Similarly, in an H wave the transverse components of E and H can be expressed in terms
of the longitudinal component of the magnetic field:

$$H_x = \frac{ik_z}{\kappa^2}\frac{\partial H_z}{\partial x}, \quad H_y = \frac{ik_z}{\kappa^2}\frac{\partial H_z}{\partial y}, \quad E_x = \frac{i\omega}{c\kappa^2}\frac{\partial H_z}{\partial y}, \quad E_y = -\frac{i\omega}{c\kappa^2}\frac{\partial H_z}{\partial x}. \tag{91.4}$$

The longitudinal field $H_z$ is given by the solution of the equation

$$\triangle_2H_z + \kappa^2H_z = 0 \tag{91.5}$$

with the boundary condition

$$\partial H_z/\partial n = 0 \text{ on the circumference of the cross-section.} \tag{91.6}$$

According to formulae (91.4), this condition ensures that the normal component of H is
zero.

Thus the problem of determining the electromagnetic field in a waveguide reduces to
that of finding solutions of the two-dimensional wave equation $\triangle_2f + \kappa^2f = 0$, with the
boundary condition $f = 0$ or $\partial f/\partial n = 0$ on the circumference of the cross-section. For a
given cross-section, such solutions exist only for certain definite eigenvalues of the
parameter $\kappa^2$.

For each eigenvalue $\kappa^2$ we have the relation

$$\omega^2 = c^2(k_z^2 + \kappa^2) \tag{91.7}$$

between the frequency $\omega$ and the wave number $k_z$ of the wave. The velocity of propagation
of the wave along the waveguide is given by the derivative

$$u_z = \frac{\partial\omega}{\partial k_z} = \frac{ck_z}{\sqrt{k_z^2 + \kappa^2}} = \frac{c^2k_z}{\omega}. \tag{91.8}$$

For given $\kappa$, this varies from 0 to $c$ when $k_z$ varies from 0 to $\infty$.
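
The behaviour of (91.7) and (91.8) is easily tabulated. The following sketch evaluates the frequency, the velocity of propagation and the lowest admissible frequency $c\kappa$ for a given eigenvalue; the function names and the sample numbers are illustrative assumptions.

```python
import math

C = 2.99792458e10  # speed of light in vacuum, cm/s

def frequency(k_z, kappa):
    """omega from (91.7) for longitudinal wave number k_z and eigenvalue kappa (1/cm)."""
    return C * math.hypot(k_z, kappa)

def propagation_velocity(k_z, kappa):
    """u_z = d(omega)/d(k_z) = c^2 k_z / omega, as in (91.8)."""
    return C**2 * k_z / frequency(k_z, kappa)

def minimum_frequency(kappa):
    """Frequency reached as k_z -> 0, below which this wave type cannot propagate."""
    return C * kappa

kappa = 1.0
for k_z in (0.0, 0.5, 1.0, 10.0, 1000.0):
    print(f"k_z = {k_z:8.1f}  omega = {frequency(k_z, kappa):.4e}  "
          f"u_z/c = {propagation_velocity(k_z, kappa) / C:.6f}")
```

The product of $u_z$ and the phase velocity $\omega/k_z$ equals $c^2$ for every $k_z$, which gives a convenient consistency check on any implementation.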

The (time) averaged energy flux density along the waveguide is given by the z-component
of the Poynting vector. A simple calculation, using formulae (91.1), gives for an E wave

$$\bar{S}_z = \frac{c}{8\pi}\,\mathrm{re}\,(\mathbf{E}\times\mathbf{H}^*)_z = \frac{\omega k_z}{8\pi\kappa^4}\,|\mathrm{grad}_2E_z|^2.$$

The total energy flux $q$ is obtained by integrating $\bar{S}_z$ over the cross-section of the
waveguide. We have

$$\int|\mathrm{grad}_2E_z|^2\,df = \oint E_z^*\frac{\partial E_z}{\partial n}\,dl - \int E_z^*\triangle_2E_z\,df.$$

The first integral is taken along the circumference of the cross-section, and is zero on
account of the boundary condition $E_z = 0$. In the second integral we replace $\triangle_2E_z$ by
$-\kappa^2E_z$, and the result is

$$q = \frac{\omega k_z}{8\pi\kappa^2}\int|E_z|^2\,df. \tag{91.9}$$

The expression obtained for an H wave is the same with $H_z$ instead of $E_z$.

The electromagnetic energy density $W$ (per unit length of the waveguide) may be
calculated similarly. It is simpler, however, to derive $W$ directly from $q$, since we must have
$q = Wu_z$. From (91.8) and (91.9), therefore,

$$W = \frac{\omega^2}{8\pi c^2\kappa^2}\int|E_z|^2\,df. \tag{91.10}$$


It follows from (91.7) that, for each type of wave (for a given value of $\kappa^2$) there is a
minimum possible frequency, namely $c\kappa$. At lower frequencies the propagation of waves of
the type concerned is not possible. There is a smallest eigenvalue $\kappa_{\min}$, which is not zero (see
below). We therefore conclude that there is a frequency $\omega_{\min} = c\kappa_{\min}$ below which no waves
can be propagated along the waveguide. The order of magnitude of $\omega_{\min}$ is $c/a$, where $a$ is
the transverse dimension of the pipe.

This statement is valid, however, only for waveguides in which the cross-section is simply
connected (as we have hitherto assumed). When the cross-section is multiply connected,
the situation is quite different. In such waveguides not only the E and H waves described
above but also another type of wave, whose frequency is subject to no restriction, can be
propagated. Such waves (called principal waves) are characterized by the fact that $k_z^2 = \omega^2/c^2$
(i.e. $\kappa = 0$); the velocity of propagation is equal to the velocity of light $c$. We shall derive the
chief properties of such waves, and shall see why such waves cannot occur when the cross-
section of the waveguide is simply connected.

All the field components in a principal wave satisfy the two-dimensional Laplace's
equation, $\triangle_2f = 0$. With the boundary condition $f = 0$, the only solution of this equation
regular throughout the cross-section (whether or not multiply connected) is $f = 0$. Hence
we have $E_z = 0$ in a principal wave.

With the boundary condition $\partial f/\partial n = 0$, a regular solution is $f$ = constant. It is easy to
see, however, that when $f$ is $H_z$ the constant must be zero (by a "constant", of course, we
mean a quantity independent of x and y). For, integrating the equation

$$\mathrm{div}\,\mathbf{H} = \frac{\partial H_x}{\partial x} + \frac{\partial H_y}{\partial y} + \frac{i\omega}{c}H_z = 0$$

over the cross-section, we obtain $\oint H_n\,dl + (i\omega/c)\int H_z\,df = 0$; since $H_n = 0$ on the
circumference of the cross-section and $H_z$ is constant over its area, it follows that $H_z = 0$.

Thus a principal wave is purely transverse. For $E_z = H_z = 0$, the x and y components of
equations (90.1) give

$$H_x = -E_y, \qquad H_y = E_x, \tag{91.11}$$

i.e. the fields E and H are perpendicular and equal in magnitude. They are determined by
the equations

$$\mathrm{div}\,\mathbf{E} = \frac{\partial E_x}{\partial x} + \frac{\partial E_y}{\partial y} = 0, \qquad (\mathrm{curl}\,\mathbf{E})_z = \frac{\partial E_y}{\partial x} - \frac{\partial E_x}{\partial y} = 0$$

with the boundary condition $\mathbf{E}_t = 0$.

We see that the dependence of E, and therefore of H, on x and y is given by the solution
of a two-dimensional electrostatic problem: $\mathbf{E} = -\mathrm{grad}_2\phi$, where the potential $\phi$ satisfies
the equation $\triangle_2\phi = 0$ with the boundary condition $\phi$ = constant. In a simply-connected
region, this boundary condition means that $\phi$ = constant (and so E = 0) is the only
solution regular throughout the region. This shows that waves of this type cannot be
propagated along a waveguide whose cross-section is simply connected. In a multiply-
connected region, on the other hand, the constant in the boundary condition need not be
the same on the various separate parts of the boundary, and so Laplace's equation has
solutions which are not trivial. The electric-field distribution over the cross-section of the
waveguide is the same as the two-dimensional electrostatic field between the plates of a
capacitor at a given potential difference.
So far we have assumed the walls of the waveguide to be perfectly conducting. If the
walls have a small but finite impedance, losses occur and the wave is therefore damped as it
is propagated along the waveguide. The damping coefficient can be calculated in the same
way as for the damping in time of electromagnetic oscillations in a resonator (§90).

The amount of energy dissipated in unit time per unit length of the walls of the
waveguide is given by the integral $(c/8\pi)\zeta'\oint|\mathbf{H}|^2\,dl$, taken along the circumference of the
cross-section; H is the magnetic field calculated on the assumption that $\zeta = 0$. Dividing this
expression by twice the energy flux $q$ along the waveguide, we obtain the required damping
coefficient $\alpha$. With this definition, $\alpha$ gives the rate of damping of the wave amplitude, which
decreases along the waveguide as $e^{-\alpha z}$.

Expressing all quantities in terms of $E_z$ or $H_z$ by means of formulae (91.1) or (91.4), we
obtain the following formulae for the absorption coefficients: for an E wave

$$\alpha = \frac{\omega\zeta'}{2\kappa^2k_zc}\,\frac{\oint|\mathrm{grad}_2E_z|^2\,dl}{\int|E_z|^2\,df} \tag{91.12}$$

and for an H wave

$$\alpha = \frac{c\kappa^2\zeta'}{2k_z\omega}\,\frac{\oint\{|H_z|^2 + (k_z^2/\kappa^4)|\mathrm{grad}_2H_z|^2\}\,dl}{\int|H_z|^2\,df}. \tag{91.13}$$

In an actual calculation it may be convenient to transform the surface integrals in the
denominators into integrals along the circumference. The necessary formulae, whose
derivation is similar to that of (90.8), are

$$\int|E_z|^2\,df = \frac{1}{2\kappa^2}\oint(\mathbf{n}\cdot\mathbf{r})\,|\mathrm{grad}_2E_z|^2\,dl, \qquad \int|H_z|^2\,df = \frac{1}{2\kappa^2}\oint(\mathbf{n}\cdot\mathbf{r})\{\kappa^2|H_z|^2 - |\mathrm{grad}_2H_z|^2\}\,dl. \tag{91.14}$$

When $k_z \to 0$, i.e. the frequency $\omega \to c\kappa$, the expressions (91.12) and (91.13) become
infinite, but they are then no longer applicable, because their derivation presupposes that $\alpha$
is small compared with $k_z$.

Formulae (91.12) and (91.13) are not valid for a principal wave (in a waveguide with a
multiply-connected cross-section), in which $E_z$, $H_z$ and $\kappa$ are all zero. In this case all the field
components can be expressed in terms of the scalar potential $\phi$. Using the fact that the
fields H and $\mathbf{E} = -\mathrm{grad}_2\phi$ in a principal wave are perpendicular and equal in magnitude,
we obtain the absorption coefficient

$$\alpha = \frac{\zeta'\oint|\mathrm{grad}_2\phi|^2\,dl}{2\int|\mathrm{grad}_2\phi|^2\,df}. \tag{91.15}$$

The propagation of a principal wave along a waveguide can be relatively simply
discussed when its absorption coefficient is not small (so that formula (91.15) is
inapplicable) but the wavelength $c/\omega$ is large compared with the transverse dimension of
the waveguide.
As has been mentioned, the transverse electric field in a principal wave at any instant
corresponds to the electrostatic field in a capacitor formed by the walls of the waveguide,
carrying equal and opposite charges. Let these charges be $\pm e(z)$ per unit length. They are
related to the currents $\pm J(z)$ flowing on the walls by the equation of continuity
$\partial e/\partial t = -\partial J/\partial z$, or, for a monochromatic field, $i\omega e = \partial J/\partial z$. Next, let $C$ be the
capacitance per unit length of the waveguide. The "potential difference" $\phi_2 - \phi_1$ between
its walls is $e/C$; differentiating this with respect to z, we obtain the e.m.f. which maintains
the current on the walls. (When absorption is present, the field is not purely transverse.)
Equating the e.m.f. to $ZJ$, where $Z$ is the impedance per unit length, we have

$$ZJ = -\frac{\partial}{\partial z}\left(\frac{e}{C}\right),$$

or

$$ZJ = \frac{i}{\omega}\frac{\partial}{\partial z}\left(\frac{1}{C}\frac{\partial J}{\partial z}\right). \tag{91.16}$$

Substituting $Z = R - i\omega L/c^2$, where $R$ and $L$ are the resistance and self-inductance per
unit length of the waveguide, we can return from the monochromatic current components
to currents which are arbitrary functions of time. Assuming the capacitance $C$ to be
constant along the waveguide, we arrive at the telegraph equation:

$$\frac{1}{C}\frac{\partial^2J}{\partial z^2} = R\frac{\partial J}{\partial t} + \frac{L}{c^2}\frac{\partial^2J}{\partial t^2}. \tag{91.17}$$

If there is no resistance ($R = 0$), this equation reduces, as it should, to the wave equation
with a velocity of wave propagation $\sqrt{c^2/LC} = c$. The equation $LC = 1$ follows from the
mathematical equivalence of the problems of determining $1/C$ and $L$ for a given cross-
section. The electric and magnetic fields between perfectly conducting surfaces are
perpendicular and equal in magnitude (see (91.11)), and if this magnitude is given on the
surfaces the charge density and the current density are respectively determined. Hence the
coefficients of proportionality ($1/C$ and $L$) between the field energy and the squared charge
and current respectively are the same.
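
To see how the resistance term damps the wave, one may substitute $J \propto \exp(ikz - i\omega t)$ into the telegraph equation (91.17), which gives $k^2 = C(\omega^2L/c^2 + i\omega R)$. The sketch below evaluates the complex wave number from this relation; the function name and the numerical values are hypothetical, with all quantities in Gaussian units.

```python
import cmath

C_LIGHT = 2.99792458e10  # speed of light in vacuum, cm/s

def telegraph_wavenumber(omega, resistance, inductance, capacitance):
    """Complex k for J ~ exp(i k z - i omega t) in the telegraph equation:
    k^2 = C (omega^2 L / c^2 + i omega R), per unit length quantities."""
    k_squared = capacitance * (omega**2 * inductance / C_LIGHT**2
                               + 1j * omega * resistance)
    return cmath.sqrt(k_squared)  # principal root: Re k >= 0, Im k >= 0 here

omega = 1.0e10
k_lossless = telegraph_wavenumber(omega, 0.0, 2.0, 0.5)   # L C = 1
k_lossy = telegraph_wavenumber(omega, 1.0e-12, 2.0, 0.5)
print(k_lossless, omega / C_LIGHT)
print(k_lossy)  # the positive imaginary part is the damping coefficient
```

With $R = 0$ and $LC = 1$ the wave number reduces to $\omega/c$, in agreement with the remark on the velocity of propagation.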


PROBLEMS 

Problem 1. Find the values of $\kappa$ for waves propagated in a waveguide whose cross-section is a rectangle with
sides a and b. Find the damping coefficients.

Solution. In E waves† $E_z = \text{constant}\times\sin k_xx\,\sin k_yy$, where $k_x = n_1\pi/a$, $k_y = n_2\pi/b$, with $n_1$, $n_2$
positive integers $\geq 1$. In H waves $H_z = \text{constant}\times\cos k_xx\,\cos k_yy$, and one of $n_1$ and $n_2$ may be zero. In both types
of wave $\kappa^2 = k_x^2 + k_y^2 = \pi^2(n_1^2/a^2 + n_2^2/b^2)$. The smallest value of $\kappa$ corresponds to an $H_{10}$ wave (the suffixes
show the values of $n_1$ and $n_2$) and is $\kappa_{\min} = \pi/a$ (we assume that $a > b$).

The damping coefficients are calculated from formulae (91.12) and (91.13) and are: for E waves

$$\alpha = 2\zeta'\omega(k_x^2b + k_y^2a)/c\kappa^2k_zab,$$

for $H_{n_10}$ waves

$$\alpha = \frac{c\kappa^2\zeta'}{\omega k_zab}\left[a + 2b + \frac{k_z^2}{\kappa^2}\,a\right],$$

and for $H_{n_1n_2}$ waves ($n_1, n_2 \neq 0$)

$$\alpha = \frac{2c\kappa^2\zeta'}{\omega k_zab}\left[a + b + \frac{k_z^2}{\kappa^4}(k_x^2a + k_y^2b)\right].$$

† We everywhere omit the factor $\exp(ik_zz - i\omega t)$.
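
The eigenvalues found in this solution are simple to enumerate. The sketch below lists the lowest values of $\kappa$ for a rectangular cross-section, applying the index rules stated above (both indices at least 1 for E waves, at most one index zero for H waves); the function name `rectangular_modes` and the cut-off `n_max` are hypothetical.

```python
import math

def rectangular_modes(a, b, n_max=3):
    """Return a sorted list of (kappa, wave_type, n1, n2) for a rectangular
    waveguide with sides a and b; kappa = pi * sqrt(n1^2/a^2 + n2^2/b^2)."""
    modes = []
    for n1 in range(n_max + 1):
        for n2 in range(n_max + 1):
            if n1 == 0 and n2 == 0:
                continue  # H_z constant: no wave of this kind
            kappa = math.pi * math.hypot(n1 / a, n2 / b)
            modes.append((kappa, "H", n1, n2))
            if n1 >= 1 and n2 >= 1:
                modes.append((kappa, "E", n1, n2))
    return sorted(modes)

for kappa, wave_type, n1, n2 in rectangular_modes(a=2.0, b=1.0)[:5]:
    print(f"{wave_type}{n1}{n2}: kappa = {kappa:.4f}")
```

For $a > b$ the first entry is the $H_{10}$ wave with $\kappa = \pi/a$, and every E wave shares its eigenvalue with the H wave having the same indices.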
Problem 2. The same as Problem 1, but for a waveguide whose cross-section is a circle with radius a.
Solution. Solving the wave equation in polar coordinates $r$, $\phi$, we have for E waves

$$E_z = \text{constant}\times J_n(\kappa r)\,\sin n\phi \ \text{ or } \ \cos n\phi,$$

with the condition $J_n(\kappa a) = 0$, which gives the values of $\kappa$. In H waves the value of $H_z$ is given by the same
formula, but $\kappa$ is determined by the condition $J_n'(\kappa a) = 0$. The smallest value of $\kappa$ occurs for the $H_1$ wave, and is
$\kappa_{\min} = 1.84/a$.

The damping coefficient is calculated from formulae (91.12)-(91.14). For E waves it is $\alpha = \omega\zeta'/cak_z$, and for H
waves

$$\alpha = \frac{c\kappa^2\zeta'}{\omega k_za}\left[1 + \frac{n^2\omega^2}{c^2\kappa^2(\kappa^2a^2 - n^2)}\right].$$






§92. The scattering of electromagnetic waves by small particles 

Let us consider the scattering of electromagnetic waves by macroscopic particles whose
dimensions are small compared with the wavelength $\lambda \sim c/\omega$ of the wave undergoing
scattering (Rayleigh, 1871). When this condition holds, the electromagnetic field near the
particle may be supposed uniform. Being in a uniform field periodic in time, each particle
acquires definite electric and magnetic moments $\mathscr{P}$ and $\mathscr{M}$, whose dependence on time is
given by factors of the form $e^{-i\omega t}$. The scattered wave can be described as being the result of
radiation by these variable moments. At distances $R$ from the particle which are large
compared with $\lambda$ (the wave region), the fields in the scattered wave are given by

$$\mathbf{H}' = \frac{\omega^2}{c^2R}\{\mathbf{n}\times\mathscr{P} + \mathbf{n}\times(\mathscr{M}\times\mathbf{n})\}, \qquad \mathbf{E}' = \mathbf{H}'\times\mathbf{n} \tag{92.1}$$

(see Fields, §71), where the unit vector n gives the direction of scattering, and the values of
$\mathscr{P}$ and $\mathscr{M}$ must be taken at the time $t - R/c$. We denote the fields in the scattered wave by
primed letters and those in the incident wave by unprimed letters. The (time) averaged
intensity of radiation scattered into a solid angle $do$ is $dI = \tfrac{1}{2}c|\mathbf{H}'|^2R^2\,do/4\pi$; dividing by
the energy flux density in the incident wave $c|\mathbf{H}|^2/8\pi = c|\mathbf{E}|^2/8\pi$, we obtain the scattering
cross-section.

The calculation of $\boldsymbol{\mathscr{P}}$ and $\boldsymbol{\mathscr{M}}$ is particularly simple if the dimensions
of the particle are small in comparison not only with $\lambda$ but also with the "wavelength" $\delta$
corresponding to the frequency $\omega$ in the material of the particle. In this case we can calculate the
polarizability of the particle from the formulae for an external uniform static field, the only difference
being, of course, that the values taken for $\varepsilon$ and $\mu$ are those corresponding to the given
frequency and not the static values. If, as usually happens, $\mu$ is close to unity, the magnetic dipole
term in formula (92.1) may be omitted.

For a spherical particle with radius $a$ and volume $V$ we have (see (8.10))

$$\boldsymbol{\mathscr{P}} = V\alpha\mathbf{E}, \qquad \alpha = \frac{3(\varepsilon - 1)}{4\pi(\varepsilon + 2)}, \tag{92.2}$$

and the scattering cross-section is

$$d\sigma = (\omega/c)^4|\alpha|^2V^2\sin^2\theta\,do, \tag{92.3}$$

where $\theta$ is the angle between the scattering direction $\mathbf{n}$ and the direction of the electric field
$\mathbf{E}$ in the linearly polarized incident wave. The total cross-section is

$$\sigma = 8\pi|\alpha|^2\omega^4V^2/3c^4. \tag{92.4}$$

The frequency dependence of the cross-section is determined by the factor $\omega^4$ and by the
polarizability. At frequencies so low that $\alpha$ shows no dispersion, the scattering is
proportional to $\omega^4$. It may be noted also that the cross-section is proportional to the square
of the volume of the particle.
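
The relation between the differential cross-section $d\sigma = (\omega/c)^4|\alpha|^2V^2\sin^2\theta\,do$ and the total cross-section $\sigma = 8\pi|\alpha|^2\omega^4V^2/3c^4$ can be checked numerically. The sketch below is only an illustration: the numerical values (Gaussian units, a 10 nm sphere with $\varepsilon = 2.25$) and the function names are assumptions chosen for the example.

```python
import math

def alpha_sphere(eps):
    """Polarizability per unit volume of a small sphere: 3(eps - 1) / 4 pi (eps + 2)."""
    return 3 * (eps - 1) / (4 * math.pi * (eps + 2))

def dsigma(theta, omega, c, alpha, V):
    """Differential cross-section per unit solid angle for linearly polarized light."""
    return (omega / c) ** 4 * abs(alpha) ** 2 * V ** 2 * math.sin(theta) ** 2

def sigma_total(omega, c, alpha, V):
    """Total cross-section 8 pi |alpha|^2 omega^4 V^2 / 3 c^4."""
    return 8 * math.pi * abs(alpha) ** 2 * omega ** 4 * V ** 2 / (3 * c ** 4)

def integrate_over_sphere(f, n=2000):
    """Integrate an axially symmetric f(theta) over all solid angles (midpoint rule)."""
    h = math.pi / n
    return sum(f((i + 0.5) * h) * 2 * math.pi * math.sin((i + 0.5) * h) for i in range(n)) * h

c = 2.998e10                 # cm/s
omega = 3.0e15               # rad/s, illustrative optical frequency
radius = 1.0e-6              # cm, illustrative
V = 4 * math.pi * radius ** 3 / 3
alpha = alpha_sphere(2.25)   # illustrative permittivity

integrated = integrate_over_sphere(lambda th: dsigma(th, omega, c, alpha, V))
print(integrated, sigma_total(omega, c, alpha, V))
```

Doubling `omega` at fixed `alpha` multiplies the result by 16, and doubling the radius multiplies it by 64, which is the $\omega^4 V^2$ dependence stated in the text.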

If the incident wave is unpolarized (natural light) then the differential cross-section must
be obtained by averaging (92.3) over all directions of the vector $\mathbf{E}$ in a plane perpendicular
to the direction of propagation of the incident wave (i.e. perpendicular to its wave vector $\mathbf{k}$).
Denoting by $\vartheta$ and $\phi$ the polar angle and the azimuth of the direction of $\mathbf{n}$ relative to
$\mathbf{k}$ ($\phi$ being measured from the plane of $\mathbf{k}$ and $\mathbf{E}$), we have
$\cos\theta = \sin\vartheta\cos\phi$ (Fig. 47), so that

$$d\sigma = (\omega/c)^4|\alpha|^2V^2(1 - \sin^2\vartheta\cos^2\phi)\,do. \tag{92.5}$$

On averaging over $\phi$, we obtain the following formula for the cross-section for scattering
of an unpolarized wave:

$$d\sigma = \tfrac12(\omega/c)^4|\alpha|^2V^2(1 + \cos^2\vartheta)\,do, \tag{92.6}$$

where $\vartheta$ is the angle between the directions of incidence and scattering. The angular
distribution (92.6) is symmetrical about the plane $\vartheta = \tfrac12\pi$, i.e. the forward scattering
and the backward scattering are the same.

We can also determine the degree of depolarization of the scattered light. To do so, we notice that,
for a given direction of scattering $\mathbf{n}$, the direction of the electric field $\mathbf{E}'$ in the
scattered wave lies in the plane of $\mathbf{n}$ and $\mathbf{k}$ or is perpendicular to that plane, according as
the azimuth $\phi$ of the plane of $\mathbf{n}$ and $\mathbf{k}$ (the plane of scattering) is $0$ or $\tfrac12\pi$.
Let $I_\parallel$ and $I_\perp$ be the intensities of scattered light having these two polarizations. The degree
of depolarization is defined as the ratio of the smaller to the larger of these quantities. By (92.5) we have

$$I_\parallel/I_\perp = \cos^2\vartheta. \tag{92.7}$$
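
The averaging over the azimuth and the depolarization ratio can be reproduced in a few lines. The sketch below assumes the angular factor $1 - \sin^2\vartheta\cos^2\phi$ of (92.5); the function names and the sample angle are illustrative.

```python
import math

def angular_factor(theta_s, phi):
    """Angular factor 1 - sin^2(theta_s) cos^2(phi) for a given scattering angle and azimuth."""
    return 1 - math.sin(theta_s) ** 2 * math.cos(phi) ** 2

def phi_average(theta_s, n=720):
    """Average of the angular factor over the azimuth (midpoint rule on a full period)."""
    h = 2 * math.pi / n
    return sum(angular_factor(theta_s, (i + 0.5) * h) for i in range(n)) / n

def depolarization(theta_s):
    """I_parallel / I_perp: azimuth 0 (E in the scattering plane) over azimuth pi/2."""
    return angular_factor(theta_s, 0.0) / angular_factor(theta_s, math.pi / 2)

theta_s = 1.0   # illustrative scattering angle in radians
print(phi_average(theta_s), 0.5 * (1 + math.cos(theta_s) ** 2))
print(depolarization(theta_s), math.cos(theta_s) ** 2)
```

Each printed pair should agree: the first reproduces the factor $\tfrac12(1 + \cos^2\vartheta)$, the second the ratio $\cos^2\vartheta$, which vanishes for scattering at right angles.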

If the scattering particle has a large permittivity, then

$$\delta \sim c/\omega\sqrt{|\varepsilon|} \ll \lambda.$$

The dimensions of the particle may then be small compared with $\lambda$ but not small compared
with $\delta$. In the first approximation with respect to $1/\varepsilon$, the electric moment of the particle
may be calculated as simply the moment of a conductor ($\varepsilon \to \infty$) in a uniform static
external field. In calculating the magnetic moment, however, the induced currents in the
particle are of importance, and the problem cannot be taken as static; instead, we must seek
a solution of equation (83.2) (with $\mu = 1$):

$$\triangle\mathbf{H} + \varepsilon\omega^2\mathbf{H}/c^2 = 0 \tag{92.8}$$

which becomes the field of the incident wave far from the particle. The magnetic and
electric moments are of the same order of magnitude, and both terms in formula (92.1)
must be retained. The angular distribution and amount of scattering are quite different
from those discussed above (see Problem 2).


PROBLEMS 

Problem 1. Linearly polarized light is scattered by randomly oriented small particles whose electric 
polarizability tensor has three different principal values. Determine the depolarizing factor for the scattered light. 

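Solution. Neglecting, as above, the magnetic moment, we have from (92.1)

$$\mathbf{E}' = (\omega^2/c^2R)(\mathbf{n}\times\boldsymbol{\mathscr{P}})\times\mathbf{n}.$$

The required depolarizing factor is given by the ratio of the principal values of the two-dimensional tensor
$I_{\alpha\beta} = \langle E'_\alpha E'^*_\beta\rangle$, where the brackets denote an averaging over orientations of
the scattering particle for a given direction of scattering $\mathbf{n}$, and the suffixes $\alpha$ and $\beta$ take two
values in the plane perpendicular to $\mathbf{n}$ (see Fields, §50). It is more convenient, however, to average the
three-dimensional tensor $\langle\mathscr{P}_i\mathscr{P}_k^*\rangle$ and then project it on the plane perpendicular to
$\mathbf{n}$; these components of the tensor $\langle\mathscr{P}_i\mathscr{P}_k^*\rangle$ are proportional to the corresponding
components $I_{\alpha\beta}$. Substituting $\mathscr{P}_i = \alpha_{ik}E_k$, we have
$\langle\mathscr{P}_i\mathscr{P}_k^*\rangle = \langle\alpha_{il}\alpha_{km}^*\rangle E_lE_m^*$.
In effecting the averaging we use the formula

$$\langle\alpha_{il}\alpha_{km}^*\rangle = A\delta_{il}\delta_{km} + B(\delta_{ik}\delta_{lm} + \delta_{im}\delta_{kl}).$$

This is the most general tensor of rank four which is symmetrical in $i$, $l$ and $k$, $m$ and contains only scalar
constants. The constants are determined from two equations obtained by contracting the tensor, firstly with
respect to $i$, $l$ and $k$, $m$, secondly with respect to $i$, $k$ and $l$, $m$. They are

$$A = \tfrac{1}{15}(2\alpha_{ii}\alpha_{kk}^* - \alpha_{ik}\alpha_{ik}^*), \qquad B = \tfrac{1}{30}(3\alpha_{ik}\alpha_{ik}^* - \alpha_{ii}\alpha_{kk}^*).$$

For a linearly polarized wave, the field amplitude $\mathbf{E}$ (we omit the time factor $e^{-i\omega t}$) can always be
defined so as to be real. Then we have

$$\langle\mathscr{P}_i\mathscr{P}_k^*\rangle = (A + B)E_iE_k + B\delta_{ik}E^2. \tag{1}$$

Let the $z$-axis be in the direction of $\mathbf{n}$, and the $xz$-plane contain the directions of $\mathbf{n}$ and
$\mathbf{E}$; these axes are principal axes of the tensor $I_{\alpha\beta}$. Taking the appropriate components of the
tensor (1), we find the depolarizing factor $I_y/I_x = B/[(A + B)\sin^2\theta + B]$, where $\theta$ is the angle
between $\mathbf{E}$ and $\mathbf{n}$.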

Problem 2. Determine the cross-section for scattering by a sphere with radius $a$, for which $\varepsilon$ is
large; it is assumed that $a \sim \delta$.

Solution. The problem of calculating the magnetic moment acquired by a sphere with given $\varepsilon$ (and
$\mu = 1$) in a variable magnetic field $\mathbf{H}$ is the same as that solved in §59, Problem 1, except that $k$ in
the formulae derived there must be replaced by $\omega\sqrt{\varepsilon}/c$. Thus
$\boldsymbol{\mathscr{M}} = -\gamma a^3\mathbf{H}$, where

$$\gamma = \frac12\left[1 - \frac{3}{(ka)^2} + \frac{3}{ka}\cot ka\right].$$

When $|ka| \ll 1$, $\gamma = -(ka)^2/30$; when $|ka| \gg 1$, $\cot ka \to -i$ and $\gamma = \tfrac12(1 - 3i/ka)$.

The electric moment can be calculated, in the first approximation with respect to $1/\varepsilon$, as simply the
moment of a conducting ($\varepsilon \to \infty$) sphere in a uniform static electric field:
$\boldsymbol{\mathscr{P}} = a^3\mathbf{E}$.

Taking into account the fact that $\mathbf{E}$ and $\mathbf{H}$ are perpendicular, we have after a simple calculation,
using (92.1), the following formula for the scattering cross-section:

$$d\sigma = (a^6\omega^4/c^4)\{|\gamma|^2\cos^2\phi + \sin^2\phi - (\gamma + \gamma^*)\cos\vartheta + \cos^2\vartheta\,(\cos^2\phi + |\gamma|^2\sin^2\phi)\}\,do,$$

where $\phi$ and $\vartheta$ are the angles shown in Fig. 47. In scattering of unpolarized light we have

$$d\sigma = (a^6\omega^4/c^4)\{\tfrac12[1 + |\gamma|^2][1 + \cos^2\vartheta] - (\gamma + \gamma^*)\cos\vartheta\}\,do,$$

and the degree of depolarization of the scattered light is
$I_\parallel/I_\perp = |(\gamma - \cos\vartheta)/(1 - \gamma\cos\vartheta)|^2$. The total scattering
cross-section is $\sigma = 8\pi a^6\omega^4(1 + |\gamma|^2)/3c^4$.

In the limit $ka \to \infty$ (i.e. when $\lambda \gg a \gg \delta$) we have $\gamma = \tfrac12$, corresponding to
scattering by a perfectly reflecting sphere into which neither the electric nor the magnetic field penetrates.
The differential scattering cross-section is

$$d\sigma = a^6(\omega/c)^4\,[\tfrac58(1 + \cos^2\vartheta) - \cos\vartheta]\,do.$$

Note that the angular distribution is highly asymmetrical about the plane $\vartheta = \tfrac12\pi$; the scattering is
mainly backward, the ratio of forward and backward scattered intensities being $1:9$.
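
The limiting forms of $\gamma$ and the forward-to-backward ratio can be verified numerically. The sketch below assumes the expression $\gamma = \tfrac12[1 - 3/(ka)^2 + (3/ka)\cot ka]$, which reproduces both limits quoted in the solution, and the unpolarized angular factor $\tfrac12(1+|\gamma|^2)(1+\cos^2\vartheta) - (\gamma+\gamma^*)\cos\vartheta$; the function names and sample arguments are illustrative.

```python
import cmath
import math

def gamma_sphere(ka):
    """gamma = (1/2) [1 - 3/(ka)^2 + (3/ka) cot(ka)] for real or complex ka."""
    ka = complex(ka)
    cot = cmath.cos(ka) / cmath.sin(ka)
    return 0.5 * (1 - 3 / ka ** 2 + 3 * cot / ka)

def angular_unpolarized(gamma, theta_s):
    """Braces of the unpolarized cross-section, in units of a^6 omega^4 / c^4."""
    gamma = complex(gamma)
    return (0.5 * (1 + abs(gamma) ** 2) * (1 + math.cos(theta_s) ** 2)
            - 2 * gamma.real * math.cos(theta_s))

small = gamma_sphere(0.1)        # |ka| << 1: expect about -(ka)^2 / 30
large = gamma_sphere(40 + 40j)   # |ka| >> 1 with Im k > 0: expect about (1 - 3i/ka) / 2
ratio = angular_unpolarized(0.5, 0.0) / angular_unpolarized(0.5, math.pi)
print(small, large, ratio)
```

With $\gamma = \tfrac12$ the last number is the forward-to-backward intensity ratio of a perfectly reflecting sphere.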


§93. The absorption of electromagnetic waves by small particles 

The scattering of electromagnetic waves by particles is accompanied by absorption. The
absorption cross-section is given by the ratio of the mean energy $Q$ dissipated in a particle
per unit time to the incident energy flux density. To calculate $Q$ we can use the formula

$$Q = -\overline{\boldsymbol{\mathscr{P}}\cdot\dot{\boldsymbol{\mathfrak{E}}}} - \overline{\boldsymbol{\mathscr{M}}\cdot\dot{\boldsymbol{\mathfrak{H}}}},$$

where $\boldsymbol{\mathscr{P}}$ and $\boldsymbol{\mathscr{M}}$ are the total electric and magnetic moments of the particle,
and the external fields $\boldsymbol{\mathfrak{E}}$ and $\boldsymbol{\mathfrak{H}}$ are replaced by the fields $\mathbf{E}$
and $\mathbf{H}$ in the wave undergoing scattering.

Using the complex representation of quantities, we can write (see the last footnote to §59)

$$Q = -\tfrac12\,\mathrm{re}\,(\boldsymbol{\mathscr{P}}\cdot\dot{\mathbf{E}}^* + \boldsymbol{\mathscr{M}}\cdot\dot{\mathbf{H}}^*) = \tfrac12\omega V(\alpha_e'' + \alpha_m'')|\mathbf{E}|^2,$$

where $V\alpha_e$ and $V\alpha_m$ are the electric and magnetic polarizabilities of the particle. Dividing by the
incident energy flux, we obtain

$$\sigma = 4\pi\omega V(\alpha_e'' + \alpha_m'')/c. \tag{93.2}$$

Let us apply this formula to absorption by a sphere with radius $a \ll \lambda$, assuming it non-magnetic
($\mu = 1$). The nature of the absorption depends considerably on the magnitude of $\varepsilon$.

If $|\varepsilon|$ is not large, then we have both $a \ll \lambda$ and $a \ll \delta$. In this case the magnetic
polarizability may be neglected in comparison with the electric polarizability. With the latter given by
(92.2), we have

$$\sigma = \frac{12\pi\omega a^3}{c}\,\frac{\varepsilon''}{(\varepsilon' + 2)^2 + \varepsilon''^2}. \tag{93.3}$$

If, on the other hand, $|\varepsilon| \gg 1$, the electric part of the absorption becomes small, and the
magnetic absorption may be important even if we still have $\delta \gg a$. When this last condition
holds (i.e. $|ka| \ll 1$), the magnetic polarizability is $\alpha_m = (ka)^2/40\pi = a^2\omega^2\varepsilon/40\pi c^2$
(see §92, Problem 2) and the absorption cross-section is

$$\sigma = \frac{12\pi\omega a^3\varepsilon''}{c}\left\{\frac{1}{|\varepsilon|^2} + \frac{\omega^2a^2}{90c^2}\right\}. \tag{93.4}$$

When $\varepsilon$ increases further, the electric part of the absorption becomes small compared
with the magnetic part. In the limit $\delta \ll a$ (i.e. $|ka| \gg 1$) we have
$\alpha_m = -3/8\pi + 9i/8\pi ka = -3/8\pi + 9ic\zeta/8\pi\omega a$, where $\zeta = 1/\sqrt{\varepsilon}$ is the surface
impedance of the sphere. Hence

$$\sigma = 6\pi a^2\zeta'. \tag{93.5}$$
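
The closed forms for the absorption cross-section can be checked against the general expression $\sigma = 4\pi\omega V(\alpha_e'' + \alpha_m'')/c$. The sketch below assumes $\alpha_e = 3(\varepsilon-1)/4\pi(\varepsilon+2)$ for the electric part and $\alpha_m = a^2\omega^2\varepsilon/40\pi c^2$ for the magnetic part at $|ka| \ll 1$; the function names and the numerical values (Gaussian units) are illustrative choices.

```python
import math

def alpha_sphere(eps):
    """Electric polarizability per unit volume of a small sphere."""
    return 3 * (eps - 1) / (4 * math.pi * (eps + 2))

def sigma_abs(omega, c, V, alpha_e, alpha_m=0j):
    """General absorption cross-section 4 pi omega V (Im alpha_e + Im alpha_m) / c."""
    return 4 * math.pi * omega * V * (complex(alpha_e).imag + complex(alpha_m).imag) / c

def sigma_electric(omega, c, a, eps):
    """Closed form for the electric absorption of a small non-magnetic sphere."""
    eps = complex(eps)
    return 12 * math.pi * omega * a ** 3 * eps.imag / (c * ((eps.real + 2) ** 2 + eps.imag ** 2))

def sigma_magnetic(omega, c, a, eps):
    """Magnetic term of the large-permittivity formula: (12 pi omega a^3 eps''/c)(omega a)^2 / 90 c^2."""
    return 12 * math.pi * omega * a ** 3 * complex(eps).imag / c * (omega * a) ** 2 / (90 * c ** 2)

c, omega, a = 2.998e10, 1.0e12, 1.0e-4     # illustrative values with omega * a / c << 1
V = 4 * math.pi * a ** 3 / 3
eps = 2.0 + 0.5j
alpha_m = a ** 2 * omega ** 2 * eps / (40 * math.pi * c ** 2)

print(sigma_abs(omega, c, V, alpha_sphere(eps)), sigma_electric(omega, c, a, eps))
print(sigma_abs(omega, c, V, 0j, alpha_m), sigma_magnetic(omega, c, a, eps))
```

Each printed pair should coincide, which confirms the numerical coefficients $12\pi$ and $1/90$.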


It may be noticed that this formula could have been obtained more directly, without using
the general expression for the magnetic polarizability $\alpha_m(\omega)$ of the sphere. When $\zeta$ is small,
the energy dissipation $Q$ can be calculated by integrating the mean Poynting vector (87.4)
over the surface of the sphere, the distribution of the magnetic field over the surface being
given by the solution (54.3) of the problem of a superconducting ($\zeta = 0$) sphere in a
uniform magnetic field.

Knowing the absorption cross-section of the sphere, we can immediately determine the
intensity of the thermal radiation emitted from the sphere. According to Kirchhoff's law
(see SP 1, §63), the intensity $dI$ in a frequency range $d\omega$ is given in terms of $\sigma(\omega)$ by

$$dI = 4\pi c\,\sigma(\omega)e_0(\omega)\,d\omega, \tag{93.6}$$

where $e_0(\omega) = \hbar\omega^3/4\pi^3c^3[\exp(\hbar\omega/T) - 1]$ is the spectral density of black-body radiation
per unit volume and unit solid angle.


§94. Diffraction by a wedge 

The ordinary approximate theory of diffraction (see Fields, §§59-61) is based on the 
assumption that the deviations from geometrical optics are small. It is thereby assumed, 
firstly, that all dimensions are large compared with the wavelength; this applies both to 
dimensions of bodies (screens) or apertures and to distances of bodies from the points of 
emission and observation of the light. Secondly, only small angles of diffraction are 
considered, i.e. the distribution of light is examined only in directions close to the edge of 
the geometrical shadow. Under these conditions, the actual optical properties of the 
substances involved are of no importance; all that matters is that they are opaque. 

If these conditions are not fulfilled, the solution of the diffraction problem requires an 
exact solution of the wave equation, taking into account the appropriate boundary 
conditions on the surfaces of the bodies, which depend on their properties. The finding of 
such solutions offers exceptional mathematical difficulties, and has been effected for only a 
small number of problems. A simplifying assumption is usually made concerning the 
properties of the body at which the diffraction occurs, namely that it is perfectly 
conducting, and therefore perfectly reflecting. 

The following remark may be made here. It might seem reasonable to solve the
diffraction problem on the assumption that the surface of the body is "black", i.e.
completely absorbs light incident on it. In reality, however, such an assumption concerning
the body in stating the exact problem of diffraction would involve a contradiction. The
reason is that, if the substance of the body is strongly absorbing, the coefficient of reflection
is not small but, on the contrary, almost unity (see §87). Hence a reflection coefficient close
to zero implies a weakly absorbing substance and a thickness of the body which is large
compared with the wavelength. In the exact theory of diffraction, parts of the surface of the
body at a distance from its edge of the order of the wavelength are necessarily of
importance; but the thickness of the body near its edge is always small, so that the
assumption that it is "black" is certainly untenable.

Considerable theoretical interest attaches to the exact solution (A. Sommerfeld, 1894) of 
the problem of diffraction at the edge of a perfectly conducting wedge bounded by two 
intersecting semi-infinite planes. A complete exposition of the very complex mathematical 
theory, which involves the use of special devices, is beyond the scope of this book. Here we 
shall give, for reference, the final results.

We take the edge of the wedge as the $z$-axis in a system of cylindrical polar coordinates
$r$, $\phi$, $z$. The front surface (OA in Fig. 48) corresponds to $\phi = 0$, and the rear surface (OB)
to $\phi = \gamma$, where $2\pi - \gamma$ is the angle of the wedge, the region outside it being
$0 \le \phi \le \gamma$. Let a plane monochromatic wave with unit amplitude be incident in the $r\phi$-plane on
the front surface at an angle $\phi_0$ to the surface; by symmetry, it is sufficient to consider angles
$\phi_0 < \tfrac12\gamma$. We shall distinguish two independent modes of polarization of the incident wave,
and therefore of the diffracted wave also: the edge of the wedge (the $z$-axis) may be parallel
to either $\mathbf{E}$ or $\mathbf{H}$. The letter $u$ will denote $E_z$ and $H_z$ respectively.

The electromagnetic field is then given in all space by the formula (the time factor $e^{-i\omega t}$
being everywhere omitted)

$$u(r,\phi) = v(r,\phi - \phi_0) \mp v(r,\phi + \phi_0), \tag{94.1}$$

where the upper and lower signs correspond to the polarizations with $\mathbf{E}$ and $\mathbf{H}$ respectively
in the $z$-direction, and the function $v(r,\psi)$ is given by the complex integral

(94.2)

with $k = \omega/c$. The path of integration $C$ in the $\zeta$-plane consists of the two loops $C_1$ and $C_2$
shown in Fig. 49. The ends of these loops are at infinity in parts of the $\zeta$-plane (shaded in
Fig. 49) where $\mathrm{im}\cos\zeta < 0$, and so the factor $e^{-ikr\cos\zeta}$ tends to zero at infinity. The
integrand in (94.2) has poles on the real $\zeta$-axis at the points $\zeta = -\psi + 2n\gamma$, where $n$ is any
integer. The integration may be taken along a path $D = D_1 + D_2$ (Fig. 49) instead
of $C$, adding to the integral the residues of the integrand at the poles, if any, in the range
$-\pi \le \zeta \le \pi$. We write $v$ as

$$v(r,\psi) = v_0(r,\psi) + v_d(r,\psi), \tag{94.3}$$

where $v_d$ is the integral (94.2) taken along the path $D$, and $v_0$ is the contribution from the
residues at these poles. Each pole gives rise to a term $\exp[-ikr\cos(\psi - 2n\gamma)]$ in $v_0$, which
represents either the incident wave or one of the waves reflected from the surface of the
wedge in accordance with the laws of geometrical optics. The function $v_d$ represents the
diffractive distortion of the wave. The field at distances from the edge of the wedge large
compared with the wavelength is of the greatest interest. When $kr \gg 1$, the asymptotic
formula†

$$v_d(r,\psi) = \frac{\pi}{\gamma}\,\frac{e^{i(kr + \frac14\pi)}}{\sqrt{2\pi kr}}\,\frac{\sin(\pi^2/\gamma)}{\cos(\pi^2/\gamma) - \cos(\pi\psi/\gamma)} \tag{94.4}$$

holds, provided that the angle $\psi$ satisfies the condition

$$[\cos(\pi^2/\gamma) - \cos(\pi\psi/\gamma)]^2 \gg 1/kr. \tag{94.5}$$

The dependence on $r$ of the function $v_d$, and therefore of the field
$u_d(r,\phi) = v_d(r,\phi - \phi_0) \mp v_d(r,\phi + \phi_0)$, is given by a factor $e^{ikr}/\sqrt{r}$, i.e. this field
resembles a cylindrical wave emitted by the edge of the wedge.

† See W. Pauli, Physical Review 54, 924, 1938.

In the form given above, (94.1)-(94.5) are valid for any angles $\gamma$ and $\phi_0$. The more
detailed discussion of these formulae will be effected on the assumption (for definiteness)
that the angles $\phi_0$ and $\gamma$ are so related ($\gamma > \pi + \phi_0$) that, in geometrical optics, two
boundaries are formed: the boundary Ob of the complete shadow (region III in Fig. 48)
and the boundary Oa of the "shadow" of the wave reflected from the surface OA. In Fig. 48,
$\phi_0 < \tfrac12\pi$. If $\phi_0 > \tfrac12\pi$, the boundary Oa lies to the right of the direction of the incident wave.
If $\gamma < \pi + \phi_0$, there is no region of complete shadow, and reflection (single or multiple)
takes place from both sides of the wedge.

In regions I, II, III the function $u_0(r,\phi) = v_0(r,\phi - \phi_0) \mp v_0(r,\phi + \phi_0)$ has the
following forms:

$$\begin{aligned}
\text{Region I:}\quad & u_0 = e^{-ikr\cos(\phi - \phi_0)} \mp e^{-ikr\cos(\phi + \phi_0)},\\
\text{Region II:}\quad & u_0 = e^{-ikr\cos(\phi - \phi_0)},\\
\text{Region III:}\quad & u_0 = 0.
\end{aligned} \tag{94.6}$$

These expressions, which do not vanish as $kr \to \infty$, describe the incident (region II) or
incident and reflected (region I) waves, undistorted by diffraction. The diffractive
distortion of the field is given by formula (94.4), but the condition (94.5) ceases to hold
when $\psi$ approaches $\pi$ and the difference $|\psi - \pi|$ is no longer large compared with $1/\sqrt{kr}$.

The values $\psi = \pi$ correspond to the geometrical boundaries of the shadow; for
$\psi = \phi - \phi_0$ we have the boundary of the complete shadow, and for $\psi = \phi + \phi_0$ that of the
shadow of the reflected wave. In the immediate neighbourhood of these values a different
asymptotic expression must be used, which is valid if the inequality $|\psi - \pi| \ll 1$ holds. This
condition, together with $kr \gg 1$, ensures the validity of the usual approximate theory of
Fresnel diffraction. Accordingly we have near the boundary Ob of the complete shadow the
asymptotic expression

$$u(r,\phi) = e^{-ikr\cos(\phi - \phi_0)}\,\frac{1 - i}{\sqrt{2\pi}}\int_{-\infty}^{w} e^{i\eta^2}\,d\eta, \qquad w = -(\phi - \phi_0 - \pi)\sqrt{\tfrac12 kr}. \tag{94.7}$$

Similarly, near the boundary Oa of the shadow of the reflected wave

$$u(r,\phi) = e^{-ikr\cos(\phi - \phi_0)} \mp e^{-ikr\cos(\phi + \phi_0)}\,\frac{1 - i}{\sqrt{2\pi}}\int_{-\infty}^{w} e^{i\eta^2}\,d\eta, \qquad w = -(\phi + \phi_0 - \pi)\sqrt{\tfrac12 kr}. \tag{94.8}$$

In this approximation the diffraction pattern is independent of the angle of the wedge and
of the direction of polarization of the wave.

The ranges of applicability of formulae (94.4) and (94.7), (94.8) partly overlap. For
example, near the boundary of the complete shadow the common range of applicability is
given by

$$1 \gg |\phi - \phi_0 - \pi| \gg 1/\sqrt{kr}.$$

In this range

$$u(r,\phi) = u_0(r,\phi) + \frac{e^{i(kr + \frac14\pi)}}{\sqrt{2\pi kr}}\,\frac{1}{\phi - \phi_0 - \pi}, \tag{94.9}$$

with $u_0$ given by (94.6). This expression can be obtained from (94.7) by using the asymptotic
formulae for the Fresnel integral with large $|w|$:

$$\int_{-\infty}^{w} e^{i\eta^2}\,d\eta = (1 + i)\sqrt{\tfrac12\pi} + e^{iw^2}/2iw \quad \text{for } w > 0,$$

$$\int_{-\infty}^{w} e^{i\eta^2}\,d\eta = e^{iw^2}/2iw \quad \text{for } w < 0.$$
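
The asymptotic formulae for the Fresnel integral $F(w) = \int_{-\infty}^{w} e^{i\eta^2}\,d\eta$ can be compared with a direct numerical evaluation. The sketch below is an illustration under stated assumptions: it uses the exact value $\tfrac12(1+i)\sqrt{\pi/2}$ for the part of the integral from $-\infty$ to $0$, Simpson's rule for the rest, and the normalizing factor $(1-i)/\sqrt{2\pi}$, which makes the field tend to the undisturbed wave for $w \to +\infty$. The function names are hypothetical.

```python
import cmath
import math

def fresnel(w, n=20000):
    """F(w) = int_{-inf}^{w} exp(i eta^2) d eta; n must be even (Simpson's rule on [0, w])."""
    half_line = 0.5 * (1 + 1j) * math.sqrt(math.pi / 2)   # exact integral over (-inf, 0)
    if w == 0:
        return half_line
    h = w / n                                              # negative step when w < 0
    total = 1 + cmath.exp(1j * w * w)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * cmath.exp(1j * (i * h) ** 2)
    return half_line + total * h / 3

def fresnel_asymptotic(w):
    """Leading asymptotic form for large |w|."""
    tail = cmath.exp(1j * w * w) / (2j * w)
    return (1 + 1j) * math.sqrt(math.pi / 2) + tail if w > 0 else tail

def edge_factor(w):
    """Field relative to the undisturbed wave: (1 - i)/sqrt(2 pi) * F(w)."""
    return (1 - 1j) / math.sqrt(2 * math.pi) * fresnel(w)

print(abs(fresnel(5.0) - fresnel_asymptotic(5.0)))
print(abs(fresnel(-5.0) - fresnel_asymptotic(-5.0)))
print(abs(edge_factor(0.0)) ** 2)   # intensity on the geometrical shadow boundary
```

The two differences are of the order of the first neglected term, $1/4|w|^3$, and the last line gives the familiar result that the intensity exactly on the shadow boundary is one quarter of the incident intensity.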


§95. Diffraction by a plane screen 

The exact formula (94.2) for diffraction by a wedge can be brought to a comparatively
simple form in the particular case of diffraction by a half-plane ($\gamma = 2\pi$). The complex
integral in (94.2) can be reduced to a Fresnel integral:

$$v(r,\psi) = e^{-ikr\cos\psi}\,\frac{1 - i}{\sqrt{2\pi}}\int_{-\infty}^{w} e^{i\eta^2}\,d\eta, \qquad w = \sqrt{2kr}\,\cos\tfrac12\psi. \tag{95.1}$$

This formula holds for any values of $r$ and $\psi$. For $kr \gg 1$ and angles $|\psi - \pi| \gg 1/\sqrt{kr}$ the
asymptotic expression

$$v_d(r,\psi) = -\frac{e^{i(kr + \frac14\pi)}}{2\sqrt{2\pi kr}\,\cos\tfrac12\psi} \tag{95.2}$$

(formula (94.4) with $\gamma = 2\pi$) holds.
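
That the wedge formula reduces to the half-plane result for $\gamma = 2\pi$ can be confirmed numerically. The sketch below assumes the asymptotic wedge expression in the form $v_d = (\pi/\gamma)\,e^{i(kr+\pi/4)}(2\pi kr)^{-1/2}\sin(\pi^2/\gamma)/[\cos(\pi^2/\gamma) - \cos(\pi\psi/\gamma)]$ and the half-plane expression $v_d = -e^{i(kr+\pi/4)}/[2\sqrt{2\pi kr}\cos\tfrac12\psi]$; the function names and sample arguments are illustrative.

```python
import cmath
import math

def v_d_wedge(kr, psi, gamma):
    """Asymptotic diffracted wave for a wedge with exterior angle gamma."""
    s = math.pi ** 2 / gamma
    angular = math.sin(s) / (math.cos(s) - math.cos(math.pi * psi / gamma))
    return (math.pi / gamma) * cmath.exp(1j * (kr + math.pi / 4)) / math.sqrt(2 * math.pi * kr) * angular

def v_d_half_plane(kr, psi):
    """Asymptotic diffracted wave for a half-plane (gamma = 2 pi)."""
    return -cmath.exp(1j * (kr + math.pi / 4)) / (2 * math.sqrt(2 * math.pi * kr) * math.cos(psi / 2))

kr, psi = 100.0, 2.0
print(v_d_wedge(kr, psi, 2 * math.pi), v_d_half_plane(kr, psi))

# Close to psi = pi the angular factor tends to 1/(psi - pi) whatever the wedge angle.
delta = 0.01
near = v_d_wedge(kr, math.pi + delta, 1.5 * math.pi)
limit = cmath.exp(1j * (kr + math.pi / 4)) / (math.sqrt(2 * math.pi * kr) * delta)
print(near / limit)
```

The second check illustrates why, near the shadow boundary, the diffraction pattern does not depend on the angle of the wedge.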

Using formula (95.2), the solution of the problem of diffraction by a plane perfectly
conducting screen of any shape can be obtained in closed form. The only assumptions are
that the dimensions of the screen and the distance from it are large compared with the
wavelength, and that the angles of diffraction are moderately large (this region overlaps the
region of small angles in which the ordinary Fresnel-diffraction formulae are valid). The
result is in the form of an integral along the edge of the screen, analogous to the expression
of the diffraction field as an integral over a surface spanning the aperture in a screen, in the
ordinary approximate theory. We shall not pause to give the calculations here.

In the exact theory of diffraction by plane perfectly conducting screens, there is a
theorem (due to L. I. Mandel'shtam and M. A. Leontovich) in some ways analogous to
Babinet's theorem in the approximate theory.

Let us consider a plane screen with an aperture of any shape, and take the plane of the
screen as $z = 0$. Let an electromagnetic wave be incident from the side $z < 0$, and let $\mathbf{E}_0$, $\mathbf{H}_0$
be the total fields in the incident wave and the wave which would be reflected from the
screen if there were no aperture. We assume the field continued beyond the screen ($z > 0$).
Since $H_{0z} = 0$, $\mathbf{E}_{0t} = 0$ for $z = 0$ (by the boundary conditions at a perfectly conducting
surface), the values of $\mathbf{E}_0$ and $\mathbf{H}_0$ for $z > 0$ and $z < 0$ are related by

$$\begin{aligned}
E_{0z}(x,y,z) &= E_{0z}(x,y,-z), & \mathbf{E}_{0t}(x,y,z) &= -\mathbf{E}_{0t}(x,y,-z),\\
H_{0z}(x,y,z) &= -H_{0z}(x,y,-z), & \mathbf{H}_{0t}(x,y,z) &= \mathbf{H}_{0t}(x,y,-z).
\end{aligned} \tag{95.3}$$

Next, let $\mathbf{E}'$ and $\mathbf{H}'$ be the fields which would occur if a flat plate corresponding to the
aperture in size, shape and position, and having infinite magnetic permeability, were placed
in the field $\mathbf{E}_0$, $\mathbf{H}_0$. Then the solution of the diffraction problem for the aperture in the
screen is given by

$$\begin{aligned}
\mathbf{E} &= \tfrac12(\mathbf{E}_0 + \mathbf{E}'), & \mathbf{H} &= \tfrac12(\mathbf{H}_0 + \mathbf{H}') & \text{for } z < 0,\\
\mathbf{E} &= \tfrac12(\mathbf{E}_0 - \mathbf{E}'), & \mathbf{H} &= \tfrac12(\mathbf{H}_0 - \mathbf{H}') & \text{for } z > 0.
\end{aligned} \tag{95.4}$$

To show this, we notice that the fields $\mathbf{E}'$, $\mathbf{H}'$ have the same symmetry (expressed by
formulae (95.3)) as the fields $\mathbf{E}_0$, $\mathbf{H}_0$. They therefore satisfy on the plane $z = 0$ the
conditions

$$\mathbf{E}'_t = 0, \quad H'_z = 0 \quad \text{outside the aperture},$$

$$\mathbf{E}'_{t1} = -\mathbf{E}'_{t2}, \quad H'_{z1} = -H'_{z2} \quad \text{on the aperture},$$

the suffixes 1 and 2 corresponding to $z \to \pm 0$. They also satisfy the further conditions

$$E'_z = 0, \quad \mathbf{H}'_t = 0 \quad \text{on the aperture},$$

since the boundary conditions on the surface of a body with $\mu = \infty$ are obtained from
those for a perfectly conducting body ($\varepsilon = \infty$) by interchanging $\mathbf{E}$ and $\mathbf{H}$. Hence it is clear
that the fields (95.4) satisfy the necessary conditions $\mathbf{E}_t = 0$, $H_z = 0$ on the surface of the
screen ($z \to -0$) outside the aperture, and are continuous on the aperture. Finally, since $\mathbf{E}'$,
$\mathbf{H}'$ tend to $\mathbf{E}_0$, $\mathbf{H}_0$ at infinity, the fields (95.4) tend to $\mathbf{E}_0$, $\mathbf{H}_0$ as
$z \to -\infty$ and to zero as $z \to +\infty$. They therefore satisfy all the conditions of the problem.
This proves the theorem.

Thus the problem of diffraction by an aperture in a screen with $\varepsilon = \infty$ is equivalent to a
problem of diffraction by a complementary screen with $\mu = \infty$.

PROBLEMS 


Problem 1. A plane monochromatic wave is incident normally on a slit cut in a perfectly conducting plane
screen, the width $2a$ of the slit being large compared with the wavelength. Determine the distribution of
intensity beyond the slit, at large distances from it and for large angles of diffraction.

Solution. For $a \gg \lambda$, the diffraction field beyond the slit can be regarded as a superposition of fields
resulting from independent diffraction at each of the two edges of the slit and determined by means of the
asymptotic formula (95.2).† When the distances $AP = r_1$ and $BP = r_2$ from the edges of the slit to the point of
observation (Fig. 50) are large compared with $a$, we can put, in the factors $e^{ikr_1}$ and $e^{ikr_2}$,
$r_1 = r + a\sin\chi$ and $r_2 = r - a\sin\chi$, and elsewhere $r_1 \approx r_2 \approx r$; the angles between the
$z$-axis and $AP$, $OP$, $BP$ can all be taken as the angle of diffraction $\chi$. The result is

$$u = \frac{e^{i(kr - \frac14\pi)}}{\sqrt{2\pi kr}}\left\{\frac{\sin(ka\sin\chi)}{\sin\tfrac12\chi} \mp i\,\frac{\cos(ka\sin\chi)}{\cos\tfrac12\chi}\right\}.$$

Hence the intensity of light diffracted into an angle $d\chi$ is (relative to the total intensity of light incident
on the slit)

$$dI = \frac{1}{4\pi ka}\left\{\frac{\sin^2(ka\sin\chi)}{\sin^2\tfrac12\chi} + \frac{\cos^2(ka\sin\chi)}{\cos^2\tfrac12\chi}\right\}d\chi.$$

For small $\chi$ this expression becomes the formula for Fraunhofer diffraction by the slit:

$$dI = \frac{\sin^2(ka\chi)}{\pi ka\chi^2}\,d\chi.$$

† This assumption becomes invalid, however, for diffraction angles sufficiently close to $\tfrac12\pi$ (when
$\tfrac12\pi - \chi \lesssim 1/\sqrt{ka}$).
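
The passage to the Fraunhofer limit can be illustrated numerically. The sketch below assumes the relative intensity obtained by squaring the sum of the two edge waves and dividing by the flux incident on the slit of width $2a$, namely $dI/d\chi = [\sin^2(ka\sin\chi)/\sin^2\tfrac12\chi + \cos^2(ka\sin\chi)/\cos^2\tfrac12\chi]/4\pi ka$, and compares it with $\sin^2(ka\chi)/\pi ka\chi^2$; the function names and the value $ka = 50$ are illustrative.

```python
import math

def slit_intensity(chi, ka):
    """dI/dchi from the two edge waves, relative to the light incident on the slit."""
    s = math.sin(ka * math.sin(chi)) / math.sin(chi / 2)
    c = math.cos(ka * math.sin(chi)) / math.cos(chi / 2)
    return (s * s + c * c) / (4 * math.pi * ka)

def fraunhofer_intensity(chi, ka):
    """Fraunhofer pattern of a slit of width 2a, normalized to unit total intensity."""
    return math.sin(ka * chi) ** 2 / (math.pi * ka * chi ** 2)

ka = 50.0
for chi in (0.02, 0.05, 0.5):
    print(chi, slit_intensity(chi, ka), fraunhofer_intensity(chi, ka))
```

At small angles the two columns agree closely; at large angles the second term in the braces, absent from the Fraunhofer formula, is no longer negligible.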


Problem 2. A plane wave is incident on a perfectly conducting plane with a circular aperture whose radius $a$
is small compared with the wavelength. Determine the intensity of diffracted light passing through the aperture
(Rayleigh, 1897).

Solution. As stated above, this problem is equivalent to that of diffraction by a circular disc with $\mu = \infty$,
and, since $a \ll \lambda$, we have the case of scattering by a small particle. According to §92, in order to solve the
latter problem it is necessary to determine the static electric and magnetic polarizabilities of the disc. The field
$\mathbf{E}_0$ is perpendicular to the plane of the disc, and the boundary condition $E'_z = 0$ is formally identical with
the condition in electrostatics at the boundary of a body with $\varepsilon = 0$. The field $\mathbf{H}_0$ is parallel to the
disc, and the boundary condition $\mathbf{H}'_t = 0$ corresponds to a magnetostatic problem with $\mu \to \infty$. Hence the
electric and magnetic moments of the disc are (see §4, Problem 4, and §54, Problem)
$\boldsymbol{\mathscr{P}} = -2a^3\mathbf{E}_0/3\pi$, $\boldsymbol{\mathscr{M}} = 4a^3\mathbf{H}_0/3\pi$. In going to the
problem of diffraction by an aperture we must, in accordance with formulae (95.4), divide these expressions by 2
and then substitute them in the scattering formula (92.1).

Thus the intensity of radiation diffracted into a solid angle $do$ is

$$dI = \frac{\omega^4a^6}{36\pi^3c^3}\{\mathbf{n}\times\mathbf{E}_0 - 2\mathbf{n}\times(\mathbf{H}_0\times\mathbf{n})\}^2\,do
= \frac{\omega^4a^6}{36\pi^3c^3}\{E_0^2 - (\mathbf{n}\cdot\mathbf{E}_0)^2 + 4H_0^2 - 4(\mathbf{n}\cdot\mathbf{H}_0)^2 + 4\mathbf{n}\cdot\mathbf{H}_0\times\mathbf{E}_0\}\,do.$$

The total diffracted intensity is obtained by integration over a hemisphere, and is

$$I = \frac{\omega^4a^6}{27\pi^2c^3}(E_0^2 + 4H_0^2).$$

The diffraction cross-section may be defined as the ratio of the intensity of diffracted radiation to the energy
flux density in the incident wave $cE^2/4\pi$ (letters without suffixes refer to the incident wave). Two modes of
polarization of the incident wave may be distinguished:

(a) the vector $\mathbf{E}$ in the incident wave is perpendicular to the plane of incidence (the $xz$-plane), i.e. it is
parallel to the plane of the screen (the $xy$-plane). The sum of the fields in the incident and reflected waves at
the surface of the screen is

$$\mathbf{E}_0 = 0, \qquad H_{0x} = 2H\cos\alpha = 2E\cos\alpha$$

($\alpha$ being the angle of incidence). Hence

$$d\sigma = \frac{16a^6\omega^4}{9\pi^2c^4}\cos^2\alpha\,(1 - \sin^2\vartheta\cos^2\phi)\,do,$$

where $\vartheta$ is the angle between the direction of diffraction $\mathbf{n}$ and the normal to the screen (the $z$-axis),
and $\phi$ is the azimuth of the vector $\mathbf{n}$ with respect to the plane of incidence. The total cross-section is

$$\sigma = (64\omega^4a^6/27\pi c^4)\cos^2\alpha.$$

(b) the vector $\mathbf{E}$ lies in the plane of incidence. Then $E_{0z} = -2E\sin\alpha$, $H_0 = H_{0y} = 2H = 2E$. The
differential cross-section is

$$d\sigma = \frac{16a^6\omega^4}{9\pi^2c^4}\{\cos^2\vartheta + \sin^2\vartheta\,(\cos^2\phi + \tfrac14\sin^2\alpha) - \sin\vartheta\sin\alpha\cos\phi\}\,do,$$

and the total cross-section is $\sigma = (64a^6\omega^4/27\pi c^4)(1 + \tfrac14\sin^2\alpha)$.

For natural incident light $\sigma = (64a^6\omega^4/27\pi c^4)(1 - \tfrac38\sin^2\alpha)$.
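
The total cross-sections follow from the differential ones by integration over the hemisphere $z > 0$, and this is easily checked numerically. The sketch below works in units of $a^6\omega^4/c^4$ and assumes the angular factors $\cos^2\alpha\,(1 - \sin^2\vartheta\cos^2\phi)$ for case (a) and $\cos^2\vartheta + \sin^2\vartheta\,(\cos^2\phi + \tfrac14\sin^2\alpha) - \sin\vartheta\sin\alpha\cos\phi$ for case (b), each multiplied by $16/9\pi^2$; the function names, grid sizes and the angle of incidence are illustrative.

```python
import math

def factor_a(theta_s, phi, alpha):
    """Angular factor for E perpendicular to the plane of incidence."""
    return math.cos(alpha) ** 2 * (1 - math.sin(theta_s) ** 2 * math.cos(phi) ** 2)

def factor_b(theta_s, phi, alpha):
    """Angular factor for E in the plane of incidence."""
    st, ct = math.sin(theta_s), math.cos(theta_s)
    return (ct ** 2 + st ** 2 * (math.cos(phi) ** 2 + 0.25 * math.sin(alpha) ** 2)
            - st * math.sin(alpha) * math.cos(phi))

def hemisphere_integral(f, n_theta=200, n_phi=200):
    """Integral of f(theta, phi) over the hemisphere 0 <= theta <= pi/2 (midpoint rule)."""
    ht, hp = (math.pi / 2) / n_theta, 2 * math.pi / n_phi
    total = 0.0
    for i in range(n_theta):
        t = (i + 0.5) * ht
        for j in range(n_phi):
            total += f(t, (j + 0.5) * hp) * math.sin(t)
    return total * ht * hp

alpha = 0.7                               # illustrative angle of incidence in radians
pref = 16 / (9 * math.pi ** 2)
sigma_a = pref * hemisphere_integral(lambda t, p: factor_a(t, p, alpha))
sigma_b = pref * hemisphere_integral(lambda t, p: factor_b(t, p, alpha))
unit = 64 / (27 * math.pi)
print(sigma_a, unit * math.cos(alpha) ** 2)
print(sigma_b, unit * (1 + 0.25 * math.sin(alpha) ** 2))
print(0.5 * (sigma_a + sigma_b), unit * (1 - 0.375 * math.sin(alpha) ** 2))
```

The last line averages the two polarizations and reproduces the factor $1 - \tfrac38\sin^2\alpha$ for natural light.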


CHAPTER XI 


ELECTROMAGNETIC WAVES IN 
ANISOTROPIC MEDIA 


§96. The permittivity of crystals 

The properties of an anisotropic medium with respect to electromagnetic waves are
defined by the tensors $\varepsilon_{ik}(\omega)$ and $\mu_{ik}(\omega)$, which give the relation between the
inductions and the fields:†

$$D_i = \varepsilon_{ik}(\omega)E_k, \qquad B_i = \mu_{ik}(\omega)H_k. \tag{96.1}$$

In what follows we shall, for definiteness, consider the electric field and the tensor $\varepsilon_{ik}$; all the
results obtained are valid for the tensor $\mu_{ik}$ also.

As $\omega \to 0$, the $\varepsilon_{ik}$ tend to their static values, which have been shown in §13 to be
symmetrical with respect to $i$ and $k$. The proof was thermodynamical, and therefore holds
only for states of thermodynamic equilibrium. In a variable field, a substance is of course
not in equilibrium, and the proof in §13 is consequently invalid. To ascertain the properties
of the tensor $\varepsilon_{ik}$ we must use the generalized principle of the symmetry of the kinetic
coefficients (see SP 1, §125).

The generalized susceptibilities $\alpha_{ab}(\omega)$ which appear in the formulation of this principle
are defined in terms of the response of the system to a perturbation

$$\hat V = -\hat x_a f_a(t)$$

(where the $x_a$ are quantities describing the system) and are the coefficients in the linear
relation between the Fourier components of the mean values $\bar x_a(t)$ and the generalized
forces $f_a(t)$: $\bar x_{a\omega} = \alpha_{ab}(\omega)f_{b\omega}$. The change in the energy of the system with time under
the perturbation is given by $-\bar x_a\,df_a/dt$. According to the symmetry principle,

$$\alpha_{ab}(\omega) = \alpha_{ba}(\omega),$$

if the system is not in an external magnetic field and has no magnetic structure; otherwise,
$\alpha_{ba}(\omega)$ has to be taken for the "time-reversed" system.

It is easy to relate the components of the tensor $\varepsilon_{ik}(\omega)$ to the generalized susceptibilities.
To do so, we note that the rate of change of the energy of a dielectric body in a variable
electric field is given by the integral

$$\frac{1}{4\pi}\int\mathbf{E}\cdot\frac{\partial\mathbf{D}}{\partial t}\,dV. \tag{96.2}$$

† It should be recalled that these quantities refer to the variable fields in the wave; the possible presence of a
constant induction (in a pyroelectric or ferromagnetic crystal) is irrelevant to this discussion.

A comparison with the above formulae shows that, if the components of the vector $\mathbf{E}$ at
each point are taken as the quantities $x_a$, the corresponding quantities $f_a$ will be, apart from a
constant factor, the components of $\mathbf{D}$. (The suffix $a$ takes a continuous series of values, labelling both
the components of the vectors and the points in the body.) The coefficients $\alpha_{ab}$ are then, apart from
the same factor, the components of the tensor $\varepsilon^{-1}_{ik}$. The symmetry properties of $\varepsilon^{-1}_{ik}$
are, of course, identical with those of its inverse. Since $\mathbf{E}$ and $\mathbf{D}$ are multiplied in (96.2) only at the
same point in the body, the interchange of the suffixes $a$ and $b$ is equivalent to simply interchanging the
tensor suffixes. We thus conclude that the tensor $\varepsilon_{ik}$ is symmetrical:‡

$$\varepsilon_{ik}(\omega) = \varepsilon_{ki}(\omega). \tag{96.3}$$

‡ The properties of this tensor in the presence of an external magnetic field are discussed in §101.

It should be noted that the components of the polarizability tensor for the whole body,
i.e. the coefficients in the equations $\mathscr{P}_i = V\alpha_{ik}\mathfrak{E}_k$, also come under the definition of
generalized susceptibilities. For the rate of change of the energy of a body placed in a
variable external field $\boldsymbol{\mathfrak{E}}$ is

$$-\boldsymbol{\mathscr{P}}\cdot d\boldsymbol{\mathfrak{E}}/dt. \tag{96.4}$$

Hence we see that, if the $x_a$ are the three components of the vector $\boldsymbol{\mathscr{P}}$, then the
corresponding $f_a$ are those of the vector $\boldsymbol{\mathfrak{E}}$, so that the coefficients $\alpha_{ab}$ are in this case
$V\alpha_{ik}$.

Several of the formulae derived previously for an isotropic medium can be directly
generalized to the anisotropic case. Repeating for the anisotropic case the derivation in §80,
we find that the energy dissipation in a monochromatic electromagnetic field is

which is analogous to (80.5). The condition that absorption be absent is
$\varepsilon_{ik}^* = \varepsilon_{ki} = \varepsilon_{ik}$, i.e. the $\varepsilon_{ik}$ must be real, as must the $\mu_{ik}$.

When absorption is absent, the internal electromagnetic energy per unit volume
can be defined as shown in §80. The formula for an anisotropic medium corresponding
to (80.11) is

In §87 we used the surface impedance $\zeta$, in terms of which the boundary conditions at the
surface of a metal can be formulated even if the permittivity is no longer meaningful. At the
surface of an anisotropic body the boundary condition corresponding to (87.6) is

$$E_\alpha = \zeta_{\alpha\beta}(\mathbf{H}\times\mathbf{n})_\beta, \tag{96.7}$$

where $\zeta_{\alpha\beta}(\omega)$ is a two-dimensional tensor on the surface of the body. It should be borne in
mind that the value of this tensor depends, in general, on the crystallographic direction of
the surface concerned.

The energy flux into the body is
$(c/4\pi)\mathbf{E}\times\mathbf{H}\cdot\mathbf{n} = (c/4\pi)\mathbf{E}\cdot\mathbf{H}\times\mathbf{n} = (c/4\pi)E_\alpha(\mathbf{H}\times\mathbf{n})_\alpha$.
(Here $\mathbf{E}$ and $\mathbf{H}$ are real.) Hence we see that if, in applying the principle of the symmetry of
the kinetic coefficients, we take the components $E_\alpha$ as the $x_a$, then the corresponding $\dot f_a$ will
be, omitting the constant factor $c/4\pi$, $-(\mathbf{H}\times\mathbf{n})_\alpha$, i.e. $f_a$ will be
$-(i/\omega)(\mathbf{H}\times\mathbf{n})_\alpha$ (returning to the complex form). The coefficients $\alpha_{ab}$
are therefore the same, apart from a factor, as the components $\zeta_{\alpha\beta}$, and we conclude that

$$\zeta_{\alpha\beta} = \zeta_{\beta\alpha} \tag{96.8}$$

in the absence of an external magnetic field.

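PROBLEM

Express the components of the tensor $\zeta_{\alpha\beta}$ in terms of those of $\eta_{\alpha\beta} = \varepsilon^{-1}_{\alpha\beta}$, assuming that the latter exists and
that the body is non-magnetic ($\mu_{ik} = \delta_{ik}$).

Solution. In an anisotropic medium, the equation $\zeta^2 = 1/\varepsilon$ (87.2) becomes $\zeta_{\alpha\gamma}\zeta_{\gamma\beta} = \eta_{\alpha\beta}$. In components this
gives

$$\zeta_{11}^2 + \zeta_{12}\zeta_{21} = \eta_{11}, \qquad \zeta_{22}^2 + \zeta_{12}\zeta_{21} = \eta_{22},$$

$$\zeta_{12}(\zeta_{11} + \zeta_{22}) = \eta_{12}, \qquad \zeta_{21}(\zeta_{11} + \zeta_{22}) = \eta_{21}.$$

The solution of these equations is

$$\zeta_{12} = \eta_{12}/\xi, \qquad \zeta_{21} = \eta_{21}/\xi,$$

$$\zeta_{11} = [\eta_{11} \pm \sqrt{\eta_{11}\eta_{22} - \eta_{12}\eta_{21}}]/\xi, \qquad \zeta_{22} = [\eta_{22} \pm \sqrt{\eta_{11}\eta_{22} - \eta_{12}\eta_{21}}]/\xi,$$

$$\xi^2 = \eta_{11} + \eta_{22} \pm 2\sqrt{\eta_{11}\eta_{22} - \eta_{12}\eta_{21}}.$$

The choice of signs is determined by the condition that the absorption of energy must be positive. We do not
assume $\zeta_{12} = \zeta_{21}$, and thereby allow for the presence of an external magnetic field.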
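
The closed-form solution of this Problem can be checked numerically. The following Python sketch builds the impedance tensor from the formulae of the solution for one common choice of sign and confirms that its square reproduces the tensor $\eta_{\alpha\beta}$. The function names `impedance_from_eta` and `matmul2` and the numerical values are illustrative assumptions, not taken from the text, and the sketch does not attempt to select the physically admissible sign, which requires the absorption condition.

```python
import cmath

def impedance_from_eta(eta, sign=+1):
    """Return the 2x2 tensor zeta with zeta*zeta = eta, from the closed form of the Problem.

    eta  : ((eta11, eta12), (eta21, eta22)); may be complex and need not be symmetric
    sign : +1 or -1, the common sign taken in front of the square root
    """
    (e11, e12), (e21, e22) = eta
    root = sign * cmath.sqrt(e11 * e22 - e12 * e21)   # +/- sqrt(eta11*eta22 - eta12*eta21)
    xi = cmath.sqrt(e11 + e22 + 2 * root)             # xi^2 = eta11 + eta22 +/- 2*sqrt(...)
    return ((e11 + root) / xi, e12 / xi), (e21 / xi, (e22 + root) / xi)

def matmul2(a, b):
    """Product of two 2x2 matrices given as nested tuples."""
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )

# Illustrative (hypothetical) values of eta; eta12 != eta21 mimics an external magnetic field.
eta = ((0.30 - 0.02j, 0.05 + 0.01j), (0.04 - 0.01j, 0.25 - 0.03j))
zeta = impedance_from_eta(eta)
square = matmul2(zeta, zeta)
residual = max(abs(square[i][j] - eta[i][j]) for i in range(2) for j in range(2))
print(residual)  # should be at the level of rounding error
```

Calling `impedance_from_eta(eta, sign=-1)` gives the other branch of the square root; both branches satisfy the four component equations.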

§97. A plane wave in an anisotropic medium 

In studying the optics of anisotropic bodies (crystals) we shall take only the most
important case, where the medium may be supposed non-magnetic and transparent in a
given range of frequencies. Accordingly, the relation between the electric and magnetic
fields and inductions is

$$D_i = \varepsilon_{ik}E_k, \qquad \mathbf{B} = \mathbf{H}. \tag{97.1}$$

The components of the dielectric tensor $\varepsilon_{ik}$ are all real, and its principal values are positive.
Maxwell’s equations for the field of a monochromatic wave with frequency $\omega$ are

$$i\omega\mathbf{H} = c\,\mathrm{curl}\,\mathbf{E}, \qquad i\omega\mathbf{D} = -c\,\mathrm{curl}\,\mathbf{H}. \tag{97.2}$$

In a plane wave propagated in a transparent medium all quantities are proportional to
$e^{i\mathbf{k}\cdot\mathbf{r}}$, with a real wave vector k. Effecting the differentiation with respect to the coordinates,
we obtain

$$\omega\mathbf{H}/c = \mathbf{k}\times\mathbf{E}, \qquad \omega\mathbf{D}/c = -\mathbf{k}\times\mathbf{H}. \tag{97.3}$$

Hence we see, first of all, that the three vectors k, D, H are mutually perpendicular.
Moreover, H is perpendicular to E, and so the three vectors D, E, k, being all perpendicular
to H, must be coplanar. Fig. 51 shows the relative position of all these vectors. With
respect to the direction of the wave vector, D and H are transverse, but E is not. The
diagram shows also the direction of the energy flux S in the wave. It is given by the vector
product $\mathbf{E}\times\mathbf{H}$, i.e. it is perpendicular to both E and H. The direction of S is not the same as
that of k, unlike what happens for an isotropic medium. Clearly the vector S is coplanar
with E, D and k, and the angle between S and k is equal to that between E and D.

Fig. 51
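We can define a vector n by

$$\mathbf{k} = \omega\mathbf{n}/c. \tag{97.4}$$

The magnitude of this vector in an anisotropic medium depends on its direction, whereas in
an isotropic medium $n = \sqrt{\varepsilon}$ depends only on the frequency. Using (97.4), we can write
the fundamental formulae (97.3) as

$$\mathbf{H} = \mathbf{n}\times\mathbf{E}, \qquad \mathbf{D} = -\mathbf{n}\times\mathbf{H}. \tag{97.5}$$

The energy flux vector in a plane wave is

$$\mathbf{S} = c\,\mathbf{E}\times\mathbf{H}/4\pi = (c/4\pi)\{E^2\mathbf{n} - (\mathbf{E}\cdot\mathbf{n})\mathbf{E}\}; \tag{97.6}$$

in this formula E and H are real.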

So far we have not used the relation (97.1) which involves the constants $\varepsilon_{ik}$ characterizing
the material. This relation, together with equations (97.5), determines the function $\omega(\mathbf{k})$.
Substituting the first equation (97.5) in the second, we have

$$\mathbf{D} = \mathbf{n}\times(\mathbf{E}\times\mathbf{n}) = n^2\mathbf{E} - (\mathbf{n}\cdot\mathbf{E})\mathbf{n}. \tag{97.7}$$

If we equate the components of this vector to $\varepsilon_{ik}E_k$, in accordance with (97.1), we obtain
three linear homogeneous equations for the three components of E: $n^2E_i - n_in_kE_k = \varepsilon_{ik}E_k$,
or

$$(n^2\delta_{ik} - n_in_k - \varepsilon_{ik})E_k = 0. \tag{97.8}$$


The compatibility condition for these equations is that the determinant of their coefficients
should vanish:

$$\det|n^2\delta_{ik} - n_in_k - \varepsilon_{ik}| = 0. \tag{97.9}$$

In practice, this determinant is conveniently evaluated by taking as the axes of x, y, z the
principal axes of the tensor $\varepsilon_{ik}$ (called the principal dielectric axes). Let the principal values
of the tensor be $\varepsilon^{(x)}$, $\varepsilon^{(y)}$, $\varepsilon^{(z)}$. Then a simple calculation gives

$$n^2(\varepsilon^{(x)}n_x^2 + \varepsilon^{(y)}n_y^2 + \varepsilon^{(z)}n_z^2) - [n_x^2\varepsilon^{(x)}(\varepsilon^{(y)} + \varepsilon^{(z)}) + n_y^2\varepsilon^{(y)}(\varepsilon^{(x)} + \varepsilon^{(z)}) + n_z^2\varepsilon^{(z)}(\varepsilon^{(x)} + \varepsilon^{(y)})] + \varepsilon^{(x)}\varepsilon^{(y)}\varepsilon^{(z)} = 0. \tag{97.10}$$

The sixth-order terms cancel when the determinant is expanded; this is, of course, no
accident and is due ultimately to the fact that the wave has two, not three, independent
directions of polarization.

Equation (97.10), called Fresnel’s equation, is one of the fundamental equations of
crystal optics.† It determines implicitly the dispersion relation, i.e. the frequency as a
function of the wave vector. (The principal values $\varepsilon^{(i)}$ are functions of frequency, and so are,
in some cases (see §99), the directions of the principal axes of the tensor $\varepsilon_{ik}$.) For
monochromatic waves, however, $\omega$, and therefore all the $\varepsilon^{(i)}$, are usually given constants,
and equation (97.10) then gives the magnitude of the wave vector as a function of its
direction. When the direction of n is given, (97.10) is a quadratic equation for $n^2$, with real
coefficients. Hence two different magnitudes of the wave vector correspond, in general, to
each direction of n.
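
As an illustration of how (97.10) fixes the magnitude of n once its direction is given, the following Python sketch writes $n_i = nu_i$ with a unit vector u, solves the resulting quadratic equation for $n^2$, and checks that the determinant (97.9) vanishes at each root. The function names and the principal values used are assumptions made for this example only.

```python
import math

def fresnel_coefficients(eps, direction):
    """Coefficients (a, b, c) of a*n^4 - b*n^2 + c = 0, i.e. (97.10) with n_i = n*u_i."""
    norm = math.sqrt(sum(comp * comp for comp in direction))
    ux2, uy2, uz2 = ((comp / norm) ** 2 for comp in direction)
    ex, ey, ez = eps
    a = ex * ux2 + ey * uy2 + ez * uz2
    b = ux2 * ex * (ey + ez) + uy2 * ey * (ex + ez) + uz2 * ez * (ex + ey)
    return a, b, ex * ey * ez

def fresnel_roots(eps, direction):
    """The two values of n^2 for the given direction, in ascending order."""
    a, b, c = fresnel_coefficients(eps, direction)
    disc = math.sqrt(max(b * b - 4.0 * a * c, 0.0))  # guard against rounding below zero
    return (b - disc) / (2.0 * a), (b + disc) / (2.0 * a)

def det_97_9(eps, n):
    """Determinant |n^2 delta_ik - n_i n_k - eps_ik| of (97.9) in the principal axes."""
    n2 = sum(comp * comp for comp in n)
    m = [[(n2 - eps[i] if i == k else 0.0) - n[i] * n[k] for k in range(3)]
         for i in range(3)]
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

eps = (2.0, 3.0, 4.0)          # illustrative principal values eps_x, eps_y, eps_z
direction = (1.0, 2.0, 2.0)    # any non-zero vector along n
length = math.sqrt(sum(comp * comp for comp in direction))
residuals = []
for n2 in fresnel_roots(eps, direction):
    n = [math.sqrt(n2) * comp / length for comp in direction]
    residuals.append(det_97_9(eps, n))
print(fresnel_roots(eps, direction), residuals)
```

For a wave vector along the x-axis the two roots reduce to $\varepsilon^{(y)}$ and $\varepsilon^{(z)}$, which gives a quick check of the coefficients.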

Equation (97.10) (with constant coefficients $\varepsilon^{(i)}$) defines in the coordinates $n_x$, $n_y$, $n_z$ the
“wave-vector surface”.‡ In general this is a surface of the fourth order, whose properties
will be discussed in detail in the following sections. Here we shall mention some general
properties of this surface.

We first introduce another quantity characterizing the propagation of light in an
anisotropic medium. The direction of the light rays (in geometrical optics) is given by the
group velocity vector $\partial\omega/\partial\mathbf{k}$. In an isotropic medium, the direction of this vector is always
the same as that of the wave vector, but in an anisotropic medium the two do not in general
coincide. The rays may be characterized by a vector s, whose direction is that of the group
velocity, while its magnitude is given by

$$\mathbf{n}\cdot\mathbf{s} = 1. \tag{97.11}$$

We shall call s the ray vector. Its significance is as follows. 

Let us consider a beam of rays (of a single frequency) propagated in all directions from
some point. The value of the eikonal $\psi$ (which is, apart from a factor $\omega/c$, the wave phase;
see §85) at any point is given by the integral $\int\mathbf{n}\cdot d\mathbf{l}$ taken along the ray. Using the vector s
which determines the direction of the ray, we can put

$$\psi = \int\mathbf{n}\cdot d\mathbf{l} = \int(\mathbf{n}\cdot\mathbf{s}/s)\,dl = \int dl/s. \tag{97.12}$$

In a homogeneous medium, s is constant along the ray, so that $\psi = L/s$, where L is the
length of the ray segment concerned. Hence we see that, if a segment equal (or
proportional) to s is taken along each ray from the centre, the resulting surface is such that
the phase of the rays is the same at every point. This is called the ray surface.

The wave-vector surface and the ray surface are in a certain dual relationship. Let the
equation of the wave-vector surface be written $f(\omega, \mathbf{k}) = 0$. Then the group velocity
vector is

$$\frac{\partial\omega}{\partial\mathbf{k}} = -\frac{\partial f/\partial\mathbf{k}}{\partial f/\partial\omega}, \tag{97.13}$$

i.e. is proportional to $\partial f/\partial\mathbf{k}$, or, what is the same thing (since the derivative is taken for
constant $\omega$), to $\partial f/\partial\mathbf{n}$. The ray vector, therefore, is also proportional to $\partial f/\partial\mathbf{n}$. But the vector
$\partial f/\partial\mathbf{n}$ is normal to the surface f = 0. Thus we conclude that the direction of the ray vector of
a wave with given n is that of the normal at the corresponding point of the wave-vector
surface.

† The foundations of crystal optics were laid by A. J. Fresnel in the 1820s, on the basis of mechanical analogies,
long before the development of the electromagnetic theory.

‡ A concept called the “surface of normals” or “surface of indices” has been used; it is obtained by taking a
point at a distance 1/n (instead of n) in each direction, but is less convenient.

It is easy to see that the reverse is also true: the normal to the ray surface gives the
direction of the corresponding wave vectors. For the equation $\mathbf{s}\cdot\delta\mathbf{n} = 0$, where $\delta\mathbf{n}$ is an
arbitrary infinitesimal change in n (for given $\omega$), i.e. the vector of an infinitesimal
displacement on the surface, expresses the fact that s is perpendicular to the wave-vector
surface. Differentiating (again for given $\omega$) the equation $\mathbf{n}\cdot\mathbf{s} = 1$, we obtain
$\mathbf{n}\cdot\delta\mathbf{s} + \mathbf{s}\cdot\delta\mathbf{n} = 0$, and therefore $\mathbf{n}\cdot\delta\mathbf{s} = 0$, which proves the above statement.

This relation between the surfaces of n and s can be made more precise. Let $\mathbf{n}_0$ be the
position vector of a point on the wave-vector surface, and $\mathbf{s}_0$ the corresponding ray vector.
The equation (in coordinates $n_x$, $n_y$, $n_z$) of the tangent plane at this point is $\mathbf{s}_0\cdot(\mathbf{n} - \mathbf{n}_0) = 0$,
which states that $\mathbf{s}_0$ is perpendicular to any vector $\mathbf{n} - \mathbf{n}_0$ in the plane. Since $\mathbf{s}_0$ and $\mathbf{n}_0$ are
related by $\mathbf{s}_0\cdot\mathbf{n}_0 = 1$, we can write the equation as

$$\mathbf{s}_0\cdot\mathbf{n} = 1. \tag{97.14}$$

Hence it follows that $1/s_0$ is the length of the perpendicular from the origin to the tangent
plane to the wave-vector surface at the point $\mathbf{n}_0$.

Conversely, the length of the perpendicular from the origin to the tangent plane to the
ray surface at a point $\mathbf{s}_0$ is $1/n_0$.

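To ascertain the location of the ray vector relative to the field vectors in the wave, we
notice that the group velocity is always in the same direction as the (time) averaged energy
flux vector. For let us consider a wave packet, occupying a small region of space. When the
packet moves, the energy concentrated in it must move with it, and the direction of the
energy flux is therefore the same as the direction of the velocity of the packet, i.e. the group
velocity. It can be demonstrated from (97.5) that the group velocity is in the same direction
as the Poynting vector. Differentiating (for given $\omega$), we obtain

$$\delta\mathbf{D} = \delta\mathbf{H}\times\mathbf{n} + \mathbf{H}\times\delta\mathbf{n}, \qquad \delta\mathbf{H} = \mathbf{n}\times\delta\mathbf{E} + \delta\mathbf{n}\times\mathbf{E}. \tag{97.15}$$

We take the scalar product of the first equation with E and of the second with H, obtaining

$$\mathbf{E}\cdot\delta\mathbf{D} = \mathbf{H}\cdot\delta\mathbf{H} + \mathbf{E}\times\mathbf{H}\cdot\delta\mathbf{n}, \qquad \mathbf{H}\cdot\delta\mathbf{H} = \mathbf{D}\cdot\delta\mathbf{E} + \mathbf{E}\times\mathbf{H}\cdot\delta\mathbf{n}.$$

But $\mathbf{D}\cdot\delta\mathbf{E} = \varepsilon_{ik}E_k\,\delta E_i = \mathbf{E}\cdot\delta\mathbf{D}$, and so, adding the two equations, we have

$$\mathbf{E}\times\mathbf{H}\cdot\delta\mathbf{n} = 0, \tag{97.16}$$

i.e. the vector $\mathbf{E}\times\mathbf{H}$ is normal to the wave-vector surface. This is the required result.†
Since the Poynting vector is perpendicular to H and E, the same is true of s:

$$\mathbf{s}\cdot\mathbf{H} = 0, \qquad \mathbf{s}\cdot\mathbf{E} = 0. \tag{97.17}$$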

A direct calculation, using formulae (97.5), (97.11) and (97.17), gives

$$\mathbf{H} = \mathbf{s}\times\mathbf{D}, \qquad \mathbf{E} = -\mathbf{s}\times\mathbf{H}. \tag{97.18}$$

For example, $\mathbf{s}\times\mathbf{H} = \mathbf{s}\times(\mathbf{n}\times\mathbf{E}) = \mathbf{n}(\mathbf{s}\cdot\mathbf{E}) - \mathbf{E}(\mathbf{n}\cdot\mathbf{s}) = -\mathbf{E}$.

† The result thus obtained relates to the instantaneous, as well as to the average energy flux. In the proof,
however, the symmetry of the tensor $\varepsilon_{ik}$ is vital. The result is therefore not valid in the above form for media in
which $\varepsilon_{ik}$ is not symmetrical (gyrotropic media, §101). The statement is still valid, however, for the average value of
the Poynting vector (§101, Problem 1).

If we compare formulae (97.18) and (97.5), we see that they differ by the interchange of

$$\mathbf{E}\ \text{and}\ \mathbf{D}, \qquad \mathbf{n}\ \text{and}\ \mathbf{s}, \qquad \varepsilon_{ik}\ \text{and}\ \varepsilon^{-1}_{ik} \tag{97.19}$$

(the relation $\mathbf{n}\cdot\mathbf{s} = 1$ remaining valid, of course). The last of these pairs must be included in
order that the relation (97.1) between D and E should remain valid. Thus the following
useful rule may be formulated: an equation valid for one set of quantities can be converted
into one valid for another set by means of the interchanges (97.19).

In particular, the application of this rule to (97.10) gives immediately an analogous
equation for s:

$$s^2(\varepsilon^{(y)}\varepsilon^{(z)}s_x^2 + \varepsilon^{(x)}\varepsilon^{(z)}s_y^2 + \varepsilon^{(x)}\varepsilon^{(y)}s_z^2) - [s_x^2(\varepsilon^{(y)} + \varepsilon^{(z)}) + s_y^2(\varepsilon^{(x)} + \varepsilon^{(z)}) + s_z^2(\varepsilon^{(x)} + \varepsilon^{(y)})] + 1 = 0. \tag{97.20}$$

This equation gives the form of the ray surface. Like the wave-vector surface, it is of the
fourth order. When the direction of s is given, (97.20) is a quadratic equation for $s^2$, which
in general has two different real roots. Thus two rays with different wave vectors can be
propagated in any direction in the crystal.

Let us now consider the polarization of waves propagated in an anisotropic medium.
Equations (97.8), from which Fresnel’s equation has been derived, are unsuitable for this,
because they involve the field E, whereas it is the induction D which is transverse (to the
given n) in the wave. In order to take account immediately of the fact that D is transverse,
we use for the time being a new coordinate system with one axis in the direction of the wave
vector, and denote the two transverse axes by Greek suffixes, which take the values 1 and 2.
The transverse components of equation (97.7) give $D_\alpha = n^2E_\alpha$; substituting $E_\alpha = \varepsilon^{-1}_{\alpha\beta}D_\beta$,
where $\varepsilon^{-1}_{\alpha\beta}$ is a component of the tensor inverse to $\varepsilon_{ik}$, we have

$$(n^{-2}\delta_{\alpha\beta} - \varepsilon^{-1}_{\alpha\beta})D_\beta = 0. \tag{97.21}$$

The condition for the two equations ($\alpha$ = 1, 2) in the two unknowns $D_1$, $D_2$ to be
compatible is that their determinant should be zero:

$$\det|n^{-2}\delta_{\alpha\beta} - \varepsilon^{-1}_{\alpha\beta}| = 0. \tag{97.22}$$

This condition is, of course, the same as Fresnel’s equation, which was written in the
original coordinates x, y, z. We now see also, however, that the vectors D corresponding to
the two values of n are along the principal axes of the symmetrical two-dimensional tensor
of rank two $\varepsilon^{-1}_{\alpha\beta}$. According to general theorems it follows that these two vectors are
perpendicular. Thus, in the two waves with the wave vector in the same direction, the
electric induction vectors are linearly polarized in two perpendicular planes.

Equations (97.21) have a simple geometrical interpretation. Let us draw the tensor
ellipsoid corresponding to the tensor $\varepsilon^{-1}_{ik}$, returning to the principal dielectric axes, i.e. the
surface

$$\frac{x^2}{\varepsilon^{(x)}} + \frac{y^2}{\varepsilon^{(y)}} + \frac{z^2}{\varepsilon^{(z)}} = 1 \tag{97.23}$$

(Fig. 52). Let this ellipsoid be cut by a plane through its centre perpendicular to the
given direction of n. The section is in general an ellipse; the lengths of its axes determine
the values of n, and their directions determine the directions of the oscillations, i.e. the
vectors D.

Fig. 52

From this construction (with, in general, $\varepsilon^{(x)}$, $\varepsilon^{(y)}$, $\varepsilon^{(z)}$ different) we see at once that, if the
wave vector is in (say) the x-direction, the directions of polarization (D) will be the y and z
directions. If the vector n lies in one of the coordinate planes, e.g. the xy-plane, one of the
directions of polarization is also in that plane, and the other is in the z-direction.
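
The geometrical construction can be followed numerically. The Python sketch below projects the inverse dielectric tensor on the plane perpendicular to a chosen direction, diagonalizes the resulting two-dimensional tensor, and returns for each wave the value of $n^2$ (the squared semi-axis of the section) together with the unit vector along D. The helper names and the numerical values are assumptions made for illustration; the left-hand side of (97.10) is evaluated only as a consistency check.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1], a[2] * b[0] - a[0] * b[2], a[0] * b[1] - a[1] * b[0])

def unit(v):
    norm = math.sqrt(dot(v, v))
    return tuple(c / norm for c in v)

def section_modes(eps, direction):
    """Return [(n^2, D_unit), ...] from the central section of the ellipsoid (97.23)."""
    u = unit(direction)
    k = min(range(3), key=lambda i: abs(u[i]))           # axis least aligned with u
    helper = tuple(1.0 if i == k else 0.0 for i in range(3))
    e1 = unit(cross(u, helper))                          # first transverse axis
    e2 = cross(u, e1)                                    # second transverse axis (unit length)
    eta = lambda a, b: sum(a[i] * b[i] / eps[i] for i in range(3))
    h11, h12, h22 = eta(e1, e1), eta(e1, e2), eta(e2, e2)
    half_tr, rad = 0.5 * (h11 + h22), math.hypot(0.5 * (h11 - h22), h12)
    modes = []
    for lam, fallback in ((half_tr + rad, (1.0, 0.0)), (half_tr - rad, (0.0, 1.0))):
        v, w = (h12, lam - h11), (lam - h22, h12)        # two forms of the eigenvector
        vec = v if math.hypot(*v) >= math.hypot(*w) else w
        if math.hypot(*vec) == 0.0:                      # circular section: any direction
            vec = fallback
        d = unit(tuple(vec[0] * e1[i] + vec[1] * e2[i] for i in range(3)))
        modes.append((1.0 / lam, d))                     # eigenvalue of eta is 1/n^2
    return modes

def fresnel_lhs(eps, n):
    """Left-hand side of Fresnel's equation (97.10) in the principal axes."""
    ex, ey, ez = eps
    nx2, ny2, nz2 = (c * c for c in n)
    return ((nx2 + ny2 + nz2) * (ex * nx2 + ey * ny2 + ez * nz2)
            - (nx2 * ex * (ey + ez) + ny2 * ey * (ex + ez) + nz2 * ez * (ex + ey))
            + ex * ey * ez)

eps = (2.0, 3.0, 4.0)          # illustrative principal values
direction = (1.0, 2.0, 2.0)
modes = section_modes(eps, direction)
for n2, d in modes:
    n = tuple(math.sqrt(n2) * c for c in unit(direction))
    print(n2, d, fresnel_lhs(eps, n))
```

For a wave vector along the x-axis the sketch returns D along the y and z axes, in agreement with the statement made in the text.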

The polarizations of two waves with the ray vector in the same direction have similar
properties. Instead of the directions of the induction D, we must now consider those of the
vector E, which is transverse to s, and equations (97.21) are replaced by the analogous
equations

$$(s^{-2}\delta_{\alpha\beta} - \varepsilon_{\alpha\beta})E_\beta = 0. \tag{97.24}$$

The geometrical construction is here based on the tensor ellipsoid

$$\varepsilon^{(x)}x^2 + \varepsilon^{(y)}y^2 + \varepsilon^{(z)}z^2 = 1, \tag{97.25}$$

corresponding to the tensor $\varepsilon_{ik}$ itself (called the Fresnel ellipsoid).

It should be emphasized that plane waves propagated in an anisotropic medium are
linearly polarized in certain planes. In this respect the optical properties of anisotropic
media are very different from those of isotropic media. A plane wave propagated in an
isotropic medium is in general elliptically polarized, and is linearly polarized only in
particular cases. This important difference arises because the case of complete isotropy of
the medium is in a sense one of degeneracy, in which a single wave vector corresponds to
two directions of polarization, whereas in an anisotropic medium there are in general two
different wave vectors (in the same direction). The two linearly polarized waves propagated
with the same value of n combine to form one elliptically polarized wave.

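PROBLEM

Express the components of the ray vector s in terms of the components of n along the principal dielectric axes.

Solution. Differentiating the left-hand side of the equation f(n) = 0 (97.10) with respect to $n_i$ and
determining from the condition $\mathbf{n}\cdot\mathbf{s} = 1$ the proportionality coefficient between $s_i$ and $\partial f/\partial n_i$, we obtain the
following relations between the vectors s and n:

$$\frac{s_x}{n_x} = \frac{\varepsilon^{(x)}(\varepsilon^{(y)} + \varepsilon^{(z)}) - 2\varepsilon^{(x)}n_x^2 - (\varepsilon^{(x)} + \varepsilon^{(y)})n_y^2 - (\varepsilon^{(x)} + \varepsilon^{(z)})n_z^2}{2\varepsilon^{(x)}\varepsilon^{(y)}\varepsilon^{(z)} - n_x^2\varepsilon^{(x)}(\varepsilon^{(y)} + \varepsilon^{(z)}) - n_y^2\varepsilon^{(y)}(\varepsilon^{(x)} + \varepsilon^{(z)}) - n_z^2\varepsilon^{(z)}(\varepsilon^{(x)} + \varepsilon^{(y)})},$$

and similarly for $s_y$, $s_z$.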
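
A numerical check of this result may be helpful. The Python sketch below takes a wave vector on the wave-vector surface, evaluates s from the formula of the Problem (with the y and z components written out by cyclic permutation), and verifies the relations $\mathbf{n}\cdot\mathbf{s} = 1$ (97.11) and $\mathbf{s}\cdot\mathbf{E} = 0$ (97.17). The field E is taken from (97.8) in the principal axes, where $E_i$ is proportional to $n_i/(n^2 - \varepsilon^{(i)})$; this assumes that no denominator vanishes. All names and numerical values are illustrative assumptions.

```python
import math

def fresnel_roots(eps, u):
    """Roots n^2 of (97.10) for a unit direction u, in the principal dielectric axes."""
    ex, ey, ez = eps
    a = ex * u[0] ** 2 + ey * u[1] ** 2 + ez * u[2] ** 2
    b = (u[0] ** 2 * ex * (ey + ez) + u[1] ** 2 * ey * (ex + ez)
         + u[2] ** 2 * ez * (ex + ey))
    disc = math.sqrt(max(b * b - 4.0 * a * ex * ey * ez, 0.0))
    return (b - disc) / (2.0 * a), (b + disc) / (2.0 * a)

def ray_vector(eps, n):
    """Ray vector s for a wave vector n lying on the wave-vector surface (97.10)."""
    ex, ey, ez = eps
    nx2, ny2, nz2 = (c * c for c in n)
    denom = (2.0 * ex * ey * ez - nx2 * ex * (ey + ez)
             - ny2 * ey * (ex + ez) - nz2 * ez * (ex + ey))
    num = (
        ex * (ey + ez) - 2.0 * ex * nx2 - (ex + ey) * ny2 - (ex + ez) * nz2,
        ey * (ex + ez) - 2.0 * ey * ny2 - (ex + ey) * nx2 - (ey + ez) * nz2,
        ez * (ex + ey) - 2.0 * ez * nz2 - (ex + ez) * nx2 - (ey + ez) * ny2,
    )
    return tuple(n[i] * num[i] / denom for i in range(3))

def electric_field(eps, n):
    """E up to a constant factor, from (97.8): E_i ~ n_i / (n^2 - eps_i)."""
    n2 = sum(c * c for c in n)
    return tuple(n[i] / (n2 - eps[i]) for i in range(3))

eps = (2.0, 3.0, 4.0)                 # illustrative principal values
u = (1.0 / 3.0, 2.0 / 3.0, 2.0 / 3.0) # unit vector along n
checks = []
for n2 in fresnel_roots(eps, u):
    n = tuple(math.sqrt(n2) * c for c in u)
    s = ray_vector(eps, n)
    e = electric_field(eps, n)
    n_dot_s = sum(a * b for a, b in zip(n, s))
    s_dot_e = sum(a * b for a, b in zip(s, e))
    checks.append((n_dot_s, s_dot_e))
print(checks)  # n.s should be close to 1 and s.E close to 0 for both waves
```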
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§98. Optical properties of uniaxial crystals 

The optical properties of a crystal depend primarily on the symmetry of its dielectric
tensor $\varepsilon_{ik}$. In this respect all crystals fall under three types: cubic, uniaxial and biaxial (see
§13). In a crystal of the cubic system $\varepsilon_{ik} = \varepsilon\delta_{ik}$, i.e. the three principal values of the tensor
are equal, and the directions of the principal axes are arbitrary. As regards their optical
properties, therefore, cubic crystals are no different from isotropic bodies.

The uniaxial crystals include those of the rhombohedral, tetragonal and hexagonal
systems. Here one of the principal axes of the tensor $\varepsilon_{ik}$ coincides with the threefold,
fourfold or sixfold axis of symmetry respectively; in optics, this axis is called the optical axis
of the crystal, and in what follows we shall take it as the z-axis, denoting the corresponding
principal value of $\varepsilon_{ik}$ by $\varepsilon_\parallel$. The directions of the other two principal axes, in a plane
perpendicular to the optical axis, are arbitrary, and the corresponding principal values,
which we denote by $\varepsilon_\perp$, are equal.

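If in Fresnel’s equation (97.10) we put $\varepsilon^{(x)} = \varepsilon^{(y)} = \varepsilon_\perp$, $\varepsilon^{(z)} = \varepsilon_\parallel$, the left-hand side is a
product of two quadratic factors:

$$(n^2 - \varepsilon_\perp)[\varepsilon_\parallel n_z^2 + \varepsilon_\perp(n_x^2 + n_y^2) - \varepsilon_\perp\varepsilon_\parallel] = 0.$$

In other words, the quartic equation gives the two quadratic equations

$$n^2 = \varepsilon_\perp, \tag{98.1}$$

$$\frac{n_z^2}{\varepsilon_\perp} + \frac{n_x^2 + n_y^2}{\varepsilon_\parallel} = 1. \tag{98.2}$$

Geometrically, this signifies that the wave-vector surface, which is in general of the fourth
order, becomes two separate surfaces, a sphere and an ellipsoid. Fig. 53 shows a cross-section
of these surfaces. Two cases are possible: if $\varepsilon_\perp > \varepsilon_\parallel$, the sphere lies outside the
ellipsoid, but if $\varepsilon_\perp < \varepsilon_\parallel$ it lies inside. In the first case we speak of a negative uniaxial crystal,
and in the second case of a positive one. The two surfaces touch at opposite poles on the
$n_z$-axis. The direction of the optical axis therefore corresponds to only one value of the
wave vector.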

The ray surface is similar in form. By the rule (97.19), its equation is obtained from (98.1)
and (98.2):

$$s^2 = 1/\varepsilon_\perp, \tag{98.3}$$

$$\varepsilon_\perp s_z^2 + \varepsilon_\parallel(s_x^2 + s_y^2) = 1. \tag{98.4}$$

Fig. 53

In a positive crystal the ellipsoid lies within the sphere, and in a negative one outside.

Thus we see that two types of wave can be propagated in a uniaxial crystal. With respect
to one type, called ordinary waves, the crystal behaves like an isotropic body with refractive
index $n = \sqrt{\varepsilon_\perp}$. The magnitude of the wave vector is $\omega n/c$ whatever its direction, and the
direction of the ray vector is that of n.

In waves of the second type, called extraordinary waves, the magnitude of the wave
vector depends on the angle $\theta$ which it makes with the optical axis. By (98.2)

$$\frac{1}{n^2} = \frac{\sin^2\theta}{\varepsilon_\parallel} + \frac{\cos^2\theta}{\varepsilon_\perp}. \tag{98.5}$$

The ray vector in an extraordinary wave is not in the same direction as the wave vector, but
is coplanar with that vector and the optical axis, their common plane being called the
principal section for the given n. Let this be the zx-plane; the ratio of the derivatives of
the left-hand side of (98.2) with respect to $n_x$ and $n_z$ gives the direction of the ray vector:
$s_x/s_z = \varepsilon_\perp n_x/\varepsilon_\parallel n_z$. Thus the angle $\theta'$ between the ray vector and the optical axis and the
angle $\theta$ satisfy the simple relation

$$\tan\theta' = (\varepsilon_\perp/\varepsilon_\parallel)\tan\theta. \tag{98.6}$$

The directions of n and s are the same only for waves propagated along or perpendicular to
the optical axis.
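
The dependence of the extraordinary wave on direction is easily tabulated. The Python sketch below evaluates n from (98.2), written for a wave vector at an angle $\theta$ to the optical axis, and the angle $\theta'$ of the ray from (98.6). The function names and the values of $\varepsilon_\perp$ and $\varepsilon_\parallel$ are assumptions chosen for illustration and do not refer to any particular crystal.

```python
import math

def extraordinary_index(eps_perp, eps_par, theta):
    """Magnitude n of the extraordinary wave vector at angle theta to the optical axis.

    Follows from (98.2) with n_z = n*cos(theta), n_x = n*sin(theta).
    """
    inv_n2 = math.sin(theta) ** 2 / eps_par + math.cos(theta) ** 2 / eps_perp
    return 1.0 / math.sqrt(inv_n2)

def ray_angle(eps_perp, eps_par, theta):
    """Angle theta' between the ray vector and the optical axis, from (98.6)."""
    return math.atan2(eps_perp * math.sin(theta), eps_par * math.cos(theta))

eps_perp, eps_par = 2.75, 2.21      # illustrative values with eps_perp > eps_par (negative crystal)
theta = math.radians(40.0)
n = extraordinary_index(eps_perp, eps_par, theta)
theta_ray = ray_angle(eps_perp, eps_par, theta)
print(n, math.degrees(theta_ray - theta))   # index and angle between s and n in degrees
```

With these values the ray is inclined to the optical axis at a larger angle than the wave vector, as (98.6) requires for $\varepsilon_\perp > \varepsilon_\parallel$; the difference vanishes at $\theta = 0$ and $\theta = \pi/2$.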

The problem of the directions of polarization of the ordinary and extraordinary waves is 
very easily solved. It is sufficient to observe that the four vectors E, D, s and n are always 
coplanar. In the extraordinary wave s and n are not in the same direction, but lie in the same 
principal section. This wave is therefore polarized so that the vectors E and D lie in the 
same principal section as s and n. The vectors D in the ordinary and extraordinary waves 
with the same direction of n (or E, with the same direction of s) are perpendicular. Hence 
the polarization of the ordinary wave is such that E and D lie in a plane perpendicular to 
the principal section. 

An exception is formed by waves propagated in the direction of the optical axis. In this 
direction there is no difference between the ordinary and the extraordinary wave, and so 
their polarizations combine to give a wave which is, in general, elliptically polarized. 

The refraction of a plane wave incident on the surface of a crystal is different from
refraction at a boundary between two isotropic media. The laws of refraction and reflection
are again obtained from the continuity of the component $\mathbf{n}_t$ of the wave vector which is
tangential to the plane of separation. The wave vectors of the refracted and reflected waves
therefore lie in the plane of incidence. In a crystal, however, two different refracted waves
are formed, a phenomenon known as double refraction or birefringence. They correspond
to the two possible values of the normal component $n_n$ which satisfy Fresnel’s equation for
a given tangential component $\mathbf{n}_t$. It should also be remembered that the observed direction
of propagation of the rays is determined not by the wave vector but by the ray vector s,
whose direction is different from that of n and in general does not lie in the plane of
incidence.

In a uniaxial crystal, ordinary and extraordinary refracted waves are formed. The
ordinary wave is entirely analogous to the refracted wave in isotropic bodies; in particular
its ray vector (which is in the same direction as its wave vector) lies in the plane of
incidence. The ray vector of the extraordinary wave in general does not lie in the plane of
incidence.
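Problem 1. Find the direction of the extraordinary ray when light incident from a vacuum is
refracted at a surface of a uniaxial crystal which is perpendicular to its optical axis.

Solution. In this case the refracted ray lies in the plane of incidence, which we take as the xz-plane, with the
z-axis normal to the surface. The x-component of the wave vector $n_x = \sin\vartheta$ ($\vartheta$ being the angle of incidence) is
continuous; the component $n_z$ for the refracted wave is found from (98.2):

$$n_z = \sqrt{(\varepsilon_\perp/\varepsilon_\parallel)(\varepsilon_\parallel - \sin^2\vartheta)}.$$

The direction of the refracted ray is given by (98.6):

$$\tan\vartheta' = \frac{\varepsilon_\perp n_x}{\varepsilon_\parallel n_z} = \frac{\sqrt{\varepsilon_\perp}\,\sin\vartheta}{\sqrt{\varepsilon_\parallel(\varepsilon_\parallel - \sin^2\vartheta)}},$$

where $\vartheta'$ is the angle of refraction.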
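
The steps of this solution can be sketched in Python as follows. The tangential component $n_x = \sin\vartheta$ is taken as continuous, the normal component $n_z$ is obtained from (98.2), and the direction of the ray from (98.6). The function name and the numerical values are hypothetical, and the sketch assumes incidence from a vacuum on a face perpendicular to the optical axis.

```python
import math

def refracted_extraordinary_ray(eps_perp, eps_par, incidence):
    """Return (n_x, n_z, angle of refraction of the extraordinary ray).

    incidence : angle of incidence in radians, measured from the surface normal (the z-axis),
                which here coincides with the optical axis.
    """
    nx = math.sin(incidence)                                   # continuous tangential component
    nz = math.sqrt(eps_perp / eps_par * (eps_par - nx ** 2))   # from (98.2)
    tan_ray = eps_perp * nx / (eps_par * nz)                   # s_x/s_z, from (98.6)
    return nx, nz, math.atan(tan_ray)

eps_perp, eps_par = 2.75, 2.21         # illustrative values only
incidence = math.radians(30.0)
nx, nz, refraction = refracted_extraordinary_ray(eps_perp, eps_par, incidence)
print(nx, nz, math.degrees(refraction))
```

The wave vector itself is refracted at the angle whose tangent is $n_x/n_z$, which differs from the angle of refraction of the ray unless $\varepsilon_\perp = \varepsilon_\parallel$.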


Solution. The refracted ray lies in the xz-plane, which passes through the normal to the surface (the z-axis)
and the optical axis. Let $\alpha$ be the angle between these axes. The ray vector s, whose components are proportional
to the derivatives of the left-hand side of equation (98.2) with respect to the corresponding components of n, is
proportional to

§99. Biaxial crystals 


In biaxial crystals the three principal values of the tensor $\varepsilon_{ik}$ are all different. The crystals
of the triclinic, monoclinic and orthorhombic systems are of this type. In those of the
triclinic system, the position of the principal dielectric axes is unrelated to any specific
crystallographic direction; in particular, it varies with frequency, as do all the components
$\varepsilon_{ik}$. In crystals of the monoclinic system, one of the principal dielectric axes is
crystallographically fixed; it coincides with the twofold axis of symmetry, or is
perpendicular to the plane of symmetry. The position of the other two principal axes depends
on the frequency. Finally, in crystals of the orthorhombic system, the position of all three
principal axes is fixed: they must coincide with the three mutually perpendicular twofold
axes of symmetry.

The study of the optical properties of biaxial crystals involves the consideration of
Fresnel’s equation in its general form. We shall assume for definiteness that

$$\varepsilon^{(x)} < \varepsilon^{(y)} < \varepsilon^{(z)}. \tag{99.1}$$

To ascertain the form of the fourth-order surface defined by equation (97.10), let us
begin by finding its intersections with the coordinate planes. Putting $n_z = 0$ in equation
(97.10), we find that the left-hand side is the product of two factors:

$$(n^2 - \varepsilon^{(z)})(\varepsilon^{(x)}n_x^2 + \varepsilon^{(y)}n_y^2 - \varepsilon^{(x)}\varepsilon^{(y)}) = 0.$$

Hence we see that the section by the xy-plane consists of the circle

$$n^2 = \varepsilon^{(z)} \tag{99.2}$$

and the ellipse

$$\frac{n_x^2}{\varepsilon^{(y)}} + \frac{n_y^2}{\varepsilon^{(x)}} = 1, \tag{99.3}$$

and by the assumption (99.1) the ellipse lies inside the circle. Similarly we find that the
sections by the yz and xz planes are also composed of an ellipse and a circle; in the yz-plane
the ellipse lies outside the circle, and in the xz-plane they intersect. Thus the wave-vector
surface intersects itself, and is as shown in Fig. 54, where one octant is drawn.



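This surface has four singular points of self-intersection, one in each quadrant of the
xz-plane. The singular points of a surface whose equation is $f(n_x, n_y, n_z) = 0$ are given by
the vanishing of all three first derivatives of the function f. Differentiating the left-hand side
of (97.10), we obtain the equations

$$n_x[\varepsilon^{(x)}(\varepsilon^{(y)} + \varepsilon^{(z)}) - \varepsilon^{(x)}n^2 - (\varepsilon^{(x)}n_x^2 + \varepsilon^{(y)}n_y^2 + \varepsilon^{(z)}n_z^2)] = 0,$$

$$n_y[\varepsilon^{(y)}(\varepsilon^{(x)} + \varepsilon^{(z)}) - \varepsilon^{(y)}n^2 - (\varepsilon^{(x)}n_x^2 + \varepsilon^{(y)}n_y^2 + \varepsilon^{(z)}n_z^2)] = 0,$$

$$n_z[\varepsilon^{(z)}(\varepsilon^{(x)} + \varepsilon^{(y)}) - \varepsilon^{(z)}n^2 - (\varepsilon^{(x)}n_x^2 + \varepsilon^{(y)}n_y^2 + \varepsilon^{(z)}n_z^2)] = 0; \tag{99.4}$$

the equation (97.10) itself must, of course, be satisfied also. Since we know that the required
directions of n lie in the xz-plane, we put $n_y = 0$, and the two remaining equations give
immediately†

$$n_x^2 = \frac{\varepsilon^{(z)}(\varepsilon^{(y)} - \varepsilon^{(x)})}{\varepsilon^{(z)} - \varepsilon^{(x)}}, \qquad n_z^2 = \frac{\varepsilon^{(x)}(\varepsilon^{(z)} - \varepsilon^{(y)})}{\varepsilon^{(z)} - \varepsilon^{(x)}}. \tag{99.5}$$

The directions of these vectors n are inclined to the z-axis at an angle $\beta$ such that

$$\frac{n_x}{n_z} = \pm\tan\beta = \pm\sqrt{\frac{\varepsilon^{(z)}(\varepsilon^{(y)} - \varepsilon^{(x)})}{\varepsilon^{(x)}(\varepsilon^{(z)} - \varepsilon^{(y)})}}. \tag{99.6}$$

This formula determines lines in two directions in the xz-plane, each of which passes
through two opposite singular points and is at an angle $\beta$ to the z-axis. These lines are called
the optical axes or binormals of the crystal; one of them is shown dashed in Fig. 54. The
directions of the optical axes are evidently the only ones for which the wave vector has only
one magnitude.‡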
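
The statement that the wave vector has only one magnitude along the optical axes can be verified directly. The Python sketch below computes the angle $\beta$ of (99.6), substitutes the corresponding direction in Fresnel’s equation (97.10) regarded as a quadratic in $n^2$, and checks that the discriminant vanishes, the double root being $n^2 = \varepsilon^{(y)}$. It also evaluates the angle of the optical ray axes obtained from (99.6) by replacing each $\varepsilon$ by $1/\varepsilon$. The function names and principal values are assumptions made for illustration.

```python
import math

def binormal_angle(eps):
    """Angle beta between an optical axis (binormal) and the z-axis, from (99.6)."""
    ex, ey, ez = eps
    return math.atan(math.sqrt(ez * (ey - ex) / (ex * (ez - ey))))

def biradial_angle(eps):
    """Angle of an optical ray axis: (99.6) with each eps replaced by 1/eps."""
    ex, ey, ez = eps
    return math.atan(math.sqrt((ey - ex) / (ez - ey)))

def fresnel_quadratic(eps, u):
    """Coefficients (a, b, c) of a*n^4 - b*n^2 + c = 0 for a unit direction u."""
    ex, ey, ez = eps
    a = ex * u[0] ** 2 + ey * u[1] ** 2 + ez * u[2] ** 2
    b = (u[0] ** 2 * ex * (ey + ez) + u[1] ** 2 * ey * (ex + ez)
         + u[2] ** 2 * ez * (ex + ey))
    return a, b, ex * ey * ez

eps = (2.0, 3.0, 4.0)      # illustrative values obeying eps_x < eps_y < eps_z, cf. (99.1)
beta = binormal_angle(eps)
gamma = biradial_angle(eps)
a, b, c = fresnel_quadratic(eps, (math.sin(beta), 0.0, math.cos(beta)))
discriminant = b * b - 4.0 * a * c
n2_on_axis = b / (2.0 * a)
print(math.degrees(beta), math.degrees(gamma), discriminant, n2_on_axis)
```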


The properties of the ray surface are entirely similar. To derive the corresponding
formulae, it is sufficient to replace n by s and $\varepsilon$ by $1/\varepsilon$. In particular, there are two optical ray
axes or biradials, also lying in the xz-plane and at an angle $\gamma$ to the z-axis, where

$$\tan\gamma = \sqrt{\frac{\varepsilon^{(y)} - \varepsilon^{(x)}}{\varepsilon^{(z)} - \varepsilon^{(y)}}} = \sqrt{\frac{\varepsilon^{(x)}}{\varepsilon^{(z)}}}\,\tan\beta. \tag{99.7}$$

Since $\varepsilon^{(x)} < \varepsilon^{(z)}$, $\gamma < \beta$.

The directions of corresponding vectors n and s are the same only for waves propagated
along one of the coordinate axes (i.e. the principal dielectric axes). If n lies in one of the
coordinate planes, s lies in that plane also. This rule, however, is subject to an important
exception for wave vectors in the direction of the optical axes.

When the values of n given by (99.5) are substituted in the general formulae for s in terms
of n (§97, Problem), these take the indeterminate form 0/0. The origin and meaning of this
indeterminacy are quite evident from the following geometrical considerations. Near a
singular point, the inner and outer parts of the wave-vector surface are cones with a
common vertex. At the vertex, which is the singular point itself, the direction of the normal
to the surface becomes indeterminate; and the direction of s as given by these formulae is
just the direction of the normal. In fact the wave vector along the binormal corresponds to
an infinity of ray vectors, whose directions occupy a certain conical surface, called the cone
of internal conical refraction.

To determine this cone of rays, we could investigate the directions of the normals near
the singular point. It is more informative, however, to use a geometrical construction from
the ray surface.

Fig. 55 shows one quadrant of the intersection of the ray surface with the xz-plane
(continuous curves), and also the intersection of the wave-vector surface, on a different
scale. The line OS is the biradial, and ON the binormal. Let $\mathbf{n}_N$ be the wave vector
corresponding to the point N. It is easy to see that the singular point N on the wave-vector
surface corresponds to a singular tangent plane to the ray surface. This plane is
perpendicular to ON, and touches the ray surface not at one point but along a curve, which
is found to be a circle. In Fig. 55 the trace of this plane is shown by ab. This follows at once
from the geometrical correspondence between the wave-vector surface and the ray surface
(§97): if the tangent plane is drawn at any point s of the ray surface, then the perpendicular
from the origin to this plane is in the same direction as the wave vector n corresponding to
s, and its length is 1/n. In our case there must be an infinity of vectors s corresponding to the
single value $\mathbf{n} = \mathbf{n}_N$; hence the points on the ray surface which represent these vectors s
must lie in one tangent plane, which is perpendicular to $\mathbf{n}_N$. Thus in Fig. 55 the triangle Oab
is the section of the cone of internal conical refraction by the xz-plane.

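There is no especial difficulty in carrying out a quantitative calculation corresponding to
this geometrical picture, but we shall not do so here, and give only the final formulae. The
equations of the circle in which the cone of refraction cuts the ray surface are

$$(\varepsilon^{(z)} - \varepsilon^{(x)})s_y^2 + [s_x\sqrt{\varepsilon^{(x)}(\varepsilon^{(z)} - \varepsilon^{(y)})} - s_z\sqrt{\varepsilon^{(z)}(\varepsilon^{(y)} - \varepsilon^{(x)})}] \times [s_x\sqrt{(\varepsilon^{(z)} - \varepsilon^{(y)})/\varepsilon^{(x)}} - s_z\sqrt{(\varepsilon^{(y)} - \varepsilon^{(x)})/\varepsilon^{(z)}}] = 0,$$

$$s_x\sqrt{\varepsilon^{(z)}(\varepsilon^{(y)} - \varepsilon^{(x)})} + s_z\sqrt{\varepsilon^{(x)}(\varepsilon^{(z)} - \varepsilon^{(y)})} = \sqrt{\varepsilon^{(z)} - \varepsilon^{(x)}}. \tag{99.8}$$

The first of these equations is the equation of the cone of refraction if $s_x$, $s_y$, $s_z$ are regarded
as three independent variables. The second is the equation of the tangent plane to the ray
surface. In particular, for $s_y = 0$ equation (99.8) gives the two equations

$$\frac{s_x}{s_z} = \sqrt{\frac{\varepsilon^{(z)}(\varepsilon^{(y)} - \varepsilon^{(x)})}{\varepsilon^{(x)}(\varepsilon^{(z)} - \varepsilon^{(y)})}}, \qquad \frac{s_x}{s_z} = \sqrt{\frac{\varepsilon^{(x)}(\varepsilon^{(y)} - \varepsilon^{(x)})}{\varepsilon^{(z)}(\varepsilon^{(z)} - \varepsilon^{(y)})}},$$

which determine the directions of the extreme rays (respectively Oa and Ob in Fig. 55) in the
section by the xz-plane. The former is along the binormal (cf. (99.6)), which is
perpendicular to the tangent ab.

Similar results hold for the wave vectors corresponding to a given ray vector. The vector
s along the biradial corresponds to an infinity of wave vectors, whose directions occupy the
cone of external conical refraction. In Fig. 55 the triangle Oa'b' is the section of this cone by
the xz-plane. The corresponding formulae are again obtained by substituting n for s and
$1/\varepsilon$ for $\varepsilon$ in the formulae (99.8), and are

$$\varepsilon^{(y)}(\varepsilon^{(z)} - \varepsilon^{(x)})n_y^2 + [n_x\sqrt{\varepsilon^{(z)} - \varepsilon^{(y)}} - n_z\sqrt{\varepsilon^{(y)} - \varepsilon^{(x)}}] \times [n_x\varepsilon^{(x)}\sqrt{\varepsilon^{(z)} - \varepsilon^{(y)}} - n_z\varepsilon^{(z)}\sqrt{\varepsilon^{(y)} - \varepsilon^{(x)}}] = 0,$$

$$n_x\sqrt{\varepsilon^{(y)} - \varepsilon^{(x)}} + n_z\sqrt{\varepsilon^{(z)} - \varepsilon^{(y)}} = \sqrt{\varepsilon^{(y)}(\varepsilon^{(z)} - \varepsilon^{(x)})}. \tag{99.9}$$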


In observations of the internal conical refraction† we can use a flat plate cut
perpendicular to the binormal (Fig. 56). The surface of the plate is covered by a diaphragm
of small aperture, which selects a narrow beam from a plane light wave (i.e. one whose wave
vector is in a definite direction) incident on the plate. The wave vector in the wave
transmitted into the plate is in the direction of the binormal, and so the rays are on the cone
of internal refraction. The wave vector in the wave leaving the other side of the plate is the
same as in the incident wave, and so the rays are on a circular cylinder.



To observe the external conical refraction, the plate must be cut perpendicular to the
biradial, and both its surfaces must be covered by diaphragms having small apertures in
exactly opposite positions. When the plate is illuminated by a convergent beam (i.e. one
containing rays with all possible values of n), the diaphragms admit to the plate rays with s
along the biradial, and therefore with directions of n occupying the surface of the cone of
external conical refraction. The light leaving the second aperture is therefore on a conical
surface, although this does not exactly coincide with the cone of external refraction, on
account of the refraction on leaving the plate.

The laws of refraction at the surface of a biaxial crystal for an arbitrary direction of
incidence are extremely complex, and we shall not pause to discuss them here,‡ but only
mention that, unlike what happens for a uniaxial crystal, both refracted waves are
“extraordinary” and the rays of neither lie in the plane of incidence.

As specified in §97, we are describing the optics of transparent crystals, but we may note
here that there is a property of biaxial crystals that may occur when absorption is taken
into account.

† We shall describe only the principle of the experiment.

‡ A detailed account of the calculations may be found in the article by G. Szivessy,
Vol. XX, Chapter 11, Springer, Berlin, 1928.

Let us consider a homogeneous plane wave propagated in the crystal, in which n is a
complex vector but its real and imaginary parts are in the same direction: $\mathbf{n} = n\boldsymbol{\nu}$, where $\boldsymbol{\nu}$ is
a real unit vector, $n = n(\omega)$ a complex quantity. For a given $\boldsymbol{\nu}$, the dispersion equation
(97.21) can be expanded as

$$n^{-4} - n^{-2}(\eta_{11} + \eta_{22}) + \eta_{11}\eta_{22} - \eta_{12}^2 = 0,$$

where $\eta_{\alpha\beta} = \varepsilon^{-1}_{\alpha\beta}$, and 1 and 2 are tensor suffixes in the plane perpendicular to $\boldsymbol{\nu}$. This
equation quadratic in $n^{-2}$ has a multiple root if

$$\eta_{22} - \eta_{11} = \pm 2i\eta_{12}; \tag{99.10}$$

then $n^{-2} = \tfrac{1}{2}(\eta_{11} + \eta_{22})$. When absorption is present, the tensor $\eta_{\alpha\beta} = \eta'_{\alpha\beta} + i\eta''_{\alpha\beta}$ is
complex.

In biaxial crystals, the tensor ellipsoids of $\eta'_{ik}$ and $\eta''_{ik}$ have three unequal axes; the
ratios of the axes are different for the two tensors (and so are their directions, in triclinic
and monoclinic crystals). Under these conditions, the two-dimensional tensors $\eta'_{\alpha\beta}$ and
$\eta''_{\alpha\beta}$ cannot in general be simultaneously brought to diagonal form. The angle $\theta$ between
the principal axes of the two tensors is a function of two independent variables, the angles
which specify the direction of $\boldsymbol{\nu}$. For a given frequency $\omega$, therefore, there can exist a
one-parameter set of directions of $\boldsymbol{\nu}$ for which $\theta = \tfrac14\pi$. With this value of $\theta$, the
imaginary part of the complex equation (99.10) is satisfied identically; the real part is

$$\eta'_2 - \eta'_1 = \pm(\eta''_2 - \eta''_1), \qquad (99.11)$$

where the suffixes 1 and 2 denote the principal values of the tensors concerned.† For any
choice of the $x_1$ and $x_2$ axes, equations (97.21) now give

$$D_2/D_1 = (\eta_{22} - \eta_{11})/2\eta_{12} = \pm i,$$

the two signs on the right corresponding to those in (99.10). Thus the conditions $\theta = \tfrac14\pi$
and (99.11) together determine, for each value of $\omega$, a particular direction of $\boldsymbol{\nu}$ in which
only a wave with circular polarization of one sign, left or right according to the sign for which
(99.10) is satisfied, can be propagated (W. Voigt, 1902). This direction in the crystal is called
the singular optical axis or the circular optical axis.
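The degeneracy along the singular axis can be checked numerically. The following Python
sketch is an illustration only: the components of the tensor are arbitrary values, not data
for any real crystal, and the names `eta11`, `eta12`, `eta22` are introduced here for
convenience. It builds a complex symmetric two-dimensional tensor satisfying (99.10) and
confirms that the quadratic in $1/n^2$ has a double root to which only one polarization,
$D_2/D_1 = i$, corresponds.

```python
import numpy as np

# Hypothetical components of the complex symmetric tensor eta in the plane
# perpendicular to nu, chosen to satisfy (99.10) with the upper sign.
eta11 = 0.40 + 0.02j
eta12 = 0.03 + 0.01j
eta22 = eta11 + 2j * eta12
eta = np.array([[eta11, eta12], [eta12, eta22]])

# Discriminant of the quadratic in x = 1/n^2.
trace = eta11 + eta22
det = eta11 * eta22 - eta12**2
discriminant = trace**2 - 4 * det

# Double root and the single polarization that it admits.
x = trace / 2
ratio = (eta22 - eta11) / (2 * eta12)        # D_2 / D_1
D = np.array([1.0, ratio])
residual = (eta - x * np.eye(2)) @ D

# Rank 1 means that there is only one independent eigenvector.
rank = np.linalg.matrix_rank(eta - x * np.eye(2), tol=1e-12)
print(abs(discriminant), ratio, rank)
```

The rank of $\eta_{\alpha\beta} - n^{-2}\delta_{\alpha\beta}$ is unity here, whereas for the symmetry-imposed
double root in a uniaxial crystal this matrix vanishes identically and two independent
polarizations are possible.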

In accordance with the general theory of linear differential equations, the second
independent solution of the field equations then contains, not only the exponential factor
$e^{i\omega n\boldsymbol{\nu}\cdot\mathbf{r}/c}$ (which includes the damping), but also the factor $a + b\,\boldsymbol{\nu}\cdot\mathbf{r}$ linear in the
coordinates.‡ The polarization of this wave varies along the ray, but ultimately, as $\boldsymbol{\nu}\cdot\mathbf{r}$
increases, a circular polarization is established similar to that in the first wave, as is
obvious if we note that in the limit concerned the substitution of the solution in the field
equations involves differentiating only the exponential factor, and the difference between the
two solutions then disappears.

We should again emphasize the difference between the singular axis and the case where
the dispersion equation necessarily has a double root because of the symmetry of the
crystal. For light propagated along the optical axis of a uniaxial crystal, the tensor $\eta_{\alpha\beta}$
has the form $\eta\delta_{\alpha\beta}$ and the condition (99.10) is satisfied identically. Equations (97.21)
then allow two independent solutions with different polarizations.

† This is easily proved by taking the $x_1$ and $x_2$ axes along the principal axes of the tensor
$\eta'_{\alpha\beta}$ and expressing the components of $\eta''_{\alpha\beta}$ in terms of its principal values.

‡ This solution has to be taken into account, for example, in problems of the reflection and
refraction of light propagated along the singular axis.

§100. Double refraction in an electric field 

An isotropic body becomes optically anisotropic when placed in a static electric field. 
This anisotropy may be regarded as the result of a change in the permittivity due to the 
static field. Although this change is relatively very slight, it is important here because it 
leads to a qualitative change in the optical properties of bodies. 

In this section we denote by E the static electric field in the body,† and expand the
dielectric tensor $\varepsilon_{ik}$ in powers of E. In an isotropic body in the zero-order approximation,
we have $\varepsilon_{ik} = \varepsilon^{(0)}\delta_{ik}$. There can be no terms in $\varepsilon_{ik}$ which are of the first order in the
field, since in an isotropic body there is no constant vector with which a tensor of rank two
linear in E could be constructed. The next terms in the expansion of $\varepsilon_{ik}$ must therefore be
quadratic in the field. From the components of the vector E we can form two symmetrical
tensors of rank two, $E^2\delta_{ik}$ and $E_iE_k$. The former does not alter the symmetry of the tensor
$\varepsilon^{(0)}\delta_{ik}$, and the addition of it amounts to a small correction in the scalar constant $\varepsilon^{(0)}$,
which evidently does not result in optical anisotropy and is therefore of no interest. Thus we
arrive at the following form of the dielectric tensor as a function of the field:

$$\varepsilon_{ik} = \varepsilon^{(0)}\delta_{ik} + \alpha E_iE_k, \qquad (100.1)$$

where $\alpha$ is a scalar constant.

One of the principal axes of this tensor coincides with the direction of the electric field, 
and the corresponding principal value is 

$$\varepsilon_\parallel = \varepsilon^{(0)} + \alpha E^2. \qquad (100.2)$$

The other two principal values are both equal to

$$\varepsilon_\perp = \varepsilon^{(0)}, \qquad (100.3)$$

and the position of the corresponding principal axes in a plane perpendicular to the field is 
arbitrary. Thus an isotropic body in an electric field behaves optically as a uniaxial crystal 
(the Kerr effect). 
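The principal values (100.2) and (100.3) are easily verified by diagonalizing the tensor
(100.1) numerically. The sketch below is an illustration under assumed values: `eps0`,
`alpha` and the field components are arbitrary numbers chosen for the example, not
material constants.

```python
import numpy as np

eps0 = 2.25                       # hypothetical zero-field permittivity
alpha = 1.0e-3                    # hypothetical coefficient alpha in (100.1)
E = np.array([0.0, 3.0, 4.0])     # static field, E^2 = 25 (arbitrary units)

# Dielectric tensor (100.1): eps_ik = eps0 * delta_ik + alpha * E_i * E_k.
eps = eps0 * np.eye(3) + alpha * np.outer(E, E)

# eigh returns the principal values in ascending order (alpha > 0 here).
principal_values, principal_axes = np.linalg.eigh(eps)
eps_perp = principal_values[:2]   # doubly degenerate value, (100.3)
eps_par = principal_values[2]     # value along the field, (100.2)
axis_par = principal_axes[:, 2]   # corresponding principal axis
print(eps_perp, eps_par, axis_par)
```

The non-degenerate principal axis is found to lie along E, as for the optical axis of a
uniaxial crystal.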

The change in optical symmetry in an electric field may occur in a crystal also (for 
example, an optically uniaxial crystal may become biaxial, and a cubic crystal may cease to 
be optically isotropic), and here the effect may be of the first order in the field. This linear 
effect corresponds to a dielectric tensor of the form 

$$\varepsilon_{ik} = \varepsilon^{(0)}_{ik} + \alpha_{ikl}E_l, \qquad (100.4)$$

where the coefficients $\alpha_{ikl}$ form a tensor of rank three symmetrical in the suffixes i and k.
The symmetry of this tensor is the same as that of the piezoelectric tensor. The effect in 
question therefore occurs in the twenty crystal classes which admit piezoelectricity. 

† Not to be confused with the weak variable electric field of the wave.

§101. Magnetic-optical effects

In the presence of a static magnetic field H,‡ the tensor $\varepsilon_{ik}(\omega; \mathbf{H})$ is no longer
symmetrical. The generalized principle of symmetry of the kinetic coefficients relates the
components $\varepsilon_{ik}$ and $\varepsilon_{ki}$ in different fields:

$$\varepsilon_{ik}(\mathbf{H}) = \varepsilon_{ki}(-\mathbf{H}). \qquad (101.1)$$

‡ Not to be confused with the weak variable field of the electromagnetic wave.

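The condition that absorption be absent requires that the tensor should be Hermitian:

$$\varepsilon_{ik} = \varepsilon_{ki}^*, \qquad (101.2)$$

as is seen from (96.5), but not that it should be real. Equation (101.2) implies only that the
real and imaginary parts of $\varepsilon_{ik}$ must be respectively symmetrical and antisymmetrical:

$$\varepsilon'_{ik} = \varepsilon'_{ki}, \qquad \varepsilon''_{ik} = -\varepsilon''_{ki}. \qquad (101.3)$$

Using (101.1), we have

$$\varepsilon'_{ik}(\mathbf{H}) = \varepsilon'_{ki}(\mathbf{H}) = \varepsilon'_{ik}(-\mathbf{H}),$$
$$\varepsilon''_{ik}(\mathbf{H}) = -\varepsilon''_{ki}(\mathbf{H}) = -\varepsilon''_{ik}(-\mathbf{H}), \qquad (101.4)$$

i.e. in a non-absorbing medium $\varepsilon'_{ik}$ is an even function of H, and $\varepsilon''_{ik}$ an odd function.

The inverse tensor $\varepsilon^{-1}{}_{ik}$ evidently has the same symmetry properties, and is more
convenient for use in the following calculations. To simplify the notation we shall write†

$$\varepsilon^{-1}{}_{ik} = \eta_{ik} = \eta'_{ik} + i\eta''_{ik}, \qquad (101.5)$$

as already used above.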

Any antisymmetrical tensor of rank two is equivalent (dual) to some axial vector; let the
vector corresponding to the tensor $\eta''_{ik}$ be G. Using the antisymmetrical unit tensor $e_{ikl}$, we
can write the relation between the components $\eta''_{ik}$ and $G_i$ as

$$\eta''_{ik} = e_{ikl}G_l, \qquad (101.6)$$

or, in components, $\eta''_{xy} = G_z$, $\eta''_{zx} = G_y$, $\eta''_{yz} = G_x$. The relation $E_i = \eta_{ik}D_k$ between the
electric field and induction becomes

$$E_i = (\eta'_{ik} + ie_{ikl}G_l)D_k = \eta'_{ik}D_k + i(\mathbf{D}\times\mathbf{G})_i. \qquad (101.7)$$

There is a similar linear relation

$$D_i = \varepsilon'_{ik}E_k + i(\mathbf{E}\times\mathbf{g})_i. \qquad (101.8)$$


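The connection between the coefficients in (101.7) and (101.8) is given by

$$\eta'_{ik} = \frac{1}{|\varepsilon|}\{|\varepsilon'|\,\varepsilon'^{-1}{}_{ik} - g_ig_k\}, \qquad G_i = -\frac{1}{|\varepsilon|}\varepsilon'_{ik}g_k, \qquad (101.9)$$

where $|\varepsilon|$ and $|\varepsilon'|$ are the determinants of the tensors $\varepsilon_{ik}$ and $\varepsilon'_{ik}$; cf. §22. A
medium in which the relation between E and D is of this form is said to be gyrotropic. The
vector g is called the gyration vector, and G the optical activity vector.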
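The connection between (g, $\varepsilon'_{ik}$) and (G, $\eta'_{ik}$) can be tested by direct numerical
inversion. The following Python sketch assumes that the relations in question read
$\eta'_{ik} = \{|\varepsilon'|\varepsilon'^{-1}{}_{ik} - g_ig_k\}/|\varepsilon|$ and $G_i = -\varepsilon'_{ik}g_k/|\varepsilon|$; the tensor
`eps_sym` and the vector `g` are arbitrary illustrative values, and `cross_matrix` is a
helper introduced only for this example.

```python
import numpy as np

def cross_matrix(v):
    """Return A with A[i, k] = e_ikl v_l, so that A @ E equals E x v."""
    x, y, z = v
    return np.array([[0.0, z, -y], [-z, 0.0, x], [y, -x, 0.0]])

# Hypothetical symmetric part of the permittivity and gyration vector.
eps_sym = np.array([[2.0, 0.1, 0.0],
                    [0.1, 2.5, 0.2],
                    [0.0, 0.2, 3.0]])
g = np.array([0.05, -0.02, 0.08])

# Hermitian tensor eps_ik = eps'_ik + i e_ikl g_l and its inverse eta_ik.
eps = eps_sym + 1j * cross_matrix(g)
eta = np.linalg.inv(eps)
det_eps = np.linalg.det(eps).real          # the determinant is real

# Symmetric part and dual vector G read off from the numerical inverse.
eta_sym = eta.real
G = np.array([eta.imag[1, 2], eta.imag[2, 0], eta.imag[0, 1]])

# The same quantities from the closed-form relations assumed above.
eta_sym_formula = (np.linalg.det(eps_sym) * np.linalg.inv(eps_sym)
                   - np.outer(g, g)) / det_eps
G_formula = -eps_sym @ g / det_eps
print(np.abs(eta_sym - eta_sym_formula).max(), np.abs(G - G_formula).max())
```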

† Of course, $\eta'_{ik}$ and $\eta''_{ik}$ are not the inverses of the tensors $\varepsilon'_{ik}$ and $\varepsilon''_{ik}$.

We shall give a general discussion of the nature of waves propagated in an arbitrary
gyrotropic medium, assumed anisotropic, with no restriction on the magnitude of the
magnetic field.†

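We take the direction of the wave vector as the z-axis. Then equations (97.21) become

$$(\eta_{\alpha\beta} - n^{-2}\delta_{\alpha\beta})D_\beta = (\eta'_{\alpha\beta} + i\eta''_{\alpha\beta} - n^{-2}\delta_{\alpha\beta})D_\beta = 0, \qquad (101.10)$$

where the suffixes $\alpha$, $\beta$ take the values x, y. The directions of the x and y axes are taken
along the principal axes of the two-dimensional tensor $\eta'_{\alpha\beta}$; and we denote the
corresponding principal values of this tensor by $1/n_{01}^2$ and $1/n_{02}^2$. Then the equations
become

$$\left(\frac{1}{n_{01}^2} - \frac{1}{n^2}\right)D_x + iG_zD_y = 0,$$
$$-iG_zD_x + \left(\frac{1}{n_{02}^2} - \frac{1}{n^2}\right)D_y = 0. \qquad (101.11)$$

The condition that the determinant of these equations vanishes gives an equation
quadratic in $n^2$:

$$\left(\frac{1}{n^2} - \frac{1}{n_{01}^2}\right)\left(\frac{1}{n^2} - \frac{1}{n_{02}^2}\right) = G_z^2, \qquad (101.12)$$

whose roots give the two values of n for a given direction of n:‡

$$\frac{1}{n^2} = \frac12\left(\frac{1}{n_{01}^2} + \frac{1}{n_{02}^2}\right) \pm \sqrt{\frac14\left(\frac{1}{n_{01}^2} - \frac{1}{n_{02}^2}\right)^2 + G_z^2}. \qquad (101.13)$$

Substituting these values in equations (101.11), we find the corresponding ratios $D_y/D_x$:

$$\frac{D_y}{D_x} = \frac{i}{G_z}\left\{\frac12\left(\frac{1}{n_{01}^2} - \frac{1}{n_{02}^2}\right) \mp \sqrt{\frac14\left(\frac{1}{n_{01}^2} - \frac{1}{n_{02}^2}\right)^2 + G_z^2}\right\}. \qquad (101.14)$$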

The purely imaginary value of the ratio $D_y/D_x$ signifies that the waves are elliptically
polarized, and the principal axes of the ellipses are the x and y axes. The product of the two
values of the ratio is easily seen to be unity. Thus, if in one wave $D_y = i\rho D_x$, where the real
quantity $\rho$ is the ratio of the axes of the polarization ellipse, then in the other wave
$D_y = -iD_x/\rho$. This means that the polarization ellipses of the two waves have the same axis
ratio, but are rotated 90° relative to each other, and the directions of rotation in them are
opposite (Fig. 57).

If the vectors D in the two waves are denoted by $\mathbf{D}_1$ and $\mathbf{D}_2$, these relations may be
written $\mathbf{D}_1\cdot\mathbf{D}_2^* = D_{1x}D_{2x}^* + D_{1y}D_{2y}^* = 0$. This is a general property of the eigenvectors
on reduction to diagonal form of an Hermitian tensor (in this case, the tensor $\eta_{\alpha\beta}$).
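These properties of the two waves can be illustrated numerically. The Python sketch below
assumes the two-dimensional Hermitian tensor with diagonal elements $1/n_{01}^2$, $1/n_{02}^2$ and
off-diagonal elements $\pm iG_z$; the values of `n01`, `n02` and `Gz` are arbitrary choices
made for the example. It compares the roots of the quadratic for $1/n^2$ with a direct
diagonalization and checks that the two ratios $D_y/D_x$ are purely imaginary, have unit
product, and correspond to mutually orthogonal vectors D.

```python
import numpy as np

n01, n02 = 1.50, 1.52      # hypothetical principal indices of eta'
Gz = 2.0e-3                # hypothetical component of G along the wave vector

a, b = 1.0 / n01**2, 1.0 / n02**2
root = np.sqrt(0.25 * (a - b) ** 2 + Gz**2)

# Two roots for 1/n^2 and the corresponding ratios D_y/D_x.
inv_n2 = np.array([0.5 * (a + b) + root, 0.5 * (a + b) - root])
ratios = 1j / Gz * (0.5 * (a - b) - np.array([root, -root]))

# Cross-check against direct diagonalization of the Hermitian tensor.
eta = np.array([[a, 1j * Gz], [-1j * Gz, b]])
eigenvalues = np.linalg.eigvalsh(eta)      # ascending order

# Orthogonality D_1 . D_2* = 0 of the two polarizations.
D1 = np.array([1.0, ratios[0]])
D2 = np.array([1.0, ratios[1]])
overlap = D1 @ np.conj(D2)
print(inv_n2, ratios, abs(overlap))
```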


† The medium is again assumed non-magnetic with respect to the variable field of the
electromagnetic wave, i.e. $\mu_{ik}(\omega) = \delta_{ik}$. This, however, does not exclude a static field
magnetizing the medium (i.e. the static permeability may differ from unity).

The properties derived for $\varepsilon_{ik}(\omega)$ are equally applicable to the tensor $\mu_{ik}(\omega)$ in a
frequency range where the dispersion of the magnetic permeability is of importance.

‡ When there is "7 ^ " 02 - It should be remembered, however, that when the field is
Fig. 57


The components $G_i$ and $\eta'_{ik}$ are functions of the magnetic field. If, as usually happens, the
magnetic field is fairly weak, we can expand in powers of the field. The vector G is zero in
the absence of the field, and so for a weak field we can put

$$G_i = f_{ik}H_k, \qquad (101.15)$$

where $f_{ik}$ is a tensor of rank two, in general not symmetrical. This dependence is in
accordance with the general rule whereby, in a transparent medium, the components of the
antisymmetrical tensor $\eta''_{ik}$ (and $\varepsilon''_{ik}$) must be odd functions of H. The symmetrical tensor
components $\eta'_{ik}$ are even functions of the magnetic field. The first correction terms
(which do not appear in the absence of the field) in $\eta'_{ik}$ are therefore quadratic in the field.
When second-order quantities are neglected, formulae (101.9) reduce to the simpler form

$$\eta'_{ik} = \varepsilon'^{-1}{}_{ik}, \qquad G_i = -\varepsilon'_{ik}g_k/|\varepsilon'|.$$

In the general case of an arbitrarily directed wave vector, the magnetic field has little 
effect on the propagation of light in the crystal, causing only a slight ellipticity of the 
oscillations, with an axis ratio of the polarization ellipse which is small (of the first order 
with respect to the field). 

The directions of the optical axes (and neighbouring directions) form an exception. The
two values of n are equal in the absence of the field. The roots of equation (101.12) then
differ from these values by first-order quantities,† and the resulting effects are analogous to
those in isotropic bodies, which we shall now consider.

The magnetic-optical effect in isotropic bodies (and in crystals of the cubic system) is of
particular interest on account of its nature and its comparatively large magnitude.

Neglecting second-order quantities, we have $\eta'_{ik} = \varepsilon^{-1}\delta_{ik}$, where $\varepsilon$ is the permittivity of
the isotropic medium in the absence of the magnetic field. The relation between D and E is

$$\mathbf{E} = \frac{1}{\varepsilon}\mathbf{D} + i\mathbf{D}\times\mathbf{G}, \qquad \mathbf{D} = \varepsilon\mathbf{E} + i\mathbf{E}\times\mathbf{g}; \qquad (101.16)$$

in the same approximation, the vectors g and G are related by

$$\mathbf{G} = -\mathbf{g}/\varepsilon^2. \qquad (101.17)$$

† It should be noticed that the two roots of (101.12) do not become equal. The geometrical
significance of this is that the two parts of the wave-vector surface are separated.

The dependence of g (or G) on the external field reduces in an isotropic medium to simple
proportionality:

$$\mathbf{g} = f\mathbf{H}, \qquad (101.18)$$

in which the scalar constant f may be either positive or negative.

In equation (101.12) we now have $n_{01} = n_{02} = n_0 = \sqrt{\varepsilon}$, the refractive index in the
absence of the field. Hence $1/n^2 = \pm G_z + 1/n_0^2$ or, to the same accuracy,

$$n_\pm^2 = n_0^2 \mp n_0^4G_z = n_0^2 \pm g_z. \qquad (101.19)$$

Since the z-axis is in the direction of n, we can write this formula, to the same accuracy, in
the vector form

$$(\mathbf{n} \mp \mathbf{g}/2n_0)^2 = n_0^2. \qquad (101.20)$$

Hence we see that the wave-vector surface in this case consists of two spheres with radius
$n_0$, whose centres are at distances $\pm g/2n_0$ from the origin in the direction of G.

A different polarization of the wave corresponds to each of the two values of n: we have

$$D_x = \pm iD_y, \qquad (101.21)$$

where the signs correspond to those in (101.19). The equality of the magnitudes of $D_x$ and
$D_y$, and their phase difference of $\mp\tfrac12\pi$, signify a circular polarization of the wave, with the
direction of rotation of the vector D respectively anticlockwise and clockwise looking
along the wave vector (or, to use the customary expressions, with right-hand and left-hand
polarization respectively).

The difference between the refractive indices in the left-hand and right-hand polarized 
waves has the result that two circularly polarized refracted waves are formed at the surface 
of a gyrotropic body. This phenomenon is called double circular refraction. 

Let a linearly polarized plane wave be incident normally on a slab of thickness l. We take
the direction of incidence as the z-axis, and that of the vector E (= D) in the incident wave
as the x-axis. The linear oscillation can be represented as the sum of two circular
oscillations with opposite directions of rotation, which are then propagated through the
slab with different wave numbers $k_\pm = \omega n_\pm/c$. Arbitrarily taking the wave amplitude as
unity, we have $D_x = \tfrac12[\exp(ik_+z) + \exp(ik_-z)]$, $D_y = \tfrac12i[-\exp(ik_+z) + \exp(ik_-z)]$, or,
putting $k = \tfrac12(k_+ + k_-)$ and $\kappa = \tfrac12(k_+ - k_-)$,

$$D_x = e^{ikz}\cos\kappa z,$$
$$D_y = e^{ikz}\sin\kappa z.$$

When the wave leaves the slab we have

$$D_y/D_x = \tan\kappa l = \tan(l\omega g/2cn_0). \qquad (101.22)$$

Since this ratio is real, the wave remains linearly polarized, but the direction of polarization
is changed (the Faraday effect). The angle through which the plane of polarization is
rotated is proportional to the path traversed by the wave; the angle per unit length in the
direction of the wave vector is

$$(\omega g/2cn_0)\cos\theta, \qquad (101.23)$$

where $\theta$ is the angle between n and g.

It should be noticed that, when the direction of the magnetic field is given, the direction
of rotation of the plane of polarization (with respect to the direction of n) is reversed (left-
hand becoming right-hand, and vice versa) when the sign of n is changed. If the ray
traverses the same path twice in opposite directions, the total rotation of the plane of
polarization is therefore double the value resulting from a single traversal.
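The rotation of the plane of polarization can be followed numerically by superposing the
two circular waves. The sketch below is an illustration in Gaussian units with n parallel to
g; the values of `omega`, `n0`, `g` and the thickness `l` are hypothetical, and the indices
are taken in the assumed form $n_\pm^2 = n_0^2 \pm g$.

```python
import numpy as np

c = 3.0e10        # velocity of light, cm/s
omega = 3.0e15    # hypothetical frequency, rad/s
n0 = 1.5          # hypothetical refractive index without the field
g = 1.0e-5        # hypothetical magnitude of the gyration vector
l = 1.0           # slab thickness, cm

# Refractive indices and wave numbers of the two circular waves.
n_plus = np.sqrt(n0**2 + g)
n_minus = np.sqrt(n0**2 - g)
k_plus, k_minus = omega * n_plus / c, omega * n_minus / c

# Superposition of the two circular waves at z = l (unit amplitude).
Dx = 0.5 * (np.exp(1j * k_plus * l) + np.exp(1j * k_minus * l))
Dy = 0.5j * (-np.exp(1j * k_plus * l) + np.exp(1j * k_minus * l))
ratio = Dy / Dx

# Rotation angle from the exact wave numbers and from the linear formula.
angle_exact = 0.5 * (k_plus - k_minus) * l
angle_formula = l * omega * g / (2 * c * n0)
print(ratio, angle_exact, angle_formula)
```

The ratio $D_y/D_x$ comes out real to within rounding error, so that the emergent wave is
linearly polarized, with its plane of polarization turned through the angle $\kappa l$.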

For $\theta = \tfrac12\pi$ (the wave vector perpendicular to the magnetic field), the effect linear in the
field given by formulae (101.19) disappears, in accordance with the general rule stated
above that only the component of g in the direction of n affects the propagation of light.
For angles $\theta$ close to $\tfrac12\pi$ we must therefore take account of the terms proportional to the
square of the field, and in particular these terms must be included in the tensor $\eta'_{ik}$. By
virtue of the axial symmetry about the direction of the field, two principal values of the
symmetrical tensor $\eta'_{ik}$ are equal, as for a uniaxial crystal. We shall take the x-axis in the
direction of the field, and denote by $\eta_\parallel$ and $\eta_\perp$ the principal values of $\eta'_{ik}$ in the directions
parallel and perpendicular to the magnetic field. The difference $\eta_\parallel - \eta_\perp$ is proportional to
$H^2$.

Let us consider the purely quadratic effect (called the Cotton-Mouton effect) which
occurs when n and g are perpendicular. In equations (101.11) and (101.12) we have $G_z = 0$,
and $1/n_{01}^2$, $1/n_{02}^2$ are respectively $\eta_\parallel$, $\eta_\perp$. Thus in one wave we have $1/n^2 = \eta_\parallel$, $D_y = 0$; this
wave is linearly polarized, and the vector D is parallel to the x-axis. In the other wave
$1/n^2 = \eta_\perp$, $D_x = 0$, i.e. D is parallel to the y-axis. Let linearly polarized light be incident
normally on a slab in a magnetic field parallel to its surface. The two components in the slab
(with vectors D in the xz and yz planes) are propagated with different values of n.
Consequently the light leaving the slab is elliptically polarized.

Lastly, let us consider one other peculiar effect that occurs in a medium whose optical
activity vector (101.15) is linear in the (static) magnetic field, namely the magnetization of a
non-magnetic transparent medium by a variable electric field (L. P. Pitaevskiĭ, 1960).

We start from the general formula (31.6)†

$$-\mathbf{B}/4\pi = \partial\tilde{U}/\partial\mathbf{H},$$

and take account of the contribution to $\tilde{U}$ from the variable electric field, which is given by
(80.11). According to the theorem of small increments to thermodynamic quantities, the
change in this contribution when the permittivity changes by a small amount is
(expressed in terms of the appropriate variables) the same as the change $\delta F$ in the free
energy. For the latter we can use formula (14.1), with an obvious generalization to
anisotropic media; the fact that this formula remains valid for a variable field (not a static
field as in §14) in a dispersive transparent medium has been mentioned in §81.‡ We thus
have

$$\delta\tilde{U} = -\delta\varepsilon_{ik}E_iE_k^*/16\pi = \delta\eta_{ik}D_iD_k^*/16\pi; \qquad (101.24)$$

the extra factor $\tfrac12$ takes account of the complex representation of E. The second equation
(101.24) follows because the definition $\varepsilon_{il}\eta_{lk} = \delta_{ik}$ gives $\eta_{lk}\,\delta\varepsilon_{il} = -\varepsilon_{il}\,\delta\eta_{lk}$.


† The tilde above the symbol U refers here to the magnetic variables, not the electric ones.
To simplify the notation, we omit the sign of time averaging from $\tilde{U}$.

‡ To derive (101.24) directly, it would be necessary to consider a dielectric-filled resonator
instead of the oscillatory circuit discussed in §81. By calculating the change in frequency due
to a small change in the permittivity (cf. §90, Problem 4) and using the adiabatic invariant
theorem, we find the change in the resonator

Now regarding the permittivity variation as the result of changing the static magnetic
field, we write

$$-\frac{\mathbf{B}}{4\pi} = \frac{\partial\tilde{U}_0}{\partial\mathbf{H}} + \frac{\partial\eta_{ik}}{\partial\mathbf{H}}\frac{D_iD_k^*}{16\pi},$$

where $\tilde{U}_0$ refers to the medium in the absence of the electric field. If the medium itself is
non-magnetic ($\mu = 1$), then $\partial\tilde{U}_0/\partial\mathbf{H} = -\mathbf{H}/4\pi$. The magnetization $\mathbf{M} = (\mathbf{B} - \mathbf{H})/4\pi$ is
then

$$\mathbf{M} = -\frac{\partial\eta_{ik}}{\partial\mathbf{H}}\frac{D_iD_k^*}{16\pi}.$$

In the absence of the external magnetic field, the derivative $\partial\eta_{ik}/\partial\mathbf{H}$ is to be taken at
H = 0. With $\eta_{ik}$ from (101.6) and (101.15), we obtain finally the following expression for
the magnetization due to the variable electric field:

$$M_i = -(i/16\pi)e_{klm}f_{mi}D_kD_l^*; \qquad (101.25)$$

this is quadratic in the electric field. If the medium is isotropic in the absence of the
magnetic field, then $f_{ik} = f\delta_{ik}$, and

$$\mathbf{M} = -if\,\mathbf{D}\times\mathbf{D}^*/16\pi. \qquad (101.26)$$

For a linearly polarized field, the vector D can differ from a real quantity only by a phase
factor; then D and D* are collinear, and (101.25) or (101.26) is zero. There is thus a
magnetization only in the presence of a rotating electric field. This effect is in a sense the
opposite of the rotation of the polarization plane in a magnetic field, and is expressed in
terms of the same tensor $f_{ik}$; it is therefore called the inverse Faraday effect.
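The dependence of the magnetization on the state of polarization is easily seen in a short
numerical example. The sketch below evaluates the isotropic expression
$\mathbf{M} = -if\,\mathbf{D}\times\mathbf{D}^*/16\pi$ for three complex amplitudes; the constant `f` and the amplitude
`D0` are set to arbitrary illustrative values, and the function name `magnetization` is
introduced only for this example.

```python
import numpy as np

def magnetization(D, f):
    """Evaluate M = -i f (D x D*) / (16 pi) for a complex amplitude D."""
    M = -1j * f * np.cross(D, np.conj(D)) / (16 * np.pi)
    return M.real    # D x D* is purely imaginary, so M is real

f = 1.0              # hypothetical constant, arbitrary units
D0 = 2.0             # hypothetical field amplitude

# Linear polarization: a real vector times a common phase factor.
D_linear = D0 * np.exp(0.7j) * np.array([1.0, 2.0, 0.0]) / np.sqrt(5.0)
# Two circular polarizations with opposite directions of rotation.
D_circ_a = D0 * np.array([1.0, 1j, 0.0]) / np.sqrt(2.0)
D_circ_b = D0 * np.array([1.0, -1j, 0.0]) / np.sqrt(2.0)

M_linear = magnetization(D_linear, f)
M_circ_a = magnetization(D_circ_a, f)
M_circ_b = magnetization(D_circ_b, f)
print(M_linear, M_circ_a, M_circ_b)
```

The magnetization vanishes for the linearly polarized field and has opposite signs, with
magnitude $fD_0^2/16\pi$, for the two directions of rotation.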


PROBLEMS 

Problem 1. Show by direct calculation that the direction of the (time) averaged Poynting vector in a wave 
propagated in a transparent gyrotropic medium is the same as that of the group velocity. 

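Solution. According to (59.9a),

$$\bar{\mathbf{S}} = c\,\mathrm{re}\,(\mathbf{E}^*\times\mathbf{H})/8\pi,$$

E and H being expressed in complex form. Proceeding as in the derivation of (97.16), we multiply equations
(97.15) by E* and H* respectively:

$$\mathbf{E}^*\cdot\delta\mathbf{D} = \mathbf{H}^*\cdot\delta\mathbf{H} + (\mathbf{E}^*\times\mathbf{H})\cdot\delta\mathbf{n},$$
$$\mathbf{H}^*\cdot\delta\mathbf{H} = \mathbf{D}^*\cdot\delta\mathbf{E} + (\mathbf{E}\times\mathbf{H}^*)\cdot\delta\mathbf{n}.$$

Adding these and noting that $\mathbf{E}^*\cdot\delta\mathbf{D} = \mathbf{D}^*\cdot\delta\mathbf{E}$, since the tensor $\varepsilon_{ik}$ is Hermitian, we find the
required result: $\delta\mathbf{n}\cdot\mathrm{re}\,(\mathbf{E}^*\times\mathbf{H}) = 0$.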

Problem 2. Determine the directions of the rays when a ray incident from a vacuum is refracted at the
surface of an isotropic body in a magnetic field.

Solution. The direction of the ray vector s is given by the normal to the wave-vector surface. Differentiating
the left-hand side of equation (101.20) with respect to the components of the vector n, we find that s is
proportional to $\mathbf{n} \mp \mathbf{g}/2n_0$. The square of the latter expression is $n_0^2$, and so the unit vector in the
direction of the ray is given by

$$\frac{\mathbf{s}}{s} = \frac{1}{n_0}\left(\mathbf{n} \mp \frac{\mathbf{g}}{2n_0}\right). \qquad (1)$$

Let the angle of incidence be $\theta$. The refracted rays do not in general lie in the plane of incidence, and their
directions are given by the angle $\theta'$ to the normal to the surface and the azimuth $\phi'$ measured from the plane of
incidence. We take the latter as the xz-plane, with the z-axis perpendicular to the surface. The components $n_x$ and
$n_y$ of the wave vector are unaltered by refraction. In the incident ray they are $n_x = \sin\theta$, $n_y = 0$. Substituting
these values in (1), we find the x and y components of the unit vector s/s, which give immediately the directions
of the refracted rays:

$$\sin\theta'\cos\phi' = \frac{1}{n_0}\left(\sin\theta \mp \frac{g_x}{2n_0}\right), \qquad \sin\theta'\sin\phi' = \mp\frac{g_y}{2n_0^2}.$$

When the angle of incidence is not small, the azimuth $\phi'$ is small, and we can write

$$\phi' = \mp\frac{g_y}{2n_0\sin\theta}, \qquad \sin\theta' = \frac{1}{n_0}\sin\theta \mp \frac{g_x}{2n_0^2}.$$

For normal incidence ($\theta = 0$) we take the xz-plane through the vector g; then $\phi' = 0$, and $\theta' \cong \sin\theta' =
\mp g_x/2n_0^2$. Although this formula does not involve $g_z$, it is not valid if $g_z = 0$, since the approximation linear in
the field is inadequate when n and g are perpendicular.

Problem 3. Determine the polarization of the reflected light when a linearly polarized wave is incident 
normally from a vacuum on the surface of a body rendered anisotropic by a magnetic field. 

Solution. For normal incidence the direction of the wave vector is unaltered by the passage of the wave into
the medium. In all three waves (incident, refracted and reflected) the vectors H are therefore parallel to the surface
(the xy-plane). The electric vector E in the incident and reflected waves is also parallel to the xy-plane; in the
refracted wave $E_z \neq 0$, but the relation between the x and y components of E and H is the same as in an isotropic
body ($H_x = -nE_y$, $H_y = nE_x$). If the polarization of the incident wave is the same as that of one of the two types
of wave which can be propagated in the anisotropic (refracting) medium concerned, with the given direction of n,
then there is only one refracted wave, which has this polarization. The problem is then formally identical with that
of reflection from an isotropic body, and the fields $\mathbf{E}_1$ and $\mathbf{E}_0$ in the reflected and incident waves are related by

$$\mathbf{E}_1 = (1 - n)\mathbf{E}_0/(1 + n), \qquad (1)$$

where n is the refractive index corresponding to this polarization.

The linear polarization can be regarded as resulting from the superposition of two circular polarizations with
opposite directions of rotation. If $\mathbf{E}_0$ in the incident wave is in the x-direction, we put $\mathbf{E}_0 = \mathbf{E}_0^+ + \mathbf{E}_0^-$, where
$E_{0x}^+ = iE_{0y}^+ = \tfrac12E_0$, $E_{0x}^- = -iE_{0y}^- = \tfrac12E_0$. Using formula (1) for each wave, with $n_\pm$ given by (101.19), we
obtain

$$E_{1x} = E_0\frac{1 - n_0}{1 + n_0}, \qquad E_{1y} = iE_0\frac{g\cos\theta}{n_0(1 + n_0)^2},$$

where $\theta$ is the angle between the direction of incidence and the vector g. Hence we see that the reflected wave is
elliptically polarized, the major axis of the ellipse being in the x-direction, and the ratio of the minor and major
axes being $g\cos\theta/n_0(n_0^2 - 1)$.
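The axis ratio of the reflected wave can be checked without linearizing in g. The Python
sketch below applies the isotropic reflection formula $(1 - n)/(1 + n)$ separately to the two
circular components, under the assumption that the component with $E_x = iE_y$ travels
with index $n_+$ and the other with $n_-$, where $n_\pm^2 = n_0^2 \pm g\cos\theta$; the values of
`n0` and `g_cos` are arbitrary.

```python
import numpy as np

n0 = 1.5           # hypothetical refractive index without the field
g_cos = 1.0e-4     # hypothetical value of g*cos(theta)
E0 = 1.0           # incident amplitude, polarized along x

n_plus = np.sqrt(n0**2 + g_cos)
n_minus = np.sqrt(n0**2 - g_cos)

def reflection(n):
    """Amplitude reflection coefficient at normal incidence from a vacuum."""
    return (1.0 - n) / (1.0 + n)

# Circular components: E_x = +i E_y (index n_plus) and E_x = -i E_y (index n_minus),
# each carrying E_x = E0/2.
E1x = 0.5 * E0 * (reflection(n_plus) + reflection(n_minus))
E1y = 0.5 * E0 * (-1j * reflection(n_plus) + 1j * reflection(n_minus))

axis_ratio = abs(E1y / E1x)
axis_ratio_formula = g_cos / (n0 * (n0**2 - 1.0))
print(axis_ratio, axis_ratio_formula)
```

The ratio $E_{1y}/E_{1x}$ is purely imaginary, so the axes of the ellipse lie along x and y, and
its modulus agrees with the first-order expression.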

Problem 4. Determine the limiting form of the frequency dependence of the gyration vector at high 
frequencies. 

Solution. The calculations are similar to those in §78, except that the electron equation of motion must
include the Lorentz force due to the static external magnetic field H:

$$m\,d\mathbf{v}'/dt = e\mathbf{E}_0e^{-i\omega t} + e\mathbf{v}'\times\mathbf{H}/c,$$

where $e = -|e|$ is the electron charge. If $\omega \gg |e|H/mc$, this equation can be solved by successive approximations.
As far as terms of the first order in H we have

$$\mathbf{v}' = \frac{ie}{m\omega}\mathbf{E} - \frac{e^2}{m^2c\omega^2}\mathbf{E}\times\mathbf{H},$$

and the induction is then

$$\mathbf{D} = \varepsilon(\omega)\mathbf{E} + if(\omega)\mathbf{E}\times\mathbf{H},$$

where $\varepsilon(\omega)$ is given by (78.1) and $f(\omega) = -4\pi Ne^3/cm^2\omega^3 = (|e|/2mc)\,d\varepsilon/d\omega$ (H. Becquerel, 1897).
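The equality of the two expressions for $f(\omega)$ can be confirmed numerically. The sketch
below assumes the high-frequency permittivity in the form $\varepsilon(\omega) = 1 - 4\pi Ne^2/m\omega^2$
and uses a central difference for $d\varepsilon/d\omega$; the electron density `N` and the frequency
`omega` are hypothetical values, and the constants are in Gaussian units.

```python
import numpy as np

e = -4.803e-10     # electron charge, esu (e = -|e|)
m = 9.109e-28      # electron mass, g
c = 2.998e10       # velocity of light, cm/s
N = 1.0e23         # hypothetical electron number density, cm^-3
omega = 1.0e17     # hypothetical frequency, rad/s

def eps(w):
    """High-frequency permittivity, assumed form 1 - 4 pi N e^2 / (m w^2)."""
    return 1.0 - 4 * np.pi * N * e**2 / (m * w**2)

# Direct expression f = -4 pi N e^3 / (c m^2 omega^3).
f_direct = -4 * np.pi * N * e**3 / (c * m**2 * omega**3)

# Expression through the dispersion, f = (|e| / 2mc) d(eps)/d(omega).
dw = 1.0e-6 * omega
deps_domega = (eps(omega + dw) - eps(omega - dw)) / (2 * dw)
f_from_dispersion = abs(e) / (2 * m * c) * deps_domega
print(f_direct, f_from_dispersion)
```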


§102. Mechanical-optical effects 

Besides the electric-optical and magnetic-optical effects, there are other ways in which
the optical symmetry of a medium can be changed by external agencies. These include, first
of all, the effect of elastic deformations on the optical properties of solids. In particular,
such deformations may render an isotropic solid body optically anisotropic. Such
phenomena are described by the inclusion in $\varepsilon_{ik}(\omega)$ of additional terms proportional to the
components of the strain tensor. The corresponding formulae are exactly the same as (16.1)
and (16.6) for the static permittivity, except that the coefficients are now functions of
frequency. In the deformation of an isotropic body, for example, we have

$$\varepsilon_{ik} = \varepsilon^{(0)}\delta_{ik} + a_1u_{ik} + a_2u_{ll}\delta_{ik}. \qquad (102.1)$$

The coefficients $a_1(\omega)$ and $a_2(\omega)$ are called elastic-optical constants.

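Another case is the occurrence of optical anisotropy in a non-uniformly moving fluid.
The corresponding general expression for the dielectric tensor is

$$\varepsilon_{ik} = \varepsilon^{(0)}\delta_{ik} + \lambda_1\left(\frac{\partial v_i}{\partial x_k} + \frac{\partial v_k}{\partial x_i}\right) + \tfrac12i\lambda_2\left(\frac{\partial v_k}{\partial x_i} - \frac{\partial v_i}{\partial x_k}\right), \qquad (102.2)$$

and represents the first terms in an expansion of $\varepsilon_{ik}$ in powers of the derivatives of the
velocity. The condition that absorption be absent ($\varepsilon_{ik}$ is Hermitian) means that $\lambda_1(\omega)$ and
$\lambda_2(\omega)$ must be real; $\varepsilon^{(0)}(\omega)$ is the permittivity of the fluid at rest. In an incompressible fluid
$\partial v_l/\partial x_l = \mathrm{div}\,\mathbf{v} = 0$, and the last two terms in (102.2) give zero on contraction.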

To investigate the electromagnetic properties of the moving fluid, we have to combine 
the formulae (76.9)-(76.11) for the electrodynamics of moving dielectrics (with a velocity v 
that depends on the coordinates) with (102.2). Here, however, the terms which contain both 
the velocity and its derivatives are to be neglected, as being outside the accuracy of the 
formulae. 

The second and third terms in (102.2) are respectively symmetrical and antisymmetrical
in the suffixes i, k. For uniform rotation of the fluid we have $\mathbf{v} = \boldsymbol{\Omega}\times\mathbf{r}$, where $\boldsymbol{\Omega}$ is the
angular velocity of rotation, and the symmetrical term is zero. The antisymmetrical term is
$i\lambda_2e_{ikl}\Omega_l$, so that the medium becomes gyrotropic, with gyration vector

$$\mathbf{g} = \lambda_2\boldsymbol{\Omega}. \qquad (102.3)$$

The quantity $\lambda_2$ contains contributions from two effects: the dispersion of the permittivity,
and the influence of Coriolis forces on it.
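The statement that a uniform rotation contributes only to the antisymmetrical part of the
velocity gradient can be verified directly. The Python sketch below is an illustration with
an arbitrary angular velocity `Omega`; it assumes that the antisymmetrical combination is
normalized as $\tfrac12(\partial v_k/\partial x_i - \partial v_i/\partial x_k)$, which for $\mathbf{v} = \boldsymbol{\Omega}\times\mathbf{r}$ reduces to
$e_{ikl}\Omega_l$.

```python
import numpy as np

Omega = np.array([0.3, -0.2, 0.5])        # hypothetical angular velocity

# For v = Omega x r the velocity gradient is constant: grad[i, k] = dv_i/dx_k.
grad = np.array([np.cross(Omega, e_k) for e_k in np.eye(3)]).T

symmetric = grad + grad.T                 # dv_i/dx_k + dv_k/dx_i
antisymmetric = 0.5 * (grad.T - grad)     # (1/2)(dv_k/dx_i - dv_i/dx_k)

# The tensor e_ikl Omega_l written out as a matrix.
ox, oy, oz = Omega
e_Omega = np.array([[0.0, oz, -oy], [-oz, 0.0, ox], [oy, -ox, 0.0]])
print(symmetric, antisymmetric)
```

With this normalization the antisymmetrical term has the form $i\lambda_2e_{ikl}\Omega_l$, i.e. that of
the gyrotropic term $ie_{ikl}g_l$ with g proportional to $\boldsymbol{\Omega}$.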

In a frame of reference moving with a given element of the fluid, the amplitude $\mathbf{E}_0$ of a
monochromatic wave (in the laboratory frame) rotates with angular velocity $-\boldsymbol{\Omega}$, i.e.
becomes a function of time satisfying the equation

$$d\mathbf{E}_0/dt = -\boldsymbol{\Omega}\times\mathbf{E}_0.$$

In this sense the wave becomes quasi-monochromatic, and the relation between D and E is
given by

$$\mathbf{D} = \varepsilon^{(0)}(\omega)\mathbf{E} + i\frac{d\varepsilon^{(0)}(\omega)}{d\omega}\frac{\partial\mathbf{E}_0}{\partial t}e^{-i\omega t}; \qquad (102.4)$$

the derivation is the same as that of (80.10), except that here $f(\omega) = \varepsilon^{(0)}(\omega)$. Substituting the
value of $d\mathbf{E}_0/dt$ and comparing the result with the definition of the gyration vector g in
(101.16), we find that the dispersion makes a contribution $d\varepsilon^{(0)}/d\omega$ to $\lambda_2$ (M. A. Player,
1976).

If we now put

$$\lambda_2 = \lambda_2^{(C)} + d\varepsilon^{(0)}/d\omega, \qquad (102.5)$$

then $\lambda_2^{(C)}$ is due only to the Coriolis forces, which are linear in $\boldsymbol{\Omega}$.

In a rotating frame of reference, the Hamiltonian of the system is

$$\hat{H}' = \hat{H} - \boldsymbol{\Omega}\cdot\hat{\mathbf{M}}_{\mathrm{mech}},$$

where $\hat{H}$ and $\hat{\mathbf{M}}_{\mathrm{mech}}$ are the ordinary operators of the energy and angular momentum of
the system (see SP 1, §34); the permittivity of a rotating medium must in principle be
calculated from this Hamiltonian. The expression is, however, analogous to the
Hamiltonian of a system in a magnetic field, written as far as the terms linear in H:

$$\hat{H}' = \hat{H} - \hat{\boldsymbol{\mathcal{M}}}\cdot\mathbf{H},$$

where $\hat{\boldsymbol{\mathcal{M}}}$ is the magnetic moment operator (see QM, §113). The analogy becomes exact if
the contribution to the permittivity in the frequency range considered arises only from the
orbital motion of the electrons in the atoms. Then $\hat{\boldsymbol{\mathcal{M}}} = (e/2mc)\hat{\mathbf{M}}_{\mathrm{mech}}$, where $e = -|e|$ is
the electron charge, and the two Hamiltonians differ only in the replacement of $\boldsymbol{\Omega}$ by
$e\mathbf{H}/2mc$. In this case, therefore, it is clear that

$$\lambda_2^{(C)} = 2mcf(\omega)/e, \qquad (102.6)$$

where $f(\omega)$ is given by (101.18) (N. B. Baranova and B. Ya. Zel'dovich, 1978).†

The effects due to the coefficient $\lambda_1$ are significant in such systems as suspensions and
colloidal solutions of anisotropically shaped particles. In this case the effect is due to the
orienting of particles suspended in the fluid by the action of the velocity gradients. Since a
uniform rotation cannot orient the particles, it follows that in this case $\lambda_2 \ll \lambda_1$ and the $\lambda_2$
term in (102.2) may be omitted. The effect given by the $\lambda_1$ term is called the Maxwell effect.

† Here it should be emphasized that a non-zero $\lambda_2^{(C)}$ may exist even in classical (not
quantum) theory. The familiar proposition that, in the classical theory, the thermodynamic
properties of bodies are independent of the Coriolis forces in uniform rotation (see SP 1,
§34) applies only to the properties in statistical equilibrium. The permittivity when $\omega \neq 0$
relates to non-equilibrium kinetic properties.

Lastly, it may be noted that the $\lambda_1$ term in (102.2) does not satisfy the generalized
principle of symmetry of the kinetic coefficients, which would imply that we have
$\varepsilon_{ik}(\omega; \mathbf{v}) = \varepsilon_{ki}(\omega; -\mathbf{v})$, since v changes sign under time reversal. This is not necessary,
however. The reason is that the derivation of the principle presupposes that the processes
described by the coefficients in question are the only cause of energy dissipation in the
system. In the present case, however, besides the dissipation in the variable electromagnetic
field in the wave, there is another cause of dissipation, which is unrelated to the field, namely
the internal friction in the non-uniform fluid stream. As regards the theory of generalized
susceptibilities, the $\lambda_1$ term describes the response of the system to a non-linear interaction,
the contribution to the induction from the field E and the velocity gradients jointly.† A
uniform rotation of the whole fluid does not involve any additional dissipation; the $\lambda_2$ term
in (102.2), which is found even with this kind of rotation, therefore satisfies the symmetry
principle: $\varepsilon_{ik}(\omega; \boldsymbol{\Omega}) = \varepsilon_{ki}(\omega; -\boldsymbol{\Omega})$.


PROBLEM 

Determine the rotation of the plane of polarization for a wave propagated parallel to the axis of a rotating 
dielectric body. 

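Solution. The problem amounts to determining the gyration vector, which is the sum of two parts: the
contribution (102.3) from the change in the permittivity and its dispersion, and the "kinematic" part due to the
presence of the velocity in the relations (76.10), (76.11); the latter part is to be calculated.

In Maxwell's equations

$$\mathrm{curl}\,\mathbf{E} = i\omega\mathbf{B}/c, \quad \mathrm{curl}\,\mathbf{H} = -i\omega\mathbf{D}/c, \quad \mathrm{div}\,\mathbf{B} = 0, \quad \mathrm{div}\,\mathbf{D} = 0, \qquad (1)$$

we express E and B in terms of D and H according to (76.10) and (76.11) with $\mu = 1$, take the curl of the first
equation, and use the remaining equations. The result is

$$\triangle\mathbf{D} + \frac{\varepsilon\omega^2}{c^2}\mathbf{D} + \frac{\varepsilon - 1}{c}\,\mathrm{curl}\,\mathrm{curl}\,(\mathbf{v}\times\mathbf{H}) + \frac{i\omega(\varepsilon - 1)}{c^2}\,\mathrm{curl}\,(\mathbf{D}\times\mathbf{v}) = 0; \qquad (2)$$

here we write $\varepsilon$ in place of $\varepsilon^{(0)}$ as used in the text of §102. Since all formulae are valid only as far as the first
order in v, the higher-order terms are omitted.

The last two terms in (2) give the required effect. We expand them and substitute $\mathbf{v} = \boldsymbol{\Omega}\times\mathbf{r}$, obtaining

$$\mathrm{curl}\,(\mathbf{v}\times\mathbf{H}) = -\mathbf{H}\times\boldsymbol{\Omega}, \qquad \mathrm{curl}\,(\mathbf{D}\times\mathbf{v}) = \mathbf{D}\times\boldsymbol{\Omega}.$$

When these differentiations have been carried out, the coordinate dependence of all remaining quantities reduces
to factors $e^{i\mathbf{k}\cdot\mathbf{r}}$ with $\mathbf{k}\parallel\boldsymbol{\Omega}$ according to the condition stated. Lastly, since in the zero-order approximation
with respect to v we have $\mathbf{H} = c\mathbf{k}\times\mathbf{E}/\omega$, $k^2 = \varepsilon\omega^2/c^2$, equation (2) becomes

$$\triangle\mathbf{D} + \frac{\varepsilon\omega^2}{c^2}\mathbf{D} + 2i\omega\frac{\varepsilon - 1}{c^2}\mathbf{D}\times\boldsymbol{\Omega} = 0$$

or

$$\left(\frac{1}{n^2} - \frac{1}{\varepsilon}\right)\mathbf{D} + \frac{2i(\varepsilon - 1)}{\varepsilon^2\omega}\mathbf{D}\times\boldsymbol{\Omega} = 0, \qquad (3)$$

where $k = n\omega/c$ and n is the refractive index in the rotating body. Comparison of (3) with (101.11) and (101.17)
shows that the "kinematic" contribution to the gyration vector is $2(\varepsilon - 1)\boldsymbol{\Omega}/\omega$ (E. Fermi, 1923). The rotation of
the plane of polarization of the wave is given by the total vector

$$\mathbf{g} = \left[\frac{2(\varepsilon - 1)}{\omega} + \frac{d\varepsilon}{d\omega} + \lambda_2^{(C)}\right]\boldsymbol{\Omega}.$$

In the high-frequency limit, when the atomic electrons in the substance may be regarded as free, $\varepsilon$ is given by
(78.1), the first two terms in the brackets cancelling.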
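The cancellation in the high-frequency limit is elementary but may be checked in a few
lines. The sketch below assumes the free-electron form $\varepsilon(\omega) = 1 - \omega_p^2/\omega^2$, where
$\omega_p^2$ is written for $4\pi Ne^2/m$; the numerical values of `omega_p` and `omega` are
arbitrary.

```python
omega_p = 2.0e16    # hypothetical plasma frequency, rad/s
omega = 1.0e17      # hypothetical wave frequency, rad/s

def eps(w):
    """Free-electron permittivity, assumed form 1 - omega_p^2 / w^2."""
    return 1.0 - omega_p**2 / w**2

def deps_domega(w):
    """Analytic derivative of eps with respect to the frequency."""
    return 2.0 * omega_p**2 / w**3

kinematic = 2.0 * (eps(omega) - 1.0) / omega    # coefficient of the kinematic part
dispersive = deps_domega(omega)                 # coefficient of the dispersion part
remainder = kinematic + dispersive
print(kinematic, dispersive, remainder)
```

Only the Coriolis contribution $\lambda_2^{(C)}\boldsymbol{\Omega}$ then remains in the total gyration vector.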


† There exist, in principle, other such effects. For instance, in a conducting medium with no centre of
symmetry, there can exist in $\varepsilon_{ik}$ pseudotensor terms of the form $\mathbf{E}\cdot\mathbf{H}\,\delta_{ik}$ or $H_iE_k + H_kE_i$, where E and H are
the static external fields (N. B. Baranova, Yu. V. Bogdanov and B. Ya. Zel'dovich, 1977). It is important to note
that these terms, which formally violate the symmetry principle, can occur only in a conducting medium, where
the static electric field causes additional dissipation.

CHAPTER XII 


SPATIAL DISPERSION 

§103. Spatial dispersion 

So far, in discussing the dielectric properties of matter, we have assumed that the
induction D(t, r) is determined by the electric field E(t′, r) at the same point r in space,
though (in the presence of dispersion) not only at the same time t but at all previous times
t′ ≤ t. This assumption is not always valid. In general, the value of D(t, r) depends on those
of E(t′, r′) in some region of space about the point r. The linear relation between D and E is
then written in a form which generalizes (77.3):

$$D_i(t,\mathbf r) = E_i(t,\mathbf r) + \int_0^\infty\!\!\int f_{ik}(\tau;\mathbf r,\mathbf r')\,E_k(t-\tau,\mathbf r')\,dV'\,d\tau; \tag{103.1}$$

here it has been written in the form applicable to an anisotropic medium. This non-local
relation is a result of what is called spatial dispersion, in which connection the ordinary
dispersion discussed in §77 is called time dispersion or frequency dispersion. For
monochromatic field components, whose dependence on t is given by the factor $e^{-i\omega t}$, the
relation becomes

$$D_i(\mathbf r) = E_i(\mathbf r) + \int f_{ik}(\omega;\mathbf r,\mathbf r')\,E_k(\mathbf r')\,dV'. \tag{103.2}$$

It may be noted immediately that in the majority of cases spatial dispersion is much less 
important than time dispersion. The reason is that for ordinary dielectrics the kernel of 
the integral operator decreases considerably even at distances | r — r' | that are large in 
comparison with the atomic dimensions a. In this chapter we shall consider, as hitherto, 
macroscopic fields averaged over physically infinitesimal volume elements. These fields 
must by definition vary only slightly over distances ~ a. As a first approximation we can 
then take $\mathbf E(\mathbf r') \approx \mathbf E(\mathbf r)$ outside the integration over $dV'$ in (103.1), and thus return to (77.3).
In such cases the spatial dispersion manifests itself only in the form of small corrections.
These, however, may lead (as we shall see) to qualitatively new physical phenomena and
may therefore be significant.

A different situation may arise in conducting media (metals, electrolyte solutions, or 
plasmas): the motion of free current-carriers causes non-locality over distances which may 
be much greater than atomic dimensions. There may then be important spatial dispersion 
even in the macroscopic theory.†

† For an isotropic conducting medium, the conditions for spatial dispersion to be negligible may be different
for the transverse and longitudinal permittivities. For the former, the characteristic distance $r_0$ over which the
kernel of the integral in (103.2) is non-zero is the smaller of $v/\omega$ and $l$, where v is the mean velocity of the carriers
and l their mean free path. For the longitudinal permittivity, $r_0$ is the smaller of $v/\omega$ and $(lv/\omega)^{1/2}$, the latter being
the distance travelled by the carriers by diffusion along the field in a time $\sim 1/\omega$; the diffusion coefficient $D \sim lv$.
The spatial dispersion is unimportant if $kr_0 \ll 1$.
Another manifestation of spatial dispersion is the Doppler broadening of absorption
lines in a gas. If an atom at rest has at the frequency $\omega_0$ an absorption line with negligible
width, then for a moving atom this frequency is shifted by the Doppler effect through a
distance $\mathbf k\cdot\mathbf v$, where v is the velocity of the atom ($v \ll c$). This causes the absorption
spectrum of the gas as a whole to contain a line with width $\Delta\omega \sim kv_T$, where $v_T$ is the mean
thermal velocity of the atoms. In turn, this broadening means that the permittivity of the
gas has an important spatial dispersion when $k \gtrsim |\omega - \omega_0|/v_T$.

The following comment is needed regarding the form in which (103.1) is written. No 
arguments based on symmetry (in space or time) can exclude the possible electric 
polarization of a dielectric in a non-uniform variable magnetic field. It may therefore be 
asked whether a magnetic-field term should be added to the right-hand side of (103.1) or 
(103.2). This is not necessary, however. The reason is that the fields E and B cannot be 
regarded as completely independent. They are related (in the monochromatic case) by the 
equation $\operatorname{curl}\mathbf E = i\omega\mathbf B/c$. By virtue of this equation, we can regard the dependence of D
on B as a dependence on the spatial derivatives of E, i.e. as one manifestation of non-locality.

When spatial dispersion is taken into account, it is often appropriate, and does not affect 
the generality of the theory, to write Maxwell’s equations in the form 

$$\operatorname{curl}\mathbf E = -\frac{1}{c}\frac{\partial\mathbf B}{\partial t},\qquad \operatorname{div}\mathbf B = 0, \tag{103.3}$$

$$\operatorname{curl}\mathbf B = \frac{1}{c}\frac{\partial\mathbf D}{\partial t},\qquad \operatorname{div}\mathbf D = 0, \tag{103.4}$$

without using H in addition to the mean magnetic field $\bar{\mathbf h} = \mathbf B$. Instead, all terms which
result from averaging the microscopic currents are included in the definition of D. The
previous separation of the mean current into two parts as in (79.3) is in general not unique.
In the absence of spatial dispersion, it is determined by the condition that P should be the
electric polarization locally related to E. When there is no such relation, it is more
convenient to put M = 0, B = H, and

$$\overline{\rho\mathbf v} = \partial\mathbf P/\partial t, \tag{103.5}$$

corresponding to the form (103.3), (103.4) of Maxwell’s equations.†

The components of the tensor $f_{ik}(\omega;\mathbf r,\mathbf r')$ (the kernel of the integral operator in (103.2))
satisfy the symmetry relations

$$f_{ik}(\omega;\mathbf r,\mathbf r') = f_{ki}(\omega;\mathbf r',\mathbf r). \tag{103.6}$$

This follows from the same arguments as were given in §96 for the tensor $\varepsilon_{ik}(\omega)$. The only
difference is that the interchange of the suffixes a and b in the generalized susceptibilities
$\alpha_{ab}$, which implies that of the tensor suffixes i and k and also that of the points r and r′, now
causes the interchange of the respective arguments in the functions $f_{ik}(\omega;\mathbf r,\mathbf r')$.‡


† We are, of course, referring to non-ferromagnetic bodies. Moreover, the following discussion relates only to
infinite homogeneous media, and the form of the boundary conditions on the equations is not discussed.

The above discussion gives a different view of the assertion in §79 that the permeability $\mu$ has no
meaning at optical frequencies. In that range, the effects due to the difference of $\mu$ from unity are in general
indistinguishable from those of the spatial dispersion of the permittivity.

‡ As always in applying the principle of symmetry of the kinetic coefficients, if the body is in
an external magnetic field or has a magnetic structure, the right-hand side of (103.6) is to be taken with the field
reversed or the structure time-reversed.
We shall consider an infinite macroscopically homogeneous medium. Then the kernel of
the integral operator in (103.1) or (103.2) depends only on the difference $\boldsymbol\rho = \mathbf r - \mathbf r'$. The
functions D and E can then be usefully expanded as Fourier integrals with respect
to coordinates as well as time, which reduces them to a set of plane waves whose dependence
on r and t is given by a factor $\exp[i(\mathbf k\cdot\mathbf r - \omega t)]$. For such waves, the relation between
D and E is

$$D_i = \varepsilon_{ik}(\omega,\mathbf k)E_k, \tag{103.7}$$

where

$$\varepsilon_{ik}(\omega,\mathbf k) = \delta_{ik} + \int_0^\infty\!\!\int f_{ik}(\tau;\boldsymbol\rho)\,e^{i(\omega\tau - \mathbf k\cdot\boldsymbol\rho)}\,dV\,d\tau. \tag{103.8}$$

In this description, the spatial dispersion reduces to a dependence of the permittivity 
tensor on the wave vector. 

The “wavelength” $1/k$ determines the distances over which the field varies considerably.
We can therefore say that spatial dispersion expresses the dependence of the macroscopic 
properties of matter on the spatial inhomogeneity of the electromagnetic field, just as the 
frequency dispersion expresses their dependence on the time variation of the field. When 
$k \to 0$, the field becomes uniform, and accordingly $\varepsilon_{ik}(\omega,\mathbf k)$ tends to the ordinary
permittivity $\varepsilon_{ik}(\omega)$.†
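
The approach of the permittivity to its value at k = 0 can be illustrated numerically. The following is only a minimal sketch and is not taken from the text: it assumes a static, scalar, isotropic Gaussian kernel of range `r0` (a hypothetical model; the names `eps_of_k`, `eps_exact`, `chi` and `r0` are illustrative), evaluates the spatial part of the integral in (103.8) by quadrature, and compares the result with the closed form $1 + \chi\exp(-k^2r_0^2/2)$ valid for that model.

```python
import numpy as np

def eps_of_k(k, chi=0.5, r0=1.0, n_points=20001):
    """Scalar, static analogue of (103.8) for an assumed Gaussian kernel.

    Model (hypothetical): f(rho) = chi * exp(-rho^2 / (2 r0^2)) / ((2 pi)^(3/2) r0^3).
    For an isotropic kernel the angular integration gives
        eps(k) = 1 + 4 pi * int_0^inf f(rho) * sin(k rho)/(k rho) * rho^2 d rho.
    """
    rho = np.linspace(0.0, 12.0 * r0, n_points)
    f = chi * np.exp(-rho**2 / (2.0 * r0**2)) / ((2.0 * np.pi) ** 1.5 * r0**3)
    # np.sinc(x) = sin(pi x)/(pi x), hence sin(k rho)/(k rho) = np.sinc(k rho / pi)
    integrand = 4.0 * np.pi * f * np.sinc(k * rho / np.pi) * rho**2
    # composite trapezoidal rule on the uniform grid
    return 1.0 + float(np.sum(0.5 * (integrand[1:] + integrand[:-1]) * np.diff(rho)))

def eps_exact(k, chi=0.5, r0=1.0):
    """Closed form for the same Gaussian model."""
    return 1.0 + chi * np.exp(-(k * r0) ** 2 / 2.0)

if __name__ == "__main__":
    for k in (0.01, 0.1, 1.0, 3.0):
        print(k, eps_of_k(k), eps_exact(k))
```

In this model the dependence on k is negligible for $kr_0 \ll 1$ and becomes appreciable only when $kr_0 \sim 1$.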
From the definition (103.8), 

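$$\varepsilon_{ik}(-\omega,-\mathbf k) = \varepsilon_{ik}^*(\omega,\mathbf k), \tag{103.9}$$

a generalization of (77.7). The symmetry (103.6), expressed in terms of the functions
$\varepsilon_{ik}(\omega,\mathbf k)$, now gives

$$\varepsilon_{ik}(\omega,\mathbf k;\mathfrak H) = \varepsilon_{ki}(\omega,-\mathbf k;-\mathfrak H), \tag{103.10}$$

where the parameter $\mathfrak H$, denoting the external magnetic field (if any), is written explicitly. If
the medium has a centre of symmetry, the components $\varepsilon_{ik}$ are even functions of k; an axial
vector $\mathfrak H$ is unchanged by inversion, and (103.10) thus becomes

$$\varepsilon_{ik}(\omega,\mathbf k;\mathfrak H) = \varepsilon_{ki}(\omega,\mathbf k;-\mathfrak H). \tag{103.11}$$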

The spatial dispersion does not affect the derivation of equation (96.5) for the energy 
dissipation. The condition that absorption be absent is therefore again that the tensor 
$\varepsilon_{ik}(\omega,\mathbf k)$ be Hermitian.

† More precisely, the dependence on k disappears when $kr_0 \ll 1$, where $r_0$ is the size of the region in which
$f_{ik}(\tau;\boldsymbol\rho)$ is significantly different from zero.

When spatial dispersion is present, the permittivity is a tensor, not a scalar, even in an
isotropic medium: a distinctive direction is generated by the wave vector. If the medium
not only is isotropic but also has a centre of symmetry, the tensor $\varepsilon_{ik}$ must be constructed
from the components of the vector k and the unit tensor $\delta_{ik}$ (in the absence of a centre of
symmetry, there may also be a term containing the antisymmetric unit tensor $e_{ikl}$; see
§104). The general form of such a tensor may be written as

$$\varepsilon_{ik}(\omega,\mathbf k) = \varepsilon_t(\omega,k)\left(\delta_{ik} - k_ik_k/k^2\right) + \varepsilon_l(\omega,k)\,k_ik_k/k^2, \tag{103.12}$$

where $\varepsilon_t$ and $\varepsilon_l$ depend only on the magnitude of the wave vector (and on $\omega$). If E is parallel
to the wave vector, then $\mathbf D = \varepsilon_l\mathbf E$; if $\mathbf E \perp \mathbf k$, then $\mathbf D = \varepsilon_t\mathbf E$. The quantities $\varepsilon_l$ and $\varepsilon_t$ are
accordingly called the longitudinal and transverse permittivities. When $k \to 0$, the
expression (103.12) should tend to $\varepsilon(\omega)\delta_{ik}$, which does not depend on the direction of k; it
is therefore clear that

$$\varepsilon_t(\omega,0) = \varepsilon_l(\omega,0) = \varepsilon(\omega). \tag{103.13}$$
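
The decomposition into transverse and longitudinal parts can be checked directly. The sketch below is illustrative only (the function name `eps_tensor` and the numerical values are assumptions, not part of the text): it builds the tensor of the form (103.12) from two given scalars and verifies that a field parallel to k is multiplied by the longitudinal permittivity and a field perpendicular to k by the transverse one.

```python
import numpy as np

def eps_tensor(k, eps_t, eps_l):
    """Tensor of the form (103.12) built from transverse and longitudinal permittivities."""
    k = np.asarray(k, dtype=float)
    k_hat = k / np.linalg.norm(k)
    longitudinal = np.outer(k_hat, k_hat)      # k_i k_k / k^2
    transverse = np.eye(3) - longitudinal      # delta_ik - k_i k_k / k^2
    return eps_t * transverse + eps_l * longitudinal

if __name__ == "__main__":
    k = np.array([1.0, 2.0, 2.0])
    eps = eps_tensor(k, eps_t=2.0, eps_l=3.5)   # illustrative values only
    e_par = k
    e_perp = np.cross(k, [1.0, 0.0, 0.0])
    print(np.allclose(eps @ e_par, 3.5 * e_par))     # D = eps_l E for E parallel to k
    print(np.allclose(eps @ e_perp, 2.0 * e_perp))   # D = eps_t E for E perpendicular to k
```

When the two scalars coincide, the tensor reduces to a multiple of the unit tensor, in agreement with (103.13).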

The description of the electromagnetic properties of an isotropic medium by means of the
permittivities $\varepsilon_l$ and $\varepsilon_t$ corresponds to Maxwell’s equations written in the form (103.3) and
(103.4). On the other hand, as $k \to 0$ and the spatial dispersion disappears, we can revert to
the description by means of $\varepsilon$ and $\mu$. There is consequently a certain relation between these
quantities (see Problem 1).

The analogy between (103.8) and (77.5) enables us to apply to each component $\varepsilon_{ik}(\omega,\mathbf k)$
as a function of the complex variable $\omega$ the results of the study of analytic properties in
§§77 and 82. These are analytic functions with no singularity in the upper half-plane of $\omega$,
and they satisfy (for any fixed value of k) the Kramers-Kronig dispersion relations. The
same is true of the functions $\varepsilon_l(\omega,k)$ and $\varepsilon_t(\omega,k)$ in (103.12). Here it must be borne in mind
that the function $\varepsilon_l$ with $k \neq 0$ does not tend to infinity as $\omega \to 0$ even in a conducting
medium, and therefore no subtraction is necessary here as it was in deriving (82.9); the fact
that $\varepsilon(\omega)$ becomes infinite in a conductor as $\omega \to 0$ is due to the uniformity (k = 0) of the
static field.

The time average (as defined in §80) of the electromagnetic field energy density in a
transparent medium with spatial dispersion is given by the previous formula (96.6). Since
now $\mu = 1$, we have

$$\bar U = \frac{1}{16\pi}\left[\frac{\partial(\omega\varepsilon_{ik})}{\partial\omega}E_i^*E_k + |\mathbf B|^2\right], \tag{103.14}$$

E and B being assumed written in complex form. In the energy flux density in such a
medium, an additional term appears:

$$\bar{\mathbf S} = \frac{c}{8\pi}\operatorname{re}(\mathbf E^*\times\mathbf B) - \frac{\omega}{16\pi}\frac{\partial\varepsilon_{ik}}{\partial\mathbf k}E_i^*E_k. \tag{103.15}$$

This formula is obtained by generalizing the derivation of (80.11): here, we have to
consider a wave spread over a small range, both in frequency and in wave-vector direction
(see Problem 2).

PROBLEMS 


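Problem 1. Find the relation between the functions $\varepsilon(\omega)$, $\mu(\omega)$ and the limiting values of $\varepsilon_l(\omega,k)$ and $\varepsilon_t(\omega,k)$
as $k \to 0$.

Solution. We compare the expressions for the averaged microscopic current $\overline{\rho\mathbf v}$ in the forms (103.5) and
(79.3). For a monochromatic field, we have in the first case

$$\overline{\rho v_i} = -i\omega\left[\varepsilon_{ik}(\omega,\mathbf k) - \delta_{ik}\right]E_k/4\pi,$$

and in the second case

$$\overline{\rho\mathbf v} = -i\omega\left[\varepsilon(\omega) - 1\right]\mathbf E/4\pi + ic\left[\mu(\omega) - 1\right]\mathbf k\times\mathbf H/4\pi.$$

Substituting in the first $\varepsilon_{ik}(\omega,\mathbf k)$ from (103.12), and in the second $\mathbf H = c\,\mathbf k\times\mathbf E/\omega\mu$ in accordance with Maxwell’s
equation, and equating the two expressions (for $k \to 0$), we find

$$1 - \frac{1}{\mu(\omega)} = \lim_{k\to0}\frac{\omega^2}{c^2k^2}\left[\varepsilon_t(\omega,k) - \varepsilon_l(\omega,k)\right]$$

on comparing the terms in $\mathbf k(\mathbf k\cdot\mathbf E)$. Together with (103.13), this gives the required relation.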
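Problem 2. Derive the formula (103.15) for the (time) averaged energy flux density in a medium with spatial
dispersion.

Solution. We start, as in §80, from formula (80.2), putting there H = B in accordance with the description
of the field by equations (103.3) and (103.4), expressing all quantities in complex form and averaging over time:

$$-\operatorname{div}\frac{c}{8\pi}\operatorname{re}(\mathbf E\times\mathbf B^*) = \frac{1}{8\pi}\operatorname{re}\left(\mathbf E^*\cdot\frac{\partial\mathbf D}{\partial t} + \mathbf B^*\cdot\frac{\partial\mathbf B}{\partial t}\right). \tag{1}$$

Let us consider an almost monochromatic plane wave with

$$\mathbf E = \mathbf E_0(t,\mathbf r)\,e^{i(\mathbf k_0\cdot\mathbf r - \omega_0t)},$$

where $\mathbf E_0(t,\mathbf r)$ is a function varying slowly in space and time. The derivative $\partial D_i/\partial t$ is written in terms of an
operator $\hat f_{ik}$ as

$$\partial D_i/\partial t = \hat f_{ik}E_k; \tag{2}$$

for a strictly monochromatic wave,

$$\hat f_{ik}E_k = f_{ik}E_k = -i\omega\,\varepsilon_{ik}(\omega,\mathbf k)E_k.$$

Expanding $\mathbf E_0(t,\mathbf r)$ as a Fourier integral with respect to time and coordinates, we can express it as a superposition
of components $\mathbf E_{0\alpha\mathbf q}\,e^{i(\mathbf q\cdot\mathbf r - \alpha t)}$ with $\alpha \ll \omega_0$ and $q \ll k_0$. We then proceed as in the derivation of (80.10). Applying
the operator (2) to the function

$$\mathbf E_{\omega_0+\alpha,\,\mathbf k_0+\mathbf q} = \mathbf E_{0\alpha\mathbf q}\exp[i(\mathbf k_0+\mathbf q)\cdot\mathbf r - i(\omega_0+\alpha)t]$$

gives, to the first order in $\alpha$ and q,

$$f_{ik}(\omega_0+\alpha,\mathbf k_0+\mathbf q)\,E_{k;\,\omega_0+\alpha,\,\mathbf k_0+\mathbf q} \approx \left[f_{ik}(\omega_0,\mathbf k_0) + \alpha\frac{\partial f_{ik}}{\partial\omega_0} + \mathbf q\cdot\frac{\partial f_{ik}}{\partial\mathbf k_0}\right]E_{k;\,\omega_0+\alpha,\,\mathbf k_0+\mathbf q}.$$

Now carrying out the inverse summation of the Fourier components, substituting $f_{ik}(\omega,\mathbf k) = -i\omega\,\varepsilon_{ik}(\omega,\mathbf k)$ and
omitting the suffix 0 from $\omega_0$ and $\mathbf k_0$, we find

$$\frac{\partial D_i}{\partial t} = -i\omega\,\varepsilon_{ik}E_k + \left[\frac{\partial(\omega\varepsilon_{ik})}{\partial\omega}\frac{\partial E_{0k}}{\partial t} - \omega\frac{\partial\varepsilon_{ik}}{\partial k_l}\frac{\partial E_{0k}}{\partial x_l}\right]e^{i(\mathbf k\cdot\mathbf r - \omega t)}. \tag{3}$$

The second term in the square brackets is the difference between this expression and (80.10). Substitution of (3) in
(1) brings the latter to the energy conservation equation

$$\partial\bar U/\partial t = -\operatorname{div}\bar{\mathbf S},$$

with $\bar U$ and $\bar{\mathbf S}$ given by (103.14), (103.15).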


§104. Natural optical activity 

If the spatial dispersion is weak, the tensor $\varepsilon_{ik}(\omega,\mathbf k)$ can be expanded in powers of k. For
ordinary solid or liquid dielectric media, the expansion is one in powers of $a/\lambda$, where a is
the atomic dimension and $\lambda$ the field wavelength.

As far as the first-order small terms, the expansion is

$$\varepsilon_{ik}(\omega,\mathbf k) = \varepsilon^{(0)}_{ik}(\omega) + i\gamma_{ikl}k_l, \tag{104.1}$$

where $\varepsilon^{(0)}_{ik} = \varepsilon_{ik}(\omega, 0)$, and $\gamma_{ikl}$ is a tensor of rank three which depends on the frequency; as
$\omega \to 0$, its components (which do not pertain to any expansion in powers of $\omega$) tend to
constants. The relation between D and E in a monochromatic field ($\propto e^{-i\omega t}$), corresponding
to (104.1), in the coordinate representation, is

$$D_i = \varepsilon^{(0)}_{ik}E_k + \gamma_{ikl}\,\partial E_k/\partial x_l. \tag{104.2}$$
Applying to (104.1) the symmetry condition (103.10), we have, in the absence of an
external field,

$$\gamma_{ikl} = -\gamma_{kil}. \tag{104.3}$$

In the absence of dissipation, the Hermitian condition gives $\gamma_{ikl}^* = -\gamma_{kil}$; hence, using (104.3), we find
$\gamma_{ikl}^* = \gamma_{ikl}$. Thus the condition for absorption to be absent is that the
tensor $\gamma_{ikl}$ be real.

If we use the notation of §97 and express the wave vector of a plane wave as $\mathbf k = \omega\mathbf n/c$, the
expression (104.1) becomes

$$\varepsilon_{ik} = \varepsilon^{(0)}_{ik} + i\frac{\omega}{c}\gamma_{ikl}n_l. \tag{104.4}$$

Instead of the antisymmetrical tensor of rank two $\gamma_{ikl}n_l$ we shall use the axial gyration
vector g, which is dual to it. This vector is given by

$$\omega\gamma_{ikl}n_l/c = e_{ikl}g_l, \tag{104.5}$$

so that

$$\varepsilon_{ik} = \varepsilon^{(0)}_{ik} + ie_{ikl}g_l, \tag{104.6}$$

which is formally the same as the expression used in §101. The only difference is that in 
§101 the vector g depended only on the properties of the medium (and on the applied 
magnetic field), whereas here the gyration vector depends also on the wave vector of the 
field. According to (104.5) the components of this vector are linear functions of the 
components of n, i.e.

$$g_i = g_{ik}n_k. \tag{104.7}$$

Substituting (104.7) in (104.5), we find $\omega\gamma_{ikl}n_l/c = e_{ikm}g_{ml}n_l$, or, since n is arbitrary,

$$\omega\gamma_{ikl}/c = e_{ikm}g_{ml}, \tag{104.8}$$

which gives the relation between the components of the tensor $\gamma_{ikl}$ of rank three and the
pseudotensor $g_{ik}$ of rank two.†

The particular crystallographic symmetry of the body places certain restrictions on the
components of the tensor $\gamma_{ikl}$ (or $g_{ik}$) and, in particular, may have the result that all the
components are zero. For example, the tensor $\gamma_{ikl}$ cannot exist in bodies having a centre of
symmetry: when the sign of each coordinate is changed (inversion), all the components of a
tensor of rank three (and of a pseudotensor of rank two) change sign, whereas by the
symmetry of the body they must remain unchanged by this transformation.

Bodies in which the tensor $g_{ik}$ is not zero are said to have natural optical activity or
natural gyrotropy. Thus the existence of optical activity certainly implies that the body has
no centre of symmetry.

Let us first consider the natural optical activity of isotropic bodies. If a liquid or gas 
consists of a substance having no stereoisomer, it is symmetrical not only with respect to 
any rotation but also with respect to reflection (inversion) about any point, and can have 
no optical activity. Such activity can occur only in fluids having two stereoisomeric forms, 
and the two forms must be present in different quantities. The fluid then has no centre of 
symmetry. 

† In components, $g_{xx} = \omega\gamma_{yzx}/c$, $g_{xy} = \omega\gamma_{yzy}/c$, etc.

In an isotropic body, and in crystals of the cubic system, the pseudotensor $g_{ik}$ reduces to
a pseudoscalar f:

$$g_{ik} = f\delta_{ik},\qquad \gamma_{ikl} = (c/\omega)fe_{ikl}. \tag{104.9}$$

A pseudoscalar is a quantity which changes sign on inversion of the coordinates. The two
stereoisomers are formally converted into one another by the operation of inversion, and
so their values of f are the same with opposite signs.

Thus, in an optically active isotropic body, the gyration vector $\mathbf g = f\mathbf n$, and the relation
between the electric induction and field in the wave is given by

$$\mathbf D = \varepsilon^{(0)}\mathbf E + if\,\mathbf E\times\mathbf n. \tag{104.10}$$

Since D • n = 0, it follows that E • n = 0. That is, in such a wave not only the induction D 
(as in any medium) but also the field E is transverse to the direction of n. 

The change in the refractive index n when allowance is made for the natural optical
activity is a small quantity. In determining this change we can therefore put $n = n_0
= \sqrt{\varepsilon^{(0)}}$ in the small term $\mathbf E\times\mathbf g$ in (104.10). Then the problem of calculating the difference
$n - n_0$ is formally identical with that considered in §101 of the change in n due to the
magnetic field, except that g has a different meaning and is always parallel to n (the z-axis in
§101). By analogy with (101.19) we can therefore derive immediately the equation

$$n^2 = n_0^2 \pm fn_0. \tag{104.11}$$

These two values correspond (cf. (101.21)) to the following ratios of the two components
of E (or D):

$$E_x = \pm iE_y, \tag{104.12}$$

i.e. to waves which are left-hand and right-hand circularly polarized. It may also be noted 
that the magnitude of n is independent of its direction, and therefore the direction of n is 
the same as that of the ray vector s. 

Thus we see that the optical properties of a naturally active isotropic body resemble 
those of an inactive body in a magnetic field: it exhibits double circular refraction, and 
when a linearly polarized wave is propagated in it the plane of polarization is rotated. The 
angle of rotation per unit path length of the ray is $\omega f/2c$.
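
The size of the rotation can be estimated from (104.11). The sketch below is illustrative only: the function names and the numerical values of `n0`, `f` and `omega` are assumptions chosen for the example and do not refer to any particular substance. It computes the two refractive indices from (104.11), forms the rotation per unit length as $(\omega/2c)(n_+ - n_-)$, and compares this with the first-order result $\omega f/2c$.

```python
import math

C_LIGHT = 2.99792458e10  # velocity of light in cm/s (Gaussian units, as in the text)

def circular_indices(n0, f):
    """Refractive indices of the two circularly polarized waves, from n^2 = n0^2 +/- f*n0."""
    n_plus = math.sqrt(n0**2 + f * n0)
    n_minus = math.sqrt(n0**2 - f * n0)
    return n_plus, n_minus

def rotation_per_length(omega, n0, f):
    """Rotation angle of the plane of polarization per unit path length (rad/cm)."""
    n_plus, n_minus = circular_indices(n0, f)
    return omega * (n_plus - n_minus) / (2.0 * C_LIGHT)

if __name__ == "__main__":
    omega, n0, f = 3.0e15, 1.5, 1.0e-5   # hypothetical values
    print(rotation_per_length(omega, n0, f), omega * f / (2.0 * C_LIGHT))
```

The two numbers agree to within terms of relative order $(f/n_0)^2$, and reversing the sign of f reverses the sense of rotation.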

The sign of the constant f, and therefore the direction of rotation, are opposite for the
two stereoisomers, and we therefore speak of dextrorotatory and laevorotatory
stereoisomers.

Unlike the rotation of the plane of polarization in a magnetic field, the magnitude and
sign of the rotation in naturally active substances do not depend on the direction of
propagation of the ray. Hence, if a linearly polarized ray traverses the same path in a
naturally active medium twice in opposite directions, the plane of polarization is
unchanged.

Let us now consider naturally active crystals. We shall not give here a systematic analysis
of all possible cases of symmetry (see the Problem), but simply note that natural activity is
impossible if a centre of symmetry is present, but possible if there is a plane of symmetry or
a rotary-reflection axis. It should be emphasized that the conditions for the existence of
natural activity in crystals are not the same as those which allow the existence of crystals in
two mirror-image (enantiomorphic) forms; the latter conditions are more stringent, and
require the absence of both a centre and a plane of symmetry. Thus a crystal can be
optically active and yet be identical with its mirror image.

In a naturally active crystal (uniaxial or biaxial), when light is propagated with an 
arbitrary direction of the wave vector, we have essentially ordinary double refraction of
linearly polarized waves; the allowance for the activity would amount essentially to
replacing the strictly linear polarization by an elliptical polarization with an axis ratio of
the first order of smallness.

The only exception is formed by the directions of the optical axes, along which, if the
activity is neglected, the two roots of Fresnel’s equation coincide. In these directions the
phenomenon of natural activity of crystals is analogous to that of isotropic bodies: double
circular refraction of the first order occurs, with a corresponding rotation of the plane of
polarization of linearly polarized waves. These phenomena rapidly disappear as the wave
vector deviates from the direction of the optical axis.

For a quantitative calculation of natural activity in crystals it is more convenient to use,
not the expression giving D in terms of E, but the inverse, as in §101. As far as first-order
quantities this is

$$E_i = \varepsilon^{(0)-1}{}_{ik}D_k + i(\mathbf D\times\mathbf G)_i, \tag{104.13}$$

where the vector G is related to the g previously used by $G_i = -\varepsilon^{(0)}_{ik}g_k/|\varepsilon^{(0)}|$; see (101.9).

Owing to the formal correspondence between this expression and (101.7), the equations
(101.11) and (101.12) are again valid. In these equations $G_z$ is the component of G in the
direction of n. If we write G in the form

$$G_i = G_{ik}n_k \tag{104.14}$$

in analogy with (104.7), the component $G_z$ is proportional to

$$\mathbf n\cdot\mathbf G = G_{ik}n_in_k. \tag{104.15}$$

This quadratic form determines the optical properties of a naturally active crystal. The
tensor $G_{ik}$ itself need not be symmetrical, but if it is separated into symmetrical and
antisymmetrical parts the latter does not appear in the form (104.15). Thus we conclude
that the tensor $G_{ik}$ may be assumed symmetrical in discussing the optical properties of
naturally active crystals.


PROBLEM

Find the restrictions imposed by crystal symmetry on the components of the tensor $G_{ik}$.

Solution. Under any rotation, the pseudotensor $G_{ik}$ behaves as a true tensor; in particular, the presence of
an n-fold axis of symmetry with n > 2 results, as for a true symmetrical tensor of rank two, in complete isotropy
in a plane perpendicular to the axis. The behaviour of the pseudotensor under reflection is determined by the
fact that it is dual to a true tensor of rank three: under any reflection which changes the sign of a given component
of a true tensor of rank two, the corresponding component of $G_{ik}$ remains unchanged, and vice versa. For
example, on reflection in the yz-plane the components $G_{xx}$, $G_{yy}$, $G_{zz}$, $G_{yz}$ change sign, but $G_{xy}$, $G_{xz}$ do not.

We give below the non-vanishing components of the tensor $G_{ik}$ for all crystal classes which allow natural
activity. The z-axis is taken along the threefold, fourfold, or sixfold axis of symmetry or (in the classes $C_2$, $C_{2v}$)
along the only twofold axis of symmetry or (in the class $C_s$) perpendicular to the plane of symmetry. When three
mutually perpendicular axes of symmetry are present, they are the coordinate axes.

Class $C_1$: all $G_{ik}$.

Class $C_2$: $G_{xx}$, $G_{yy}$, $G_{zz}$, $G_{xy}$, the last of which may be made to vanish by a suitable choice of the x and y axes.

Class $C_s$: $G_{xz}$, $G_{yz}$, one of which may be made to vanish by a suitable choice of the x and y axes.

Class $C_{2v}$: $G_{xy}$ (the xz and yz planes being planes of symmetry).

Class $D_2$: $G_{xx}$, $G_{yy}$, $G_{zz}$.

Classes $C_3$, $C_4$, $C_6$, $D_3$, $D_4$, $D_6$: $G_{xx} = G_{yy}$, $G_{zz}$.

Class $S_4$: $G_{xx} = -G_{yy}$, $G_{xy}$, one of which may be made to vanish by a suitable choice of the x and y axes.

Class $D_{2d}$: $G_{xy}$ (the x and y axes being in vertical planes of symmetry).

Classes T, O: $G_{xx} = G_{yy} = G_{zz}$.
It may be noted that, in uniaxial crystals of the classes $S_4$ and $D_{2d}$, the scalar (104.15) is zero if the vector n is in
the z-direction, since $G_{zz} = 0$. This means that in these crystals there is no natural-activity effect in the direction of
the optical axis.

In a biaxial crystal of the class $C_{2v}$, the optical axes are in one of the planes of symmetry. For vectors n
lying in the xz or yz plane the scalar (104.15) is again identically zero, so that here also there is no effect in
the direction of the optical axes. The only crystal class which allows rotation of the plane of polarization
along the optical axis but not enantiomorphism is the monoclinic class $C_s$.
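
The restrictions listed in this Problem can be checked mechanically. The following is a minimal sketch, not the method of the text: it takes a generic symmetric trial matrix (the values in `TRIAL` are arbitrary), generates the point group from a few generators supplied by the user, and averages the pseudotensor transformation $G \to (\det R)\,RGR^{T}$ over the group; the components which survive are those allowed by the symmetry. All names (`rot_z`, `close_group`, `allowed_G`, and the constants) are hypothetical helpers introduced for the illustration.

```python
import numpy as np

def rot_z(angle):
    """Proper rotation through `angle` about the z-axis."""
    c, s = np.cos(angle), np.sin(angle)
    return np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])

ROT2_X = np.diag([1.0, -1.0, -1.0])     # twofold rotation about the x-axis
MIRROR_XZ = np.diag([1.0, -1.0, 1.0])   # reflection in the xz-plane
MIRROR_XY = np.diag([1.0, 1.0, -1.0])   # reflection in the xy-plane

# Generic symmetric trial matrix (arbitrary numbers, no special symmetry).
TRIAL = np.array([[1.0, 2.0, 3.0], [2.0, 5.0, 7.0], [3.0, 7.0, 11.0]])

def close_group(generators):
    """All distinct products of the generators (finite point groups only)."""
    group = [np.eye(3)]
    frontier = [np.eye(3)]
    while frontier:
        new = []
        for g in frontier:
            for h in generators:
                cand = h @ g
                if not any(np.allclose(cand, x) for x in group):
                    group.append(cand)
                    new.append(cand)
        if len(group) > 48:
            raise ValueError("generators do not close into a crystallographic point group")
        frontier = new
    return group

def allowed_G(generators, trial=TRIAL):
    """Part of the symmetric pseudotensor `trial` invariant under the group."""
    group = close_group(generators)
    total = sum(np.linalg.det(r) * (r @ trial @ r.T) for r in group)
    return total / len(group)

if __name__ == "__main__":
    print("C2v:\n", np.round(allowed_G([rot_z(np.pi), MIRROR_XZ]), 12))
    print("D2:\n", np.round(allowed_G([rot_z(np.pi), ROT2_X]), 12))
    print("S4:\n", np.round(allowed_G([MIRROR_XY @ rot_z(np.pi / 2)]), 12))
```

For the three examples printed, only $G_{xy}$ survives in the class $C_{2v}$, only the diagonal components in $D_2$, and in $S_4$ one finds $G_{xx} = -G_{yy}$ and $G_{xy}$ with $G_{zz} = 0$, in agreement with the list above.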


§105. Spatial dispersion in optically inactive media 

In crystals whose symmetry does not allow natural optical activity, the first terms (after
the zero-order one) in the expansion of the permittivity $\varepsilon_{ik}(\omega,\mathbf k)$ in powers of k are
quadratic terms.

As usual in crystal optics, it is convenient in the subsequent analysis to write down this
expansion for the inverse tensor $\eta_{ik} = \varepsilon^{-1}{}_{ik}$. We put

$$\eta_{ik}(\omega,\mathbf k) = \eta^{(0)}_{ik}(\omega) + \beta_{iklm}k_lk_m. \tag{105.1}$$

The tensor $\beta_{iklm}$ may be regarded as symmetrical in the second pair of suffixes, since it is
multiplied by the symmetrical product $k_lk_m$. From (103.10) (with $\mathfrak H = 0$) it is also
symmetrical in the first pair of suffixes:

$$\beta_{iklm} = \beta_{kilm} = \beta_{ikml}. \tag{105.2}$$

It need not, however, be symmetrical with respect to the interchange of both pairs. In the
absence of absorption, since the tensor $\eta_{ik}$ is Hermitian and symmetrical, the tensor $\beta_{iklm}$ is
real, and we shall assume this to be so.

In an isotropic medium, $\beta_{iklm}$ must be expressed in terms of the unit tensor only, i.e. must
have the form

$$\beta_{iklm} = \beta_1\delta_{ik}\delta_{lm} + \tfrac{1}{2}\beta_2(\delta_{il}\delta_{km} + \delta_{im}\delta_{kl});$$

it contains only two independent components. In an isotropic body we also have
$\eta^{(0)}_{ik} = \delta_{ik}/\varepsilon^{(0)}$, and so the tensor (105.1) becomes

$$\eta_{ik} = (1/\varepsilon^{(0)} + \beta_1k^2)\delta_{ik} + \beta_2k_ik_k, \tag{105.3}$$

in accordance with the general expression (103.12) for the dielectric tensor in an isotropic
medium with spatial dispersion. The wave propagation in the medium is governed by the
equations (97.21). On substitution of (105.3) in these equations, however, the anisotropic
term in $\beta_2$ disappears, since the vectors D and k are orthogonal in a plane wave, i.e. the
medium remains optically isotropic, as it should.

Even in cubic crystals, however, the tensor $\beta_{iklm}$ does not reduce to the unit tensor;
depending on the crystal class, it has either three or four independent components. When
spatial dispersion is neglected, cubic crystals are optically isotropic; the inclusion of the
dispersion quadratic in k brings about a new property in them, namely optical anisotropy
(H. A. Lorentz, 1878).

In a cubic crystal $\eta^{(0)}_{ik} = \delta_{ik}/\varepsilon^{(0)}$, and the expansion (105.1) becomes

$$\eta_{ik} = \delta_{ik}/\varepsilon^{(0)} + \beta_{iklm}k_lk_m. \tag{105.4}$$

Substitution of this in equations (97.21) gives

$$\left[\left(\frac{1}{n^2} - \frac{1}{n_0^2}\right)\delta_{\alpha\beta} - \frac{\omega^2n^2}{c^2}\beta_{\alpha\beta33}\right]D_\beta = 0, \tag{105.5}$$

where $n_0^2 = \varepsilon^{(0)}$ and the $x_3$-axis in Cartesian coordinates $x_1$, $x_2$, $x_3$ is parallel to the wave
vector. From the sense of the expansion (105.4), the second term in the square brackets in
these equations is to be regarded as a small correction (see §106 concerning the special case
in which $1/n_0^2 = 0$). We can then replace $n^2$ in that term by $n_0^2$:

$$\left[\left(\frac{1}{n^2} - \frac{1}{n_0^2}\right)\delta_{\alpha\beta} - \frac{\omega^2n_0^2}{c^2}\beta_{\alpha\beta33}\right]D_\beta = 0. \tag{105.6}$$

These equations have the same form as for waves in a non-cubic crystal when spatial
dispersion is neglected. Their determinant is an expression quadratic in $n^{-2}$, whose
vanishing determines the refractive indices of the two waves with the same direction of k
but different polarizations. Thus the spatial dispersion in a cubic crystal removes the
“polarization degeneracy”: the velocities of the two waves become different, and
direction-dependent.
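
The removal of the polarization degeneracy can be seen in a small numerical example. The sketch below is illustrative only: it assumes the highest cubic symmetry, for which the tensor has three independent components (here given the hypothetical names `b_xxxx`, `b_xxyy`, `b_xyxy` with arbitrary numerical values), contracts the tensor with the direction of k, and diagonalizes the result in the plane perpendicular to k. The two eigenvalues play the role of the eigenvalues of the tensor $\beta_{\alpha\beta33}$ in (105.6).

```python
import numpy as np

def cubic_beta(b_xxxx, b_xxyy, b_xyxy):
    """Rank-four tensor with three independent components in the cubic axes (assumed form)."""
    d = np.eye(3)
    beta = (b_xxyy * np.einsum('ik,lm->iklm', d, d)
            + b_xyxy * (np.einsum('il,km->iklm', d, d) + np.einsum('im,kl->iklm', d, d)))
    aniso = b_xxxx - b_xxyy - 2.0 * b_xyxy   # vanishes for an isotropic body
    for a in range(3):
        beta[a, a, a, a] += aniso
    return beta

def transverse_eigenvalues(beta, direction):
    """Eigenvalues of beta_iklm k_l k_m (unit k) restricted to the plane perpendicular to k."""
    k_hat = np.asarray(direction, dtype=float)
    k_hat = k_hat / np.linalg.norm(k_hat)
    corr = np.einsum('iklm,l,m->ik', beta, k_hat, k_hat)
    helper = np.eye(3)[np.argmin(np.abs(k_hat))]   # any vector not parallel to k
    e1 = np.cross(k_hat, helper)
    e1 /= np.linalg.norm(e1)
    e2 = np.cross(k_hat, e1)
    basis = np.vstack([e1, e2])
    return np.linalg.eigvalsh(basis @ corr @ basis.T)

if __name__ == "__main__":
    beta = cubic_beta(1.0, 0.3, 0.2)   # arbitrary illustrative values
    for direction in ([1, 0, 0], [1, 1, 1], [1, 1, 0]):
        print(direction, transverse_eigenvalues(beta, direction))
```

With these values the two eigenvalues coincide for k along a fourfold or threefold axis and differ for k along [110], so that the splitting depends on the direction of the wave vector.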

The possibility of longitudinal electromagnetic waves in a transparent isotropic
medium has been mentioned at the end of §84. The consistent formulation of the
condition determining the relation between the frequency and the wave number (the
dispersion relation) for these waves requires spatial dispersion to be taken into account;
the condition is

$$\varepsilon_l(\omega,k) = 0. \tag{105.7}$$

For small k, the solution of this equation is

$$\omega(k) = \omega_{l0} + \tfrac{1}{2}\alpha k^2, \tag{105.8}$$

where $\alpha$ is a constant and $\omega_{l0}$ the frequency for which the permittivity $\varepsilon(\omega) = \varepsilon_l(\omega, 0) = 0$. The
wave propagation velocity

$$u = d\omega/dk = \alpha k \tag{105.9}$$

is proportional to the wave vector.


PROBLEM 

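Find the relations between the components of the tensor $\beta_{iklm}$ in non-gyrotropic crystals of the cubic system.

Solution. Natural gyrotropy does not occur in the crystal classes $T_h$, $T_d$, $O_h$.

In the classes $T_d$ and $T_h$, we take the x, y and z axes along the three twofold axes of symmetry. The non-zero
components of the tensor are

$$\beta_1 \equiv \beta_{xxxx} = \beta_{yyyy} = \beta_{zzzz},\qquad \beta_2 \equiv \beta_{xxzz} = \beta_{yyxx} = \beta_{zzyy},$$

$$\beta_3 \equiv \beta_{xyxy} = \beta_{yzyz} = \beta_{zxzx},\qquad \beta_4 \equiv \beta_{zzxx} = \beta_{xxyy} = \beta_{yyzz}.$$

In the class $O_h$, the three $C_2$ axes become $C_4$ axes, and so we also have $\beta_2 = \beta_4$.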

§106. Spatial dispersion near an absorption line 

In §§104 and 105, spatial dispersion effects have been regarded as small corrections,
which they usually are. The situation is different, however, near a narrow absorption line in
a crystal, where, by (84.7), $\varepsilon^{(0)}(\omega)$ is sharply increased. In this region, the inclusion of spatial
dispersion causes a qualitative alteration in the picture.

The reason is that the addition to the permittivity of terms containing powers of k raises 
the order of the algebraic dispersion relation which determines the function $k(\omega)$. The
formal solution therefore gives rise to additional roots. Far from the absorption line, these 
lie in the range of large k, i.e. outside the region where the theory is valid, and they must 
therefore be discarded. Near the absorption line, however, the permittivity varies 
considerably even for small k, and additional roots may arise which have a real physical 
significance, i.e. new transverse waves may occur. 

For simplicity, we shall consider only isotropic media, and begin with the case where 
the medium is not gyrotropic, i.e. has no natural optical activity (S. I. Pekar, 1957; 
V. L. Ginzburg, 1958). 

As already shown in §105, an isotropic medium remains optically isotropic even when 
spatial dispersion is taken into account. This means that the dispersion relation for 
transverse electromagnetic waves in such a medium is given by the usual equation $n^2 = \varepsilon$,
in which $\varepsilon$ is taken to be the transverse permittivity $\varepsilon_t$:

$$n^2 = \varepsilon_t(\omega,k). \tag{106.1}$$

It has been shown in §103 that $\varepsilon_l(\omega,k)$ and $\varepsilon_t(\omega,k)$ as functions of frequency satisfy the
same Kramers-Kronig relations as $\varepsilon(\omega)$ does in the absence of spatial dispersion. We can
therefore state that near the absorption line the function $\varepsilon_t(\omega,k)$ has the same form as in
(84.7) but with constants that become functions of k; we write it as

$$\varepsilon_t(\omega,k) = \frac{A(k)}{\omega_t(k) - \omega}. \tag{106.2}$$

If A is relatively small,† it may be meaningful to include not only the pole term but also the
constant term (independent of $\omega$) in $\varepsilon_t$; we denote this by $a(k)$, and assume that $a > 0$, i.e.
that the medium is optically transparent far from the absorption line. The theoretical
admissibility of including simultaneously the constant and pole terms in $\varepsilon_t$ requires that
they become comparable in the frequency range $|\omega - \omega_t| \ll \omega_t$, the only one where the pole
expression is valid; we must thus have $A \ll a\omega_t$.

Since k is again assumed small, the functions can be expanded in powers of k. Here it is
sufficient to replace $a(k)$ and $A(k)$ by the constants $a \equiv a(0) > 0$, $A \equiv A(0) > 0$, retaining
the correction term only in the expansion of $\omega_t(k)$ in the small difference $\omega_t - \omega$:

$$\omega_t(k) = \omega_0 + \nu k^2.$$

Then the permittivity is

$$\varepsilon_t(\omega,k) = a + \frac{A}{\omega_0 + \nu k^2 - \omega}.$$

As a function of frequency, this is zero at a point in the range $\omega > \omega_t$. For k = 0, the zero is
at $\omega = \omega_{l0} = \omega_0 + A/a$. Since the permittivities $\varepsilon_t$ and $\varepsilon_l$ are the same when k = 0, $\omega_{l0}$ is the
limiting frequency (as $k \to 0$) of the longitudinal wave; cf. (105.8). The zero of $\varepsilon_t(\omega,k)$ when
$k \neq 0$ has no direct physical significance.


† For example, on account of some approximate selection rules which decrease the matrix elements
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The dispersion relation (106.1) now becomes 

$$(n^2 - a)(\beta n^2 - \omega + \omega_0) = A, \tag{106.3}$$

with $\beta = \nu\omega^2/c^2 \approx \nu\omega_0^2/c^2$, which may be either positive or negative. The solutions of (106.3) may be regarded as resulting from the intersection of two branches of the spectrum: the ordinary light wave with $n^2 = a$, and the wave with $n^2 = (\omega - \omega_0)/\beta$ which arises from the pole of the permittivity. These branches are shown by the dot-and-dash lines in Fig. 58. Their "interaction", whose strength depends on $A$, moves the lines apart.†




The continuous curves in Fig. 58 show schematically the functions $n^2(\omega)$ given by the roots of equation (106.3); the dashed curves are the functions $n^2(\omega)$ when spatial dispersion is neglected ($\beta = 0$). Of course, only positive‡ roots $n^2$ correspond to waves propagated in the medium. When $\beta > 0$ (Fig. 58a), the upper continuous curve enters the region $\omega > \omega_0$, and this gives rise to an additional wave which would not exist without spatial dispersion; when $\omega > \omega_{l0}$, two different electromagnetic waves can be propagated in the medium.

Figure 58b shows the function $n^2(\omega)$ when $\beta < 0$. To the left of the point $m$, whose coordinates are

$$n_m^2 = a + \sqrt{A/|\beta|}, \qquad \omega_m - \omega_0 = -|\beta|\left[a + 2\sqrt{A/|\beta|}\right],$$

and at which $dn^2/d\omega = \infty$ and the two roots coincide, there are two positive roots, and two different waves can be propagated in the medium. The same is true of the region


† In the microscopic theory, the pole of the permittivity represents the existence of excitons, which are elementary Bose excitations in the dielectric; the sign of the constant $\nu$ is the same as that of the effective exciton mass (see SP 2, §66). The corresponding branch of the wave spectrum is called the exciton branch. The part of the spectrum near the virtual intersection of the two branches is called the polariton range.

‡ For clarity, the diagram shows negative roots also. The purely imaginary values of $n$ correspond to waves for which the medium is opaque (though there is no absorption in it); the wave may be said to be totally reflected from the medium.

between the point $m'$, whose coordinates are

$$n_{m'}^2 = a - \sqrt{A/|\beta|}, \qquad \omega_{m'} - \omega_0 = -|\beta|\left[a - 2\sqrt{A/|\beta|}\right],$$

and the point $\omega = \omega_{l0}$.
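The branch structure described above can be explored numerically. The following is a minimal sketch, not part of the original text: it treats (106.3) as a quadratic in $x = n^2$ with $\delta = \omega - \omega_0$, and the function names `n2_roots` and `turning_point` are hypothetical helpers introduced only for illustration.

```python
import cmath


def n2_roots(delta, a, A, beta):
    """Roots x = n^2 of (x - a)(beta*x - delta) = A, where delta = omega - omega_0.

    Expanding (106.3) gives beta*x^2 - (delta + a*beta)*x + (a*delta - A) = 0.
    """
    if beta == 0:
        # Spatial dispersion neglected: the single root n^2 = a - A/delta.
        return (a - A / delta,)
    p = delta + a * beta
    disc = cmath.sqrt(p * p - 4.0 * beta * (a * delta - A))
    return ((p + disc) / (2.0 * beta), (p - disc) / (2.0 * beta))


def turning_point(a, A, beta, upper=True):
    """Coordinates (n^2, delta) of the point m (upper=True) or m' (upper=False).

    Valid for beta < 0, where the two roots of the quadratic coincide.
    """
    b = abs(beta)
    s = (A / b) ** 0.5
    sign = 1.0 if upper else -1.0
    return a + sign * s, -b * (a + sign * 2.0 * s)
```

Evaluating `n2_roots` at the `delta` returned by `turning_point` should give two coinciding roots, which is one way to check the quoted coordinates of $m$ and $m'$ for any chosen (illustrative) values of $a$, $A$ and $\beta$.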

If $A$ is not sufficiently small it is, strictly speaking, not consistent to retain the constant term in (106.2). On omitting this term (i.e. putting $a = 0$ in the above formulae) we get a picture that differs from Fig. 58 in that the horizontal asymptote of all the curves is the abscissa axis itself and not $n^2 = a$. The range $\omega > \omega_{l0}$ is not considered here.

Let us now examine the situation near an absorption line in a gyrotropic medium (V. L. Ginzburg, 1958). The permittivity $\varepsilon^{(0)}$, when spatial dispersion is neglected, may be expressed by the pole term

$$\varepsilon^{(0)}(\omega) = \frac{A}{\omega_0 - \omega}. \tag{106.4}$$

We shall not now assume that $A$ is particularly small, and the constant term is therefore omitted. For the relation between $\mathbf{E}$ and $\mathbf{D}$ a formula of the type (104.13) is to be used, expressed in terms of the inverse tensor $\eta_{ik} = \varepsilon^{-1}_{ik}$. In an isotropic medium

$$\mathbf{E} = \frac{1}{\varepsilon^{(0)}}\mathbf{D} + if\,\mathbf{D}\times\mathbf{n}, \tag{106.5}$$

where the optical activity vector is written as $\mathbf{G} = f\mathbf{n}$. Near the absorption line, the components of the tensor $\eta_{ik}$ simply pass through zero, and there is nothing to prevent its expansion in terms of the wave vector from converging.

The dispersion relation is

$$\left(\frac{1}{n^2} - \frac{1}{\varepsilon^{(0)}}\right)^2 = f^2 n^2; \tag{106.6}$$

cf. (101.12). Substitution of $\varepsilon^{(0)}$ from (106.4) gives

$$\left(\frac{1}{n^2} - \frac{\omega_0 - \omega}{A}\right)^2 = f^2 n^2. \tag{106.7}$$

In Fig. 59 the continuous curves show schematically the dependence of the roots $n^2$ of this equation on $\omega - \omega_0$. One exists both for $\omega < \omega_0$ and for $\omega > \omega_0$, where there are no real values of $n$ in the absence of spatial dispersion; the dependence $n^2 = \varepsilon^{(0)}(\omega)$ in that case is shown by the dashed curve in Fig. 59. The other two exist only when $\omega < \omega_m$, i.e. to the left of the point $m$ at which $\omega_0 - \omega_m = 3A(\tfrac{1}{2}f)^{2/3}$, $n_m^2 = (2/f)^{2/3}$. The curves $n_2^2(\omega)$ and $n_3^2(\omega)$ lie above the dashed curve, and $n_1^2(\omega)$ lies below it. Hence (as is clear from equations (101.11), which determine the induction $\mathbf{D}$ in the wave) waves 2 and 3 have circular polarization of one sign, and wave 1 has circular polarization of the other sign.
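The coordinates of the point $m$ quoted above can be recovered in a few lines. The following is a sketch of the working, assuming $f > 0$ and choosing the branch of (106.7) on which the two terms add; neither assumption is stated explicitly in the text.

```latex
% Square root of (106.7), on the branch where the terms add:
\frac{\omega_0 - \omega}{A} = \frac{1}{n^2} + f n .
% The two roots merge where d(\omega_0 - \omega)/dn = 0:
-\frac{2}{n^3} + f = 0
\quad\Longrightarrow\quad
n_m^3 = \frac{2}{f}, \qquad n_m^2 = \left(\frac{2}{f}\right)^{2/3} .
% Substituting back, with f n_m = 2 (f/2)^{2/3}:
\frac{\omega_0 - \omega_m}{A}
  = \left(\frac{f}{2}\right)^{2/3} + 2\left(\frac{f}{2}\right)^{2/3}
  = 3\left(\frac{f}{2}\right)^{2/3} .
```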

Lastly, it should be emphasized that formulae (106.2) and (106.4) for the permittivity, and therefore the results based on them, apply only to frequencies sufficiently far from the centre of the line: $|\omega - \omega_0| \gg \gamma$, where $\gamma$ is the line width. When $|\omega - \omega_0| \lesssim \gamma$, absorption (the imaginary part of the permittivity) has to be taken into account, and this may modify the picture substantially.



CHAPTER XIII 


NON-LINEAR OPTICS 


§107. Frequency transformation in non-linear media 

The theory of electromagnetic wave propagation in dielectric media described in the 
preceding chapters is based on the assumption of a linear relation between the electric field 
induction D and intensity E. This approximation is sufficiently accurate if (as is true in 
practice) $E$ is much less than typical intra-atomic fields. Even then, however, the small non-linear corrections to $\mathbf{D}(\mathbf{E})$ cause qualitatively new effects and may therefore be important.

The most important feature of a non-linear medium is the generation in it of vibrations with new frequencies. For example, if a monochromatic wave with frequency $\omega_1$ is incident on such a medium, then, as it is propagated in the medium, waves with frequencies $m\omega_1$ ($m$ being an integer) are generated; if there is initially a set of monochromatic signals with frequencies $\omega_1$ and $\omega_2$, the combination frequencies $m\omega_1 + n\omega_2$ will arise in the course of time, and so on.

If the medium is non-dissipative, the frequency transformation process is subject to 
certain very general relationships, in addition to the obvious condition that the total energy 
of the vibrations at all frequencies must be conserved. Here it is assumed that the non-linearity is weak; the significance of this property will be made more precise later.

The origin and significance of the required relations are most clearly seen from the 
quantum standpoint, and this will be our initial basis. To simplify the discussion, we shall 
suppose that all frequencies in the system can be represented as linear combinations of two 
incommensurate fundamental frequencies $\omega_1$ and $\omega_2$:

$$\omega_{mn} = m\omega_1 + n\omega_2, \tag{107.1}$$

where $m$ and $n$ are positive or negative integers.

The total energy of the radiation in the medium can be expressed as the sum of the energies of all the quanta:

$$\mathscr{U} = \sum_{m,n} N_{mn}\hbar\omega_{mn},$$

where $N_{mn}$ is the number of quanta with frequency $\omega_{mn}$. The summation is taken over all $m$ and $n$ for which $\omega_{mn} > 0$ (since, of course, only positive frequencies are physically meaningful).

Frequency transformation processes cause a variation in the numbers $N_{mn}$ with time, the total energy being conserved. Hence

$$\frac{d\mathscr{U}}{dt} = \hbar\omega_1\sum_{m,n} m\frac{dN_{mn}}{dt} + \hbar\omega_2\sum_{m,n} n\frac{dN_{mn}}{dt} = 0.$$

Since $\omega_1$ and $\omega_2$ are assumed incommensurate, and the numbers of quanta (and the changes in them) are integral, the two sums must separately be zero:†

$$\sum_{m,n} m\frac{dN_{mn}}{dt} = 0, \qquad \sum_{m,n} n\frac{dN_{mn}}{dt} = 0. \tag{107.2}$$


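Instead of the numbers of quanta, we will use the corresponding intensities, i.e. the total energies in the radiation of the corresponding frequencies:

$$\mathscr{U}_{mn} = N_{mn}\hbar\omega_{mn}. \tag{107.3}$$

The relations (107.2) then become

$$\sum_{m,n}\frac{m}{\omega_{mn}}\frac{d\overline{\mathscr{U}}_{mn}}{dt} = 0, \qquad \sum_{m,n}\frac{n}{\omega_{mn}}\frac{d\overline{\mathscr{U}}_{mn}}{dt} = 0. \tag{107.4}$$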

Here we must note a point that is especially clear in the classical picture of the oscillations. In referring to the time variation of $\mathscr{U}_{mn}$ we mean only the systematic variation; that is, we are considering the energy averaged over time intervals much longer than the periods $1/\omega_1$, $1/\omega_2$, as indicated by the bar over $\mathscr{U}_{mn}$ in (107.4). This is where the non-linear effects need to be weak: the characteristic time $\tau$ of the steady build-up which they cause in the oscillations excited must be much longer than the periods mentioned. Only then can it be meaningful to consider the time variation of quantities averaged over intervals $\Delta t$ such that $1/\omega_1,\ 1/\omega_2 \ll \Delta t \ll \tau$.

Equations (107.4) give the required relations, and constitute the Manley-Rowe theorem (J. M. Manley and H. E. Rowe, 1956).‡ It remains to give them a more definite form, avoiding the restriction $\omega_{mn} > 0$ in the summations. This is easily done by noting that, to each pair of numbers $m$, $n$ for which $\omega_{mn} > 0$, there corresponds a pair $-m$, $-n$ for which the frequency is negative and $|\omega_{mn}|$ is the same. Defining

$$\mathscr{U}_{-m,-n} = \mathscr{U}_{mn} \tag{107.5}$$

and extending the summation to all integers from $-\infty$ to $\infty$, we double the sums, which are zero as before. Another simplification is then possible. In the first equation (107.4), we divide the sum over $m$ into two, from 0 to $\infty$ and from $-\infty$ to 0, and in the second sum make the change $m, n \to -m, -n$; a similar change is made in the sum over $n$ in the second equation. The result is

$$\sum_{m=0}^{\infty}\sum_{n=-\infty}^{\infty}\frac{m}{m\omega_1 + n\omega_2}\frac{d\overline{\mathscr{U}}_{mn}}{dt} = 0, \qquad \sum_{m=-\infty}^{\infty}\sum_{n=0}^{\infty}\frac{n}{m\omega_1 + n\omega_2}\frac{d\overline{\mathscr{U}}_{mn}}{dt} = 0. \tag{107.6}$$


There is an obvious generalization to the case of a greater initial number of incommensurate frequencies.

The specific properties of the vibrating system may prohibit particular frequency transformation processes. The summations (107.6) are taken, in practice, only over allowed processes. For instance, in the simple case of a system that permits only the generation of the combination frequency $\omega_1 + \omega_2$, the numbers $m$ and $n$ take the values 0 and 1, and we find

$$\frac{1}{\omega_1}\frac{d\overline{\mathscr{U}}_{10}}{dt} = \frac{1}{\omega_2}\frac{d\overline{\mathscr{U}}_{01}}{dt} = -\frac{1}{\omega_1 + \omega_2}\frac{d\overline{\mathscr{U}}_{11}}{dt}. \tag{107.7}$$


† It is, of course, conventional to write the rates of change of integers as derivatives.
‡ The quantum interpretation is due to M. T. Weiss (1957).

The meaning of these equations is obvious: the decrease in the numbers of quanta $\hbar\omega_1$ and $\hbar\omega_2$ is equal to the increase in the number of quanta $\hbar(\omega_1 + \omega_2)$ generated.
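The quantum counting behind (107.7) can be illustrated with a toy bookkeeping model. This is only a sketch under stated assumptions: the random sequence of elementary up-conversion and down-conversion events, the initial occupation numbers, and the function name `simulate_three_wave` are all hypothetical and are not taken from the text; $\hbar$ is set to 1 by default.

```python
import random


def simulate_three_wave(omega1, omega2, steps=1000, seed=0, hbar=1.0):
    """Toy model: each event converts one quantum at omega1 and one at omega2
    into one quantum at omega1 + omega2, or the reverse.

    Returns the energy changes (dU1, dU2, dU3) and the three frequencies.
    """
    rng = random.Random(seed)
    omegas = (omega1, omega2, omega1 + omega2)
    n = [500, 500, 500]          # illustrative initial numbers of quanta
    start = list(n)
    for _ in range(steps):
        if rng.random() < 0.5 and n[0] > 0 and n[1] > 0:
            n[0] -= 1            # up-conversion: omega1 + omega2 -> omega3
            n[1] -= 1
            n[2] += 1
        elif n[2] > 0:
            n[0] += 1            # down-conversion: omega3 -> omega1 + omega2
            n[1] += 1
            n[2] -= 1
    d_u = [hbar * w * (now - before) for w, now, before in zip(omegas, n, start)]
    return d_u, omegas
```

Whatever the sequence of events, the ratios $\Delta\mathscr{U}_1/\omega_1$, $\Delta\mathscr{U}_2/\omega_2$ and $-\Delta\mathscr{U}_3/\omega_3$ coincide and the total energy change vanishes, which is the content of (107.7) in integrated form.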

§108. The non-linear permittivity 

When the non-linearity is weak, the first correction to the linear dependence of $\mathbf{D}$ on $\mathbf{E}$ is quadratic in the field. In the presence of time dispersion† it can be represented in any anisotropic medium by

$$D_i^{(2)}(t) = \int_0^\infty\!\!\int_0^\infty f_{i,kl}(\tau_1, \tau_2)E_k(t - \tau_1)E_l(t - \tau_2)\,d\tau_1\,d\tau_2, \tag{108.1}$$

an expression analogous to (77.3). Of course, the existence of such a term places certain limitations on the permissible symmetry of the medium; in particular, it does not occur if the latter is invariant under inversion.

Although we shall be considering as typical a term of the form (108.1) quadratic in E, it 
must be noted that in the quadratic approximation the induction D may also contain terms 
bilinear in the components of E and H, or quadratic in H; these are usually less important, 
and they will not be discussed here. We shall also ignore the non-linear dependence of the 
magnetic induction B on H, because it is analogous to the dependence D(E). 

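We define the quantity

$$\varepsilon_{i,kl}(\omega_1, \omega_2) = \int_0^\infty\!\!\int_0^\infty f_{i,kl}(\tau_1, \tau_2)\,e^{i(\omega_1\tau_1 + \omega_2\tau_2)}\,d\tau_1\,d\tau_2, \tag{108.2}$$

which may naturally be called the second-order non-linear permittivity by analogy with the linear permittivity defined by an expression of the form (77.5). It differs only by a factor from $\chi_{i,kl} = \varepsilon_{i,kl}/4\pi$, called the non-linear susceptibility. Because of the symmetry of $E_k$ and $E_l$ in (108.1), the tensor $f_{i,kl}$ is symmetrical in the suffixes $k$ and $l$ if the arguments are simultaneously transposed: $f_{i,kl}(\tau_1, \tau_2) = f_{i,lk}(\tau_2, \tau_1)$. The tensor $\varepsilon_{i,kl}$ therefore has the same type of symmetry:

$$\varepsilon_{i,kl}(\omega_1, \omega_2) = \varepsilon_{i,lk}(\omega_2, \omega_1). \tag{108.3}$$

In particular, when $\omega_1 = \omega_2$ the tensor is symmetrical in the last two suffixes:

$$\varepsilon_{i,kl}(\omega, \omega) = \varepsilon_{i,lk}(\omega, \omega). \tag{108.4}$$

Moreover, since the functions $f_{i,kl}$ are real, as follows from the definition (108.1) with real $\mathbf{E}$ and $\mathbf{D}$, we have

$$\varepsilon_{i,kl}(-\omega_1, -\omega_2) = \varepsilon^*_{i,kl}(\omega_1, \omega_2). \tag{108.5}$$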

The permittivity (108.2) arises naturally in the consideration of monochromatic fields or superpositions of these. In non-linear expressions such fields must of course be put in a real form. For example, if $\mathbf{E}$ is a monochromatic field with frequency $\omega$, we must write $\mathbf{E}(t) = \mathrm{re}\{\mathbf{E}_0 e^{-i\omega t}\}$, and substitution in (108.1) gives

$$D_i^{(2)}(t) = \tfrac{1}{2}\,\mathrm{re}\{\varepsilon_{i,kl}(\omega, \omega)\,e^{-2i\omega t}E_{0k}E_{0l} + \varepsilon_{i,kl}(\omega, -\omega)E_{0k}E^*_{0l}\}. \tag{108.6}$$

† Spatial dispersion is neglected throughout this chapter.

This includes vibrations with twice the frequency (as well as a constant term corresponding to the frequency $\omega - \omega = 0$). In the general case, $\varepsilon_{i,kl}(\omega_1, \omega_2)$ describes the contribution to the induction that is proportional to $\exp(-i\omega_3 t)$ with $\omega_3 = \omega_1 + \omega_2$.
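The decomposition into a second-harmonic term and a constant term can be checked in the simplest special case. The sketch below assumes a scalar, dispersionless quadratic response, so that the non-linear permittivity is a single real constant `eps2` independent of frequency; this simplification and the function names are illustrative assumptions, not part of the text.

```python
import cmath


def d2_direct(eps2, e0, omega, t):
    """Quadratic response computed directly from the real field re{E0 exp(-i w t)}."""
    field = (e0 * cmath.exp(-1j * omega * t)).real
    return eps2 * field * field


def d2_harmonics(eps2, e0, omega, t):
    """Same quantity written as in (108.6): a 2w term plus a constant (w - w = 0) term."""
    second_harmonic = eps2 * e0 * e0 * cmath.exp(-2j * omega * t)
    constant = eps2 * e0 * e0.conjugate()
    return 0.5 * (second_harmonic + constant).real
```

For any complex amplitude `e0` the two functions agree at all times, since $(\mathrm{re}\,z)^2 = \tfrac{1}{2}\mathrm{re}(z^2 + |z|^2)$.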

We shall discuss below only non-dissipative media, and refer to them as transparent, although they are not literally such (for waves of a given frequency), because of the possible transfer of energy to other frequencies. We shall also suppose that the medium has no magnetic structure.

First of all, we shall show that under these conditions the non-linear permittivity is real. This could be seen directly by expressing the components of the non-linear susceptibility tensor in terms of the matrix elements of the electric dipole interaction between the medium and the field, which acts as a small perturbation; the second-order susceptibility appears in third-order perturbation theory.† However, the reason why the result of such a calculation is real can be understood without actually doing it. The complete set of wave functions with respect to which the matrix elements are calculated may be chosen as real (for a medium having no magnetic structure, and therefore invariant under time reversal). The operator of the field interaction with the electric dipole moment of the medium is also real. Thus imaginary terms could occur only as a result of passing round poles due to the energy denominators in perturbation theory. However, the absence of dissipation in the medium signifies that none of the field frequencies coincides with a difference of energy levels in the system (or that the residues at the poles are zero because of certain selection rules); it is therefore unnecessary to pass round poles.

The transparency of the medium also gives rise to particular symmetry relations for the tensor $\varepsilon_{i,kl}$. These also could be deduced from specific expressions obtained in perturbation theory. Here again, however, the required result can be derived in a simpler manner. To do so, we assume that the field in the medium is the sum of three almost monochromatic fields with frequencies $\omega_1$, $\omega_2$, $\omega_3$:

$$\mathbf{E} = \mathbf{E}_1 + \mathbf{E}_2 + \mathbf{E}_3 = \mathrm{re}\{\mathbf{E}_{01}e^{-i\omega_1 t} + \mathbf{E}_{02}e^{-i\omega_2 t} + \mathbf{E}_{03}e^{-i\omega_3 t}\}, \tag{108.7}$$

where $\omega_1$ and $\omega_2$ are incommensurate and $\omega_3 = \omega_1 + \omega_2$. We shall suppose that the fields with frequencies $\omega_1$, $\omega_2$, $\omega_3$ are created by external sources which are afterwards switched off; the slight non-linearity of the medium causes their amplitudes $\mathbf{E}_{01}$ etc. to be slowly varying functions of time.

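This slowness allows Maxwell's equations to be written down for the field of each fundamental frequency separately. In turn, these equations give in the usual way the energy conservation equation

$$\frac{1}{4\pi}\overline{\left(\mathbf{E}_1\cdot\frac{\partial\mathbf{D}_1}{\partial t} + \mathbf{H}_1\cdot\frac{\partial\mathbf{H}_1}{\partial t}\right)} + \operatorname{div}\frac{c}{4\pi}\overline{(\mathbf{E}_1\times\mathbf{H}_1)} = 0,$$

and similarly for suffixes 2 and 3; $\mathbf{D}_1$ denotes the part of the induction $\mathbf{D}$ which contains the factors $e^{\pm i\omega_1 t}$, and the bar denotes averaging over time (which will be needed later). On integration over the whole volume of the field, the divergence term vanishes, leaving

$$\frac{1}{4\pi}\int\overline{\left(\mathbf{E}_1\cdot\frac{\partial\mathbf{D}_1}{\partial t} + \mathbf{H}_1\cdot\frac{\partial\mathbf{H}_1}{\partial t}\right)}\,dV = 0.$$

† These calculations are analogous to, though considerably more laborious than, that of the linear generalized susceptibility in second-order perturbation theory (see SP 1, §126).

If we now separate explicitly in $\mathbf{D}_1$ the terms linear and non-linear in $\mathbf{E}$:

$$\mathbf{D}_1 = \mathbf{D}_1^{(1)} + \mathbf{D}_1^{(2)},$$

then the former together with $\mathbf{H}_1\cdot\partial\mathbf{H}_1/\partial t$ give $d\mathscr{U}_1/dt$, the time variation of the field energy at frequency $\omega_1$. This variation is consequently determined by

$$\frac{d\mathscr{U}_1}{dt} = -\frac{1}{4\pi}\int\overline{\mathrm{re}\{\mathbf{E}_{01}e^{-i\omega_1 t}\}\cdot\frac{\partial\mathbf{D}_1^{(2)}}{\partial t}}\,dV. \tag{108.8}$$

Here the derivative $\partial\mathbf{D}_1^{(2)}/\partial t$ is to be expressed in terms of the field by (108.1) and (108.2). The time averaging reduces to zero all terms except those in which the exponential factors cancel. Repetition of the calculations for the other frequencies gives finally

$$\begin{aligned}
\frac{d\mathscr{U}_1}{dt} &= -\frac{i\omega_1}{16\pi}\int\varepsilon_{k,il}(-\omega_3, \omega_2)E^*_{03i}E_{01k}E_{02l}\,dV + \text{c.c.},\\
\frac{d\mathscr{U}_2}{dt} &= -\frac{i\omega_2}{16\pi}\int\varepsilon_{l,ki}(\omega_1, -\omega_3)E^*_{03i}E_{01k}E_{02l}\,dV + \text{c.c.},\\
\frac{d\mathscr{U}_3}{dt} &= \frac{i\omega_3}{16\pi}\int\varepsilon_{i,kl}(\omega_1, \omega_2)E^*_{03i}E_{01k}E_{02l}\,dV + \text{c.c.},
\end{aligned} \tag{108.9}$$

where c.c. denotes the complex conjugate. The property (108.3) has been used in these calculations.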

The total rate of change of the energy at all three frequencies is

$$\frac{d\mathscr{U}}{dt} = \frac{d\mathscr{U}_1}{dt} + \frac{d\mathscr{U}_2}{dt} + \frac{d\mathscr{U}_3}{dt}. \tag{108.10}$$

In a non-linear medium this sum need not in general vanish exactly, because of the possible transfer of energy to other combination frequencies such as $\omega_1 - \omega_2$ or $\omega_3 + \omega_2$. However, the fields at the frequencies $\omega_1$, $\omega_2$, $\omega_3$ arising from the external sources do not depend on the degree of non-linearity, and need not be small, unlike the fields at other frequencies, which occur only because the medium is non-linear. The contribution of the latter to the energy balance may therefore be neglected, and the condition that the sum (108.10) be zero imposed. Moreover, since such a medium is a non-linear system with only three frequencies, we can apply to it the Manley-Rowe theorem in the simple form (107.7). In the notation used here, the relations are

$$\frac{1}{\omega_1}\frac{d\mathscr{U}_1}{dt} = -\frac{1}{\omega_3}\frac{d\mathscr{U}_3}{dt}, \qquad \frac{1}{\omega_2}\frac{d\mathscr{U}_2}{dt} = -\frac{1}{\omega_3}\frac{d\mathscr{U}_3}{dt}.$$

Substitution of (108.9) shows that the non-linear permittivity satisfies the following important symmetry relations:†

$$\varepsilon_{i,kl}(\omega_1, \omega_2) = \varepsilon_{k,il}(-\omega_3, \omega_2) = \varepsilon_{l,ki}(\omega_1, -\omega_3) \tag{108.11}$$

(J. A. Armstrong, N. Bloembergen, J. Ducuing, and P. S. Pershan, 1962). The symmetry expressed by these equations becomes more obvious if the tensor components are given a third argument so that the sum of all three is zero:

$$\varepsilon_{i,kl}(-\omega_3; \omega_1, \omega_2) = \varepsilon_{k,il}(\omega_1; -\omega_3, \omega_2) = \varepsilon_{l,ki}(\omega_2; \omega_1, -\omega_3) = \varepsilon_{i,lk}(-\omega_3; \omega_2, \omega_1);$$

the last equation is (108.3). If we agree to relate the three successive argument frequencies to the three successive tensor indices, we can permute the latter in any manner, provided that the arguments are similarly permuted.

† It is important to note that the complex amplitudes are arbitrary; the complex conjugate terms in (108.9) and (108.10) are therefore independent and can be equated separately.
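The step from the Manley-Rowe relations to (108.11) can be written out explicitly. The working below is a sketch that assumes the three rates (108.9) are all expressed through the common amplitude product $E^*_{03i}E_{01k}E_{02l}$; that way of writing them is a reconstruction, not a quotation.

```latex
% First Manley-Rowe relation, with (108.9) substituted:
\frac{1}{\omega_1}\frac{d\mathscr{U}_1}{dt}
  + \frac{1}{\omega_3}\frac{d\mathscr{U}_3}{dt}
  = \frac{i}{16\pi}\int
    \bigl[\varepsilon_{i,kl}(\omega_1,\omega_2)
        - \varepsilon_{k,il}(-\omega_3,\omega_2)\bigr]
    E^*_{03i}E_{01k}E_{02l}\,dV + \text{c.c.} = 0 ,
% Second Manley-Rowe relation:
\frac{1}{\omega_2}\frac{d\mathscr{U}_2}{dt}
  + \frac{1}{\omega_3}\frac{d\mathscr{U}_3}{dt}
  = \frac{i}{16\pi}\int
    \bigl[\varepsilon_{i,kl}(\omega_1,\omega_2)
        - \varepsilon_{l,ki}(\omega_1,-\omega_3)\bigr]
    E^*_{03i}E_{01k}E_{02l}\,dV + \text{c.c.} = 0 .
% The amplitudes are arbitrary, so each bracket vanishes separately:
\varepsilon_{i,kl}(\omega_1,\omega_2)
  = \varepsilon_{k,il}(-\omega_3,\omega_2)
  = \varepsilon_{l,ki}(\omega_1,-\omega_3) .
```

Adding the three rates with unit weights instead of the weights $1/\omega$ constrains only the combination $\omega_3\varepsilon_{i,kl}(\omega_1,\omega_2) - \omega_1\varepsilon_{k,il}(-\omega_3,\omega_2) - \omega_2\varepsilon_{l,ki}(\omega_1,-\omega_3)$, which is why energy conservation alone is a weaker statement.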

The requirement that dissipation be absent would by itself lead only to a weaker condition, that the sum (108.10) be zero:

$$\omega_3\varepsilon_{i,kl}(\omega_1, \omega_2) - \omega_1\varepsilon_{k,il}(-\omega_3, \omega_2) - \omega_2\varepsilon_{l,ki}(\omega_1, -\omega_3) = 0. \tag{108.12}$$

The above derivation cannot be applied directly when $\omega_1 = \omega_2 = \omega$, since the Manley-Rowe relations then reduce to just the conservation of the total energy. The equation

$$\varepsilon_{i,kl}(\omega, \omega) = \varepsilon_{k,il}(-2\omega, \omega) = \varepsilon_{l,ki}(\omega, -2\omega) \tag{108.13}$$

can, however, be derived simply on grounds of continuity by taking the limit of (108.11).

If both frequencies $\omega_1$ and $\omega_2$ tend to zero, the tensor $\varepsilon_{i,kl}$ becomes completely symmetrical. This symmetry expresses just the fact that in the static case the induction $\mathbf{D}$ can be found by differentiating the free energy with respect to $\mathbf{E}$: $D_i = -4\pi\,\partial\tilde{F}/\partial E_i$, and so $\partial D_i/\partial E_k = \partial D_k/\partial E_i$. This shows that $\varepsilon_{i,kl}$ is symmetrical in the suffixes $i$ and $k$, and therefore in all three suffixes.

If only one of the frequencies is zero, the relations (108.11) give

$$\varepsilon_{i,kl}(\omega, 0) = \varepsilon_{l,ki}(\omega, -\omega), \tag{108.14}$$

and

$$\varepsilon_{i,kl}(\omega, 0) = \varepsilon_{k,il}(-\omega, 0) = \varepsilon_{k,il}(\omega, 0); \tag{108.15}$$

by (108.5), the real functions $\varepsilon_{i,kl}(\omega, 0)$ are even functions of $\omega$. The tensor $\varepsilon_{i,kl}(\omega, 0)$ describes a linear electro-optical effect, the change in the permittivity of the crystal in a static electric field, and it is therefore the same as the tensor $\alpha_{ikl}$ defined in (100.4):

$$\varepsilon_{i,kl}(\omega, 0) = \alpha_{ikl}(\omega);$$

from (108.15), it is symmetrical in the suffixes $i$, $k$, as it should be. The tensor $\varepsilon_{l,ki}(\omega, -\omega)$ describes another effect, the presence in the medium of a static dielectric polarization proportional to the square of the applied weak periodic field; compare the second term in (108.6). The equation (108.14) thus states the relation between these two effects.

By similar arguments using the non-linear susceptibility, which is a "cross" between electric and magnetic quantities, we could recover the relation between the magneto-optical Faraday effect and the magnetization of a medium by a rotating electric field as given by (101.15) and (101.25).

As already mentioned, for media invariant under spatial inversion, there is no second-order non-linearity. In such cases, the non-linear effects begin with the cubic terms in the expansion of $\mathbf{D}(\mathbf{E})$. The corresponding third-order non-linear permittivity is a tensor of rank four, a function of three independent frequencies, $\varepsilon_{i,klm}(\omega_1, \omega_2, \omega_3)$. Its symmetry properties are exactly analogous to those of the second-order permittivity tensor: if we define a fourth frequency $\omega_4 = \omega_1 + \omega_2 + \omega_3$ and write it as $\varepsilon_{i,klm}(-\omega_4; \omega_1, \omega_2, \omega_3)$, then the suffixes can be permuted in any manner if the four arguments are permuted similarly.

The third-order non-linearity may be important even when there is quadratic non-linearity, owing to the specificity of the effects it causes.


§109. Self-focusing 

In this section we shall discuss optical effects arising from the non-linear variation of the field at the primary wave frequency. That is, we consider the non-linear contribution to $\mathbf{D}$ at the frequency $\omega$ of the monochromatic field $\mathbf{E}$. In the quadratic terms there is no such contribution, as they contain only the frequencies $2\omega$ and zero. The first non-zero effect comes from the cubic non-linearity and occurs in terms having the form $EEE^*$ (frequency $\omega + \omega - \omega = \omega$).

In the rest of this section, the medium (liquid or gas) will be assumed isotropic. The relevant third-order terms in the induction then have the general form

$$\mathbf{D}^{(3)} = \alpha(\omega)|\mathbf{E}|^2\mathbf{E} + \beta(\omega)\mathbf{E}^2\mathbf{E}^*; \tag{109.1}$$

they involve two independent coefficients, which in a transparent medium are real even functions of the frequency. This number of independent coefficients is in agreement with the symmetry properties of the tensor $\varepsilon_{i,klm}(-\omega; -\omega, \omega, \omega)$. With these values of the arguments, the tensor is symmetrical in the pairs of suffixes $i$, $k$ and $l$, $m$; in an isotropic medium, such a tensor has two independent components. In the low-frequency limit, as noted in §108, the tensor must be symmetrical in all suffixes, and in an isotropic medium therefore proportional to the combination $\delta_{ik}\delta_{lm} + \delta_{il}\delta_{km} + \delta_{im}\delta_{kl}$. This means that

$$\alpha(0) = 2\beta(0). \tag{109.2}$$
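The ratio in (109.2) follows from contracting the fully symmetric isotropic tensor with $E_kE_lE^*_m$. A minimal numerical check, written as an illustrative sketch (the function names are hypothetical, and the overall constant multiplying the tensor is set to 1):

```python
def kron(i, j):
    """Kronecker delta."""
    return 1.0 if i == j else 0.0


def contract_symmetric_isotropic(field):
    """D_i = (d_ik d_lm + d_il d_km + d_im d_kl) E_k E_l conj(E_m) for a complex 3-vector."""
    result = []
    for i in range(3):
        total = 0j
        for k in range(3):
            for l in range(3):
                for m in range(3):
                    tensor = (kron(i, k) * kron(l, m)
                              + kron(i, l) * kron(k, m)
                              + kron(i, m) * kron(k, l))
                    total += tensor * field[k] * field[l] * field[m].conjugate()
        result.append(total)
    return result


def alpha_beta_form(field, alpha, beta):
    """alpha |E|^2 E + beta E^2 E*, the general isotropic form (109.1)."""
    norm2 = sum(abs(c) ** 2 for c in field)
    square = sum(c * c for c in field)
    return [alpha * norm2 * c + beta * square * c.conjugate() for c in field]
```

For an arbitrary complex vector the contraction coincides with `alpha_beta_form(field, 2.0, 1.0)`, i.e. the two coefficients stand in the ratio 2 : 1.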

The expression (109.1) can be simplified in the case of a linearly polarized field $\mathbf{E}$. With this polarization, the complex vector $\mathbf{E}$ reduces to a real vector multiplied by a common phase factor; the expressions $|\mathbf{E}|^2\mathbf{E}$ and $\mathbf{E}^2\mathbf{E}^*$ then become identical, and

$$\mathbf{D}^{(3)} = (\alpha + \beta)|\mathbf{E}|^2\mathbf{E}. \tag{109.3}$$

A similar simplification occurs when the field $\mathbf{E}$ is circularly polarized; then $\mathbf{E}^2 = 0$, and (109.1) reduces to

$$\mathbf{D}^{(3)} = \alpha|\mathbf{E}|^2\mathbf{E}. \tag{109.4}$$

In either case, the induction is polarized in the same way as $\mathbf{E}$. In the general case of elliptical polarization, however, the directions and ratios of the principal axes of the ellipses are not the same for $\mathbf{E}$ and $\mathbf{D}^{(3)}$.

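The relation $\mathbf{D} = \mathbf{D}^{(1)} + \mathbf{D}^{(3)} = \varepsilon\mathbf{E} + \mathbf{D}^{(3)}$, where $\varepsilon(\omega)$ is the ordinary linear permittivity, must be substituted in Maxwell's equations, which are to be written (eliminating the magnetic field $\mathbf{H}$) as

$$\operatorname{curl}\operatorname{curl}\mathbf{E} + \frac{1}{c^2}\frac{\partial^2\mathbf{D}}{\partial t^2} = 0, \tag{109.5}$$

$$\operatorname{div}\mathbf{D} = 0. \tag{109.6}$$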

It is important that these non-linear equations allow an exact solution as a monochromatic plane wave

$$\mathbf{E} = \mathbf{E}_0 e^{i(\mathbf{k}\cdot\mathbf{r} - \omega t)} \tag{109.7}$$

with linear or circular polarization: for such waves $|\mathbf{E}|^2 = |\mathbf{E}_0|^2$, so that formulae (109.3) and (109.4) are of the same type as in the linear case with the permittivity depending on the field amplitude; we can therefore take the real part after solving the equations. The relation between $\mathbf{D}$ and $\mathbf{E}$ in these cases will be written as

$$\mathbf{D} = \left(\varepsilon + \frac{2c^2}{\omega^2}\eta|\mathbf{E}_0|^2\right)\mathbf{E}, \tag{109.8}$$

with the notation (which will later be convenient) $\eta = \omega^2(\alpha + \beta)/2c^2$ for a linearly polarized wave or $\eta = \omega^2\alpha/2c^2$ for a circularly polarized one.

Substitution of (109.8) in (109.6) gives $\operatorname{div}\mathbf{E} = 0$, the field remaining transverse as in the linear theory. When this is taken into account, the substitution of (109.7) in (109.5) yields the dispersion relation

$$k^2 = \frac{\omega^2\varepsilon}{c^2} + 2\eta|\mathbf{E}_0|^2. \tag{109.9}$$

The phase velocity $\omega/k$ now depends on the wave amplitude as well as on the frequency. If $\eta > 0$, the phase velocity decreases with increasing amplitude, and we have a focusing medium (the significance of this term will be explained later). If $\eta < 0$, the phase velocity increases with the amplitude, and this is a defocusing medium.

The use of the non-linear relation (109.1) presupposes, of course, that there is only slight non-linearity: the higher-order terms must be small in comparison with the cubic terms retained there. Qualitatively new effects may arise by the "build-up" of non-linearity effects over long intervals of time and over large distances. A natural formulation of the problem here is to consider an almost monochromatic wave

$$\mathbf{E} = \mathbf{E}_0(t, \mathbf{r})\,e^{i(k_0 x - \omega t)}, \tag{109.10}$$

where $\mathbf{E}_0(t, \mathbf{r})$ is a slowly varying function of time and coordinates, which has only a slight relative change over times $\sim 1/\omega$ and distances $\sim 1/k_0$. The wave vectors that occur in the Fourier expansion of this field are distributed over a small range of values about the vector $\mathbf{k}_0$, which is in the $x$ direction and whose magnitude we regard as related to $\omega$ by

$$k_0^2 = \omega^2\varepsilon(\omega)/c^2, \tag{109.11}$$

as in the linear theory. We now derive an equation for $\mathbf{E}_0(t, \mathbf{r})$.

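First of all, we note that in (109.5) $\operatorname{grad}\operatorname{div}\mathbf{E}$ is negligible in the term $\operatorname{curl}\operatorname{curl}\mathbf{E} = \operatorname{grad}\operatorname{div}\mathbf{E} - \Delta\mathbf{E}$: by (109.6), the divergence of the field is

$$\operatorname{div}\mathbf{E} = -\eta\,(2c^2/\varepsilon\omega^2)\,\mathbf{E}_0\cdot\operatorname{grad}|\mathbf{E}_0|^2;$$

it is therefore non-zero only through the derivatives of the slowly varying function $\mathbf{E}_0$, and it is additionally small because the non-linear terms are small; such quantities are negligible. We thus have

$$\operatorname{curl}\operatorname{curl}\mathbf{E} \approx -\Delta\mathbf{E} = e^{i(k_0 x - \omega t)}\left[k_0^2\mathbf{E}_0 - 2ik_0\frac{\partial\mathbf{E}_0}{\partial x} - \Delta_\perp\mathbf{E}_0\right],$$

where $\Delta_\perp = \partial^2/\partial y^2 + \partial^2/\partial z^2$; the term in the second derivative $\partial^2\mathbf{E}_0/\partial x^2$ is omitted since it does not contain the large factor $k_0$. The transverse derivatives, however, may be large in comparison with the longitudinal ones.

The calculation of $\partial^2\mathbf{D}^{(1)}/\partial t^2$ is similar† to the derivation of (80.10) and gives

$$\frac{1}{c^2}\frac{\partial^2\mathbf{D}^{(1)}}{\partial t^2} = -e^{i(k_0 x - \omega t)}\left[k_0^2\mathbf{E}_0 + 2i\frac{k_0}{u}\frac{\partial\mathbf{E}_0}{\partial t}\right],$$

with the group velocity $u$ defined by

$$\frac{1}{u} = \frac{dk_0}{d\omega} = \frac{1}{2k_0c^2}\frac{d(\omega^2\varepsilon)}{d\omega}.$$

In the derivative of $\mathbf{D}^{(3)}$, it is sufficient to retain the term

$$\frac{1}{c^2}\frac{\partial^2\mathbf{D}^{(3)}}{\partial t^2} = -2\eta(\omega)|\mathbf{E}_0|^2\mathbf{E}, \tag{109.12}$$

neglecting the term in the small derivative $\partial\mathbf{E}_0/\partial t$.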

Substituting these expressions in (109.5), we have finally

$$ik_0\left(\frac{\partial}{\partial x} + \frac{1}{u}\frac{\partial}{\partial t}\right)\mathbf{E}_0 = -\tfrac{1}{2}\Delta_\perp\mathbf{E}_0 - \eta(\omega)|\mathbf{E}_0|^2\mathbf{E}_0. \tag{109.13}$$

The combination of derivatives on the left expresses the fact that the amplitude perturbations are transmitted in the direction of propagation of the wave, at the group velocity.

With this equation, we can examine the stability of an infinite plane wave described by the exact solution (109.7), (109.8) (V. I. Bespalov and V. I. Talanov, 1966). We shall see that the wave is unstable in a focusing medium.‡

According to (109.9), in the exact solution (109.7) $k = k_0 + \eta E_0^2/k_0$, with $k_0$ from (109.11); in a linearly polarized wave, the amplitude $\mathbf{E}_0$ may be defined as a real vector. Hence, if we write the wave (109.7) in the form (109.10), we must put in the latter $\mathbf{E}_0(x) = \mathbf{E}_0\exp(ix\eta E_0^2/k_0)$. This acts as the amplitude of the unperturbed wave. We shall consider the steady-state problem of the spatial evolution of perturbations along the direction of propagation of the wave. Accordingly, the amplitude of a wave subject to a small perturbation is written

$$\mathbf{E}_0(\mathbf{r}) = \{\mathbf{E}_0 + \delta\mathbf{E}(\mathbf{r})\}\exp(ix\eta E_0^2/k_0). \tag{109.14}$$

We shall assume that $\delta\mathbf{E}$ is parallel to $\mathbf{E}_0$.

Substitution of (109.14) in (109.13) gives, to first order in $\delta E$,

$$ik_0\,\partial\delta E/\partial x = -\tfrac{1}{2}\Delta_\perp\delta E - \eta E_0^2(\delta E + \delta E^*). \tag{109.15}$$

We put

$$\delta E = A\,e^{i(\mathbf{q}\cdot\mathbf{r} + \gamma x)} + B^* e^{-i(\mathbf{q}\cdot\mathbf{r} + \gamma x)}, \tag{109.16}$$

where $\mathbf{q}$ is a vector in the $yz$-plane. Substituting this expression in (109.15) and collecting

† The only difference is that here $f = f(\omega) = -\omega^2\varepsilon(\omega)$.

‡ This was earlier noted by R. V. Khokhlov (1965).

the groups of terms in $\exp\{\pm i(\mathbf{q}\cdot\mathbf{r} + \gamma x)\}$, we obtain the two equations

$$(\tfrac{1}{2}q^2 - \eta E_0^2 + k_0\gamma)A - \eta E_0^2 B = 0,$$

$$-\eta E_0^2 A + (\tfrac{1}{2}q^2 - \eta E_0^2 - k_0\gamma)B = 0.$$

The condition that their determinant be zero gives

$$\gamma = \pm(q/2k_0)\sqrt{q^2 - 4\eta E_0^2}.$$

When $\eta > 0$ and

$$q^2 < 4\eta E_0^2, \tag{109.17}$$

$\gamma$ is imaginary, so that $\delta E$ in (109.16) contains an exponentially increasing term, and the wave is therefore unstable. The maximum instability growth rate is of the order of the non-linear correction to the wave vector.
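The last statement can be made quantitative with a short calculation. The sketch below evaluates the expression for $\gamma$ obtained from the determinant condition; the function names and the sample parameter values used with them are illustrative assumptions only.

```python
import cmath


def gamma(q, eta, e0, k0):
    """One root gamma = (q / 2 k0) * sqrt(q^2 - 4 eta E0^2); imaginary inside the band (109.17)."""
    return (q / (2.0 * k0)) * cmath.sqrt(q * q - 4.0 * eta * e0 * e0)


def max_growth(eta, e0, k0):
    """Transverse wave number q maximising |Im gamma| (for eta > 0), and the maximum itself.

    Setting d(gamma^2)/d(q^2) = 0 gives q^2 = 2 eta E0^2.
    """
    q_max = (2.0 * eta * e0 * e0) ** 0.5
    return q_max, abs(gamma(q_max, eta, e0, k0).imag)
```

The maximum of $|\mathrm{Im}\,\gamma|$ is reached at $q^2 = 2\eta E_0^2$ and equals $\eta E_0^2/k_0$, which is exactly the non-linear correction $k - k_0$ to the wave number of the unperturbed wave.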

One manifestation of this instability is the self-focusing of a light beam of finite width propagated in a focusing medium. The phenomenon occurs because, if the field amplitude decreases from the axis to the periphery of the beam, the permittivity of the medium (which depends on the amplitude) also decreases in that direction (if $\eta > 0$), and the medium behaves as a focusing lens (G. A. Askar'yan, 1962). The behaviour of the beam is determined by the interplay of two opposing tendencies, namely this focusing and the broadening due to diffraction.

We shall show, first of all, that these tendencies may cancel out, in the sense that (if $\eta > 0$) equation (109.13) has a solution in the form of a steady beam that undergoes no broadening. This self-channelling is a specifically non-linear effect. In the linear theory, any beam with a finite cross-section is broadened by diffraction. We shall take only the one-dimensional case where the field $\mathbf{E}$ depends only on one transverse coordinate $y$ and is polarized in the $z$-direction and propagated in the $x$-direction.† An analytical solution can then be obtained (V. I. Talanov, 1965). We here ignore the fact that a beam of infinite width (in the $z$-direction) is certainly unstable, because perturbations with small $q_z$ can occur in it, which are unstable by (109.17).

We write

$$E_{0z} = F(y)\,e^{i\kappa x} \tag{109.18}$$

with a small quantity $\kappa$ which acts as a correction to the wave number $k_0$. The function $F(y)$ is real. Substitution in (109.13) gives for it the equation

$$\tfrac{1}{2}\,d^2F/dy^2 = k_0\kappa F - \eta F^3. \tag{109.19}$$

The first integral is

$$\tfrac{1}{2}(dF/dy)^2 - k_0\kappa F^2 + \tfrac{1}{2}\eta F^4 = \text{constant}.$$

We are interested in the solution for which $F$ and $dF/dy$ tend to zero as $|y| \to \infty$. Accordingly, we take the constant to be zero, and a straightforward integration then gives

$$F = (2k_0\kappa/\eta)^{1/2}\operatorname{sech}\left[(2k_0\kappa)^{1/2}y\right], \tag{109.20}$$

$y$ being measured from the centre of the beam. The width of the beam in the $y$-direction is $\delta \sim 1/\sqrt{k_0\kappa} \sim 1/\sqrt{\eta}\,F(0)$.
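That the sech profile satisfies (109.19) can be confirmed numerically. The following is a minimal sketch using a central finite difference for the second derivative; the step size `h` and the function names are illustrative choices, not part of the text.

```python
import math


def soliton_profile(y, k0, kappa, eta):
    """F(y) = sqrt(2 k0 kappa / eta) * sech(sqrt(2 k0 kappa) * y), as in (109.20)."""
    width_inv = math.sqrt(2.0 * k0 * kappa)
    return math.sqrt(2.0 * k0 * kappa / eta) / math.cosh(width_inv * y)


def residual(y, k0, kappa, eta, h=1e-4):
    """Left side minus right side of (109.19), with F'' from a central difference."""
    f_minus = soliton_profile(y - h, k0, kappa, eta)
    f_mid = soliton_profile(y, k0, kappa, eta)
    f_plus = soliton_profile(y + h, k0, kappa, eta)
    second = (f_plus - 2.0 * f_mid + f_minus) / (h * h)
    return 0.5 * second - (k0 * kappa * f_mid - eta * f_mid ** 3)
```

The residual should be zero to within the discretisation error at every $y$, and `soliton_profile(0.0, ...)` gives the peak amplitude $F(0) = (2k_0\kappa/\eta)^{1/2}$ that fixes the beam width.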

† Under these conditions, the term $\operatorname{grad}\operatorname{div}\mathbf{E}$, neglected previously as being small, is identically zero.

Since the energy flux along the beam is $\sim F^2(0)\delta$, the width $\delta$ is inversely proportional to this flux, and the beam becomes narrower as the power carried by it increases.

A self-channelled beam of this kind is a special case where the focusing properties of the medium are exactly balanced by diffraction. Other beams may either diverge or converge. First, let us derive a qualitative criterion for self-focusing of an actual beam with a finite cross-section (R. Y. Chiao, E. M. Garmire, and C. H. Townes, 1964). This can be done immediately by using the instability condition (109.17). In a beam with characteristic radius $R$, perturbations can occur whose wavelengths transverse to the beam axis are less than $R$, i.e. for which $q \gtrsim 1/R$. The condition (109.17) gives the upper limit of $q$ values which lead to instability. The beam will therefore be unstable with respect to focusing if

$$E_0^2R^2\eta \gtrsim 1. \tag{109.21}$$

The power carried by the beam is proportional to the product $E_0^2R^2$. The critical value of this power beyond which self-focusing begins is therefore independent of the cross-sectional area of the beam.

It is also possible to establish an exact (not just order-of-magnitude) sufficient condition for self-focusing of a beam (S. N. Vlasov, V. A. Petrishchev, and V. I. Talanov, 1971).

For a steady linearly polarized light beam, but with no initial assumptions about the dependence on $x$, the equation for the function $E_0(x, \boldsymbol{\rho})$ is

$$ik_0\,\partial E_0/\partial x = -\tfrac{1}{2}\Delta_\perp E_0 - \eta|E_0|^2E_0, \tag{109.22}$$

$\boldsymbol{\rho}$ being the two-dimensional position vector in the $yz$-plane, in which the differential operators $\Delta_\perp$ and $\operatorname{grad}_\perp$ act. It is easy to verify that this equation gives

$$\partial|E_0|^2/\partial x + \operatorname{div}_\perp\mathbf{j} = 0, \tag{109.23}$$

where

$$\mathbf{j} = (i/2k_0)(E_0\operatorname{grad}_\perp E_0^* - E_0^*\operatorname{grad}_\perp E_0).$$

Hence, in turn, it follows that the integral

$$N = \int|E_0|^2\,d^2\rho \tag{109.24}$$

is "conserved" (i.e. independent of $x$), as is

$$\mathscr{S} = \frac{1}{2k_0^2}\int\{|\operatorname{grad}_\perp E_0|^2 - \eta|E_0|^4\}\,d^2\rho, \tag{109.25}$$

a result which is derived immediately by differentiating with respect to $x$ and using (109.22). We assume, of course, that $E_0$ decreases sufficiently rapidly as $\rho \to \infty$, so that both integrals, as well as (109.26) below, converge.†

We will show that the behaviour of the beam is determined by the sign of the integral $\mathcal{S}$.
When $\mathcal{S} > 0$ the beam is divergent on average; when $\mathcal{S} < 0$ it is focused. The proof is based
on the mean square radius of the beam, defined as


† As regards the derivatives present, equation (109.22) is similar to the two-dimensional
Schrödinger's equation, with $x$ in place of the time. In this analogy, $N$ and $\mathcal{S}$ act as the particle number and the energy. The


5t affect the derivation of these c< 
$$R^2(x) = \frac{1}{N}\int \rho^2 |E_0|^2\, d^2\rho. \qquad (109.26)$$

To derive this, we write, using (109.23),

$$\frac{d}{dx}\int |E_0|^2 \rho^2\, d^2\rho = -\int \mathrm{div}_\perp\,\mathbf{j}\cdot\rho^2\, d^2\rho = 2\int \mathbf{j}\cdot\boldsymbol{\rho}\, d^2\rho.$$

Differentiating again with respect to $x$, we substitute $\partial E_0/\partial x$ from (109.22) and integrate
twice by parts, obtaining $N\,d^2R^2/dx^2 = 4\mathcal{S}$. Hence

$$R^2(x) = 2\mathcal{S}(x - x_0)^2/N + R_0^2, \qquad (109.27)$$

where $x_0$ and $R_0$ are constants. We see that, when $\mathcal{S} < 0$, complete focusing of the beam is
attained at a finite distance in the direction of propagation, its radius $R$ becoming zero.†
This result, derived with the approximate equation (109.13), cannot have a literal
physical significance near the focus itself, where the assumptions made in obtaining that
equation are invalid. We need only mention that, when the field energy density increases
without limit in exact focusing, there is no justification for retaining only the lowest (the
third) power of the non-linearity. However, the possibility of self-focusing of the beam, to
such an extent that the non-linearity is no longer small, is important. It should be
emphasized that the condition derived is sufficient but not necessary. A beam with $\mathcal{S} < 0$
certainly undergoes complete focusing, but the divergence on average of a beam with
$\mathcal{S} > 0$ is not inconsistent with the focusing of some internal part of it.
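
As a numerical illustration of (109.24) to (109.27), the following sketch evaluates $N$, $\mathcal{S}$ and the mean square radius for a Gaussian beam $E_0 = A\exp(-\rho^2/2a^2)$ and, when $\mathcal{S} < 0$, the distance at which $R^2$ vanishes. The Gaussian profile, the function names, the choice $x_0 = 0$ (flat initial phase) and the normalization $1/2k_0^2$ of $\mathcal{S}$ (taken so that $N\,d^2R^2/dx^2 = 4\mathcal{S}$) are assumptions made for the illustration, not part of the text.

```python
import math

def gaussian_beam_integrals(A, a, eta, k0, n=20000, rmax_factor=12.0):
    """Return (N, S, R2) for E0 = A*exp(-rho^2 / (2 a^2)).

    N  : integral of |E0|^2 over the transverse plane, as in (109.24)
    S  : (1 / 2 k0^2) * integral of (|grad E0|^2 - eta |E0|^4), as in (109.25)
    R2 : mean square radius, as in (109.26)
    The integrals are done by midpoint quadrature in the radial variable.
    """
    h = rmax_factor * a / n
    N = grad2 = quartic = rho2 = 0.0
    for i in range(n):
        r = (i + 0.5) * h
        E = A * math.exp(-r * r / (2.0 * a * a))
        dE = -r / (a * a) * E              # radial derivative of E0
        w = 2.0 * math.pi * r * h          # area element 2*pi*rho*d(rho)
        N += E * E * w
        grad2 += dE * dE * w
        quartic += E ** 4 * w
        rho2 += r * r * E * E * w
    S = (grad2 - eta * quartic) / (2.0 * k0 ** 2)
    return N, S, rho2 / N

def focus_distance(N, S, R0_sq):
    """Distance x at which R^2 = 2 S x^2 / N + R0^2 reaches zero (x0 = 0).

    Returns None when S >= 0, i.e. when the beam diverges on average.
    """
    if S >= 0:
        return None
    return math.sqrt(-N * R0_sq / (2.0 * S))

if __name__ == "__main__":
    for eta in (1.0, 4.0):
        N, S, R2 = gaussian_beam_integrals(A=1.0, a=1.0, eta=eta, k0=1.0)
        print(eta, N, S, R2, focus_distance(N, S, R2))
```

For this profile the integrals can also be done by hand: $N = \pi A^2a^2$ and $\mathcal{S} = (\pi A^2/2k_0^2)(1 - \tfrac{1}{2}\eta A^2a^2)$, so that the sign of $\mathcal{S}$ depends only on the product $A^2a^2$, i.e. on the power and not on the beam width, in agreement with the remark following (109.21).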

§110. Second-harmonic generation 

In §107, only some general relations have been considered which relate to frequency
transformation processes characteristic of non-linear optics. We shall now describe the
quantitative theory of a typical process of this type, namely second-harmonic generation,
i.e. the excitation of an electromagnetic field with frequency $2\omega$ by one with frequency $\omega$
(R. V. Khokhlov, 1960; J. A. Armstrong, N. Bloembergen, J. Ducuing and P. S. Pershan,
1962).

Second-harmonic generation is a non-linear second-order effect. It resides in the non-linear
susceptibility tensor

$$\varepsilon_{i,kl}(-2\omega;\ \omega, \omega) \qquad (110.1)$$

and therefore does not occur in media which allow spatial inversion. The tensor (110.1) is
symmetrical in the suffixes $k$ and $l$; its symmetry properties in various crystals are the same
as for the piezoelectric tensor (§17). We shall assume that there is no absorption in the
medium, so that the $\varepsilon_{i,kl}$ are real quantities.

The problem of second-harmonic generation may be stated as follows. Let a
monochromatic plane wave of frequency $\omega$ be incident on the plane surface of a crystal. As
well as the reflected and two (in a doubly refracting crystal) refracted waves with the same
frequency, a reflected wave and refracted waves with frequency $2\omega$ are also formed. The
waves with this frequency in the crystal constitute the solution of equations (109.5) and
(109.6), in which the non-linear term $\mathbf{D}^{(2)}$ in the induction is to be expressed in terms of the
fundamental wave field. The amplitudes of all these waves are expressed in terms of the
incident wave amplitude by means of the boundary conditions, which will not be discussed
here. The amplitudes of the waves with frequency $2\omega$ are, of course, small in proportion to
the non-linear susceptibility.†

The refracted waves are propagated into the crystal as if it were infinite. The non-linearity
effects build up during this process, and the harmonic intensities may become
large, with energy transferred to them from the fundamental frequency. This is the process
to be considered here. The conditions at the crystal surface then act only as “initial”
conditions which specify a small but non-zero amplitude of the second-harmonic field. The
same conditions specify, for a given direction of the incident wave, the wave vectors $\mathbf{k}_1$ and
$\mathbf{k}_2$ of the first and second harmonics in the crystal.

It will be seen later that an effective energy transfer occurs only if the synchronism
condition holds for the fundamental and the harmonic:‡

$$\mathbf{k}_2 \approx 2\mathbf{k}_1. \qquad (110.2)$$

It should be emphasized that the presence of dispersion is of fundamental importance in
formulating the problem of the generation of just one second harmonic. When dispersion
is absent, the condition (110.2) is necessarily satisfied in refraction, as are similar
conditions for the higher harmonics ($\mathbf{k}_3 \approx 3\mathbf{k}_1$, and so on). When dispersion is
present, this is not so, and it can be assumed that if the synchronism condition is satisfied
for the second harmonic it is not satisfied for the others. It should be emphasized that the
condition (110.2) itself can in practice be satisfied only if the fundamental wave and the
harmonic have different types of polarization and therefore different dispersion relations.

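The field in the medium may be written as a superposition of two waves:

$$\mathbf{E} = \mathbf{E}_1 + \mathbf{E}_2 = \mathrm{re}\,[\mathbf{e}_1 E_{10} e^{i(\mathbf{k}_1\cdot\mathbf{r} - \omega t)} + \mathbf{e}_2 E_{20} e^{i(\mathbf{k}_2\cdot\mathbf{r} - 2\omega t)}], \qquad (110.3)$$

where, according to the condition (110.2),

$$\mathbf{k}_2 = 2\mathbf{k}_1 + \mathbf{q} \qquad (110.4)$$

with $q \ll k_1$. The wave amplitudes are written as products $\mathbf{E}_0 = \mathbf{e}E_0$, with $\mathbf{e}$ a unit
polarization vector ($\mathbf{e}\cdot\mathbf{e}^* = 1$). In the linear approximation these amplitudes would be
constant; when the non-linearity is taken into account, they are functions of the
coordinates which vary only slowly (i.e. only slightly over distances $\sim 1/k_1$).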

The equations for the amplitudes of the two waves are found by substituting (110.3) in
Maxwell’s equations (109.5), (109.6), and collecting terms with the same time dependence.
We shall not go through these straightforward but laborious calculations in detail, but give
only some comments on fundamental points.


† The calculation of the reflection and refraction conditions at the boundary of a non-linear medium is
discussed for several particular cases by N. Bloembergen and P. S. Pershan, Physical Review 128, 606, 1962.

‡ The nature of this condition is particularly clear from the quantum standpoint, the second-harmonic
generation being regarded as a “fusion” of two photons into one. The equation $\hbar\mathbf{k}_2 = 2\hbar\mathbf{k}_1$ expresses the fact
We shall seek solutions describing the steady generation of the second harmonic in the
crystal by a travelling fundamental wave. Such solutions are independent of time. With
exact synchronism ($q = 0$) the equations for the amplitudes would not explicitly contain
the coordinates at all; with inexact synchronism, the coordinates appear only as $\mathbf{q}\cdot\mathbf{r}$ in the
factor $e^{-i\mathbf{q}\cdot\mathbf{r}}$. Taking the direction of $\mathbf{q}$ as the $z$-axis, we can therefore look for solutions that
depend only on $z$. With the above formulation (a wave $\omega$ incident on the surface of the
crystal), the problem is homogeneous in planes parallel to the surface. The $z$-axis is
therefore perpendicular to the surface; the vector $\mathbf{q}$ necessarily is so also, according to the
boundary conditions.

In the linear approximation, waves in an anisotropic (non-gyrotropic) medium are
linearly polarized (see §97); for these $\mathbf{e}$ can be defined as a real vector, and such will be
understood here by $\mathbf{e}_1$ and $\mathbf{e}_2$ for the two waves. If the amplitude $\mathbf{E}_0$ of each wave is resolved
along the directions of $\mathbf{e}$, $\mathbf{k}$ and $\mathbf{e}\times\mathbf{k}$, then the components in the two latter directions are
small because the derivatives $dE_0/dz$ (which appear as a result of the non-linearity) are
small. The components along $\mathbf{e}$ are approximately equal to the magnitudes $E_0$. Equations
for them are obtained by multiplying equation (109.5) by $\mathbf{e}_1$ and by $\mathbf{e}_2$. Since waves with
$E_0$ = constant are exact solutions of Maxwell’s equations in the linear approximation, the
linear terms in these equations that do not involve derivatives with respect to $z$ cancel out.
The terms involving the components of $\mathbf{E}_0$ in the directions of $\mathbf{k}$ and $\mathbf{e}\times\mathbf{k}$, which might be
of the same order as those containing the derivatives $dE_0/dz$, are found to disappear when
the multiplication is carried out; this occurs because the induction $\mathbf{D}$ is orthogonal to both
$\mathbf{k}$ and $\mathbf{e}\times\mathbf{k}$ (see (97.3)).

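Since the amplitudes are assumed to vary only slowly with the coordinates, we can
neglect the second derivatives of $E_0$ with respect to $z$. Hence, for example, the expression

$$e_{2i}\{(\mathrm{curl}\,\mathrm{curl})_{ik} - (4\omega^2/c^2)\,\varepsilon_{ik}(2\omega)\}\, e_{2k} E_{20}\, e^{i\mathbf{k}_2\cdot\mathbf{r}},$$

in the equation for $\mathbf{E}_2$ multiplied by $\mathbf{e}_2$, becomes approximately

$$2i\,\mathbf{e}_2\cdot[\mathbf{k}_2\times(\mathbf{l}\times\mathbf{e}_2)]\, dE_{20}/dz,$$

where $\mathbf{l}$ is a unit vector in the $z$-direction, and similarly for $\mathbf{E}_1$.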
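The final equations obtained by these procedures are

$$\alpha_2\, dE_{20}/dz = -i\eta\, e^{-iqz} E_{10}^2, \qquad \alpha_1\, dE_{10}^*/dz = i\eta\, e^{-iqz} E_{10} E_{20}^*, \qquad (110.5)$$

with the notation†

$$\eta = (\omega^2/2c^2)\,\varepsilon_{i,kl}(\omega, \omega)\, e_{2i} e_{1k} e_{1l}, \qquad (110.6)$$

$$\alpha_1 = \mathbf{l}\cdot[\mathbf{e}_1\times(\mathbf{k}_1\times\mathbf{e}_1)], \qquad \alpha_2 = \tfrac{1}{2}\,\mathbf{l}\cdot[\mathbf{e}_2\times(\mathbf{k}_2\times\mathbf{e}_2)] \approx \mathbf{l}\cdot[\mathbf{e}_2\times(\mathbf{k}_1\times\mathbf{e}_2)]. \qquad (110.7)$$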


Multiplying the first equation (110.5) by $E_{20}^*$ and the second by $E_{10}$, and adding, we
obtain a first integral:

$$\alpha_1|E_{10}|^2 + \alpha_2|E_{20}|^2 = \text{constant} \equiv P. \qquad (110.8)$$


† The first equation (110.5) is derived from the terms [...] and the second from those in [...]. The relations
(108.13) have been used on the right-hand sides.
This expresses the fact that the total energy flux in the two waves in the $z$-direction is
constant.†

It is now convenient to change from complex quantities to real ones, namely the absolute
values and phases of $E_{10}$ and $E_{20}$. To make the equations as simple as possible, we define
new unknowns $\rho_1$, $\rho_2$, $\phi_1$, $\phi_2$ as dimensionless quantities by

$$E_{10} = (P/\alpha_1)^{1/2}\rho_1 e^{i\phi_1}, \qquad E_{20} = (P/\alpha_2)^{1/2}\rho_2 e^{i\phi_2}. \qquad (110.9)$$

The equations (110.5) are invariant under the transformation $\phi_1 \to \phi_1 + c$, $\phi_2 \to \phi_2 + 2c$. It
is therefore possible to separate the equations for the functions $\rho_1$, $\rho_2$ and the invariant
combination $2\phi_1 - \phi_2$, to obtain a closed set of equations

$$d\rho_1/d\zeta = -\rho_1\rho_2\sin\theta, \qquad d\rho_2/d\zeta = \rho_1^2\sin\theta, \qquad (110.10)$$

$$d\theta/d\zeta = -s - (2\rho_2 - \rho_1^2/\rho_2)\cos\theta, \qquad (110.11)$$

where

$$\theta = 2\phi_1 - \phi_2 - s\zeta, \qquad (110.12)$$

the dimensionless variable

$$\zeta = z\eta\sqrt{P/\alpha_1^2\alpha_2}, \qquad (110.13)$$

and the dimensionless parameter

$$s = (q/\eta)\sqrt{\alpha_1^2\alpha_2/P}. \qquad (110.14)$$

The first integral (110.8) then becomes

$$\rho_1^2 + \rho_2^2 = 1. \qquad (110.15)$$


Let us consider the case of exact synchronism, where $q = 0$ and therefore $s = 0$. Then
equations (110.10) and (110.11) have another first integral

$$\rho_1^2\rho_2\cos\theta = \text{constant} \equiv \delta, \qquad (110.16)$$

where $\delta^2 \le 4/27$, as is easily seen from equations (110.15) and (110.16) and the condition
$|\cos\theta| \le 1$. With these two integrals, the solution of equations (110.10) is reduced to
quadratures, in the form of the elliptic integral

$$\zeta = \pm\tfrac{1}{2}\int_{\rho_2^2(0)}^{\rho_2^2(\zeta)} \frac{du}{[u(1-u)^2 - \delta^2]^{1/2}}; \qquad (110.17)$$

the choice of sign depends on the sign of the initial value of $\sin\theta$ for $\zeta = 0$. The cubic
equation

$$u(1-u)^2 - \delta^2 = 0 \qquad (110.18)$$

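† The time-averaged energy flux density in the $\mathbf{E}_1$ wave is

$$\bar{S}_{1z} = c\,\mathrm{re}\,\mathbf{l}\cdot[\mathbf{E}_{10}^*\times\mathbf{H}_{10}]/8\pi = c^2\,\mathbf{l}\cdot[\mathbf{E}_{10}^*\times(\mathbf{k}_1\times\mathbf{E}_{10})]/8\pi\omega = c^2\alpha_1|E_{10}|^2/8\pi\omega,$$

and similarly for the $\mathbf{E}_2$ wave.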
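has (when $\delta^2 \le 4/27$) three real positive roots, two of them less than unity, which we denote
by $\rho_a^2$ and $\rho_b^2$, with $\rho_a^2 \le \rho_b^2$. The function $\rho_2^2(\zeta)$ defined by (110.17) varies periodically
between these values, the period being

$$\int_{\rho_a^2}^{\rho_b^2} \frac{du}{[u(1-u)^2 - \delta^2]^{1/2}}. \qquad (110.19)$$

The function $\rho_1^2(\zeta) = 1 - \rho_2^2(\zeta)$ has its minimum where $\rho_2^2$ has its
maximum.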



The value $\rho_a^2$ is to be identified with the second harmonic $\rho_2^2(0)$ given by the boundary
conditions at the crystal surface ($z = 0$). We see that, in the $z$-direction into the crystal,
there is a periodic transfer of energy from the fundamental to the second harmonic and
back. When $\rho_2(0)$ decreases, the period of this process becomes longer, tending
logarithmically to infinity as $\rho_2(0) \to 0$. The limiting value $\rho_2(0) = \rho_a = 0$ corresponds to
the solution

$$\rho_1 = \mathrm{sech}\,\zeta, \qquad \rho_2 = \tanh\zeta, \qquad (110.20)$$

which is obtained from (110.17) for $\delta = 0$ by elementary integration; in this solution, the
second-harmonic amplitude increases monotonically, and as $\zeta \to \infty$ all the energy is
asymptotically transferred from the fundamental to the harmonic.
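
The limiting solution (110.20) can be checked by direct numerical integration. The sketch below works with complex dimensionless amplitudes $a_1 = \rho_1e^{i\phi_1}$, $a_2 = \rho_2e^{i\phi_2}$, for which equations (110.10) and (110.11) with $\theta = 2\phi_1 - \phi_2 - s\zeta$ are equivalent to $da_1/d\zeta = -ia_1^*a_2e^{is\zeta}$, $da_2/d\zeta = -ia_1^2e^{-is\zeta}$. This complex form, the function names and the use of a fixed-step fourth-order Runge-Kutta scheme are choices made here for illustration (the complex form avoids the division by $\rho_2$ in (110.11) when the harmonic starts from zero).

```python
import cmath
import math

def rhs(zeta, a1, a2, s):
    """Right-hand sides of the complex amplitude equations (assumed form)."""
    phase = cmath.exp(1j * s * zeta)
    da1 = -1j * a1.conjugate() * a2 * phase
    da2 = -1j * a1 * a1 / phase
    return da1, da2

def integrate(a1, a2, s, zeta_max, steps):
    """Fixed-step RK4 integration from zeta = 0 to zeta_max."""
    h = zeta_max / steps
    z = 0.0
    for _ in range(steps):
        k1 = rhs(z, a1, a2, s)
        k2 = rhs(z + h / 2, a1 + h / 2 * k1[0], a2 + h / 2 * k1[1], s)
        k3 = rhs(z + h / 2, a1 + h / 2 * k2[0], a2 + h / 2 * k2[1], s)
        k4 = rhs(z + h, a1 + h * k3[0], a2 + h * k3[1], s)
        a1 += h / 6 * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0])
        a2 += h / 6 * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1])
        z += h
    return a1, a2

if __name__ == "__main__":
    # Exact synchronism (s = 0), no harmonic at the surface.
    a1, a2 = integrate(1 + 0j, 0j, s=0.0, zeta_max=3.0, steps=3000)
    print(abs(a1), 1.0 / math.cosh(3.0))   # rho_1 compared with sech(zeta)
    print(abs(a2), math.tanh(3.0))         # rho_2 compared with tanh(zeta)
    print(abs(a1) ** 2 + abs(a2) ** 2)     # first integral (110.15)
```

The same routine with $s \ne 0$ shows the periodic exchange of energy described above, the sum $\rho_1^2 + \rho_2^2$ remaining equal to unity throughout.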

Let us now consider the opposite case where the amplitude $\rho_2$ remains everywhere small
in comparison with $\rho_1$. We shall see that this corresponds to a considerable loss of
synchronism of the waves.

When $\rho_2 \ll \rho_1$, we can as a first approximation regard $\rho_1$ as a constant $\rho_1(0)$, and write
the equations for $\rho_2$ and $\theta$ as

$$d\rho_2/d\zeta = \rho_1^2(0)\sin\theta, \qquad d\theta/d\zeta = -s + [\rho_1^2(0)/\rho_2]\cos\theta.$$

The solution of these equations that is zero at the initial point $\zeta = 0$ is

$$\rho_2(\zeta) = (2/s)\rho_1^2(0)\sin\tfrac{1}{2}s\zeta, \qquad \theta = \tfrac{1}{2}(\pi - s\zeta). \qquad (110.21)$$

These give the variation of the field over one interval $0 \le \zeta \le 2\pi/s$ (i.e. $0 \le z \le 2\pi/q$), after
which the process is repeated periodically.‡ The condition $\rho_2 \ll \rho_1$ signifies that we must
have $\rho_1^2(0)/s \ll \rho_1(0)$, i.e. $s \gg \rho_1(0)$, or

$$q z_0 \gg 1, \qquad z_0 \sim 1/\eta\rho_1(0)\sqrt{P}.$$

This is the condition for a comparatively large loss of synchronism. The value of $q z_0$
determines in all cases which effect limits the generation of the harmonic (i.e. the increase in
the amplitude $\rho_2$): whether it is the linear effect of loss of synchronism when $q z_0 \gg 1$, or
the non-linear effects when $q z_0 \ll 1$.§
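
That the weak-conversion solution satisfies the two approximate equations can be confirmed by substitution. The following worked check assumes the phase $\theta = \tfrac{1}{2}(\pi - s\zeta)$, which is the value consistent with the equations for $\rho_2$ and $\theta$ written above:

```latex
% Trial solution:
\rho_2 = \frac{2}{s}\,\rho_1^2(0)\sin\tfrac{1}{2}s\zeta, \qquad
\theta = \tfrac{1}{2}(\pi - s\zeta)
\;\Longrightarrow\;
\sin\theta = \cos\tfrac{1}{2}s\zeta, \quad \cos\theta = \sin\tfrac{1}{2}s\zeta .
% Equation for \rho_2:
\frac{d\rho_2}{d\zeta} = \rho_1^2(0)\cos\tfrac{1}{2}s\zeta = \rho_1^2(0)\sin\theta .
% Equation for \theta:
-s + \frac{\rho_1^2(0)}{\rho_2}\cos\theta
  = -s + \frac{s}{2\sin\tfrac{1}{2}s\zeta}\,\sin\tfrac{1}{2}s\zeta
  = -\tfrac{1}{2}s = \frac{d\theta}{d\zeta} .
% Largest harmonic amplitude:
\max\rho_2 = \frac{2\rho_1^2(0)}{s},
\qquad\text{so that } \rho_2 \ll \rho_1 \text{ requires } s \gg \rho_1(0).
```

The maximum of $\rho_2$ falls off as $1/s$, which expresses quantitatively how the loss of synchronism limits the conversion.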

Hitherto in this section, we have everywhere referred to second-harmonic generation
from the fundamental frequency. The equations also describe the converse effect, however,
namely the parametric amplification of a weak signal with frequency $\omega$ in the field of strong
radiation with frequency $2\omega$. Here we shall consider only this one process, but it is the
simplest case of a more general phenomenon, the amplification of signals with different
frequencies $\omega_2$ and $\omega_1 - \omega_2$ in the field of a strong wave with frequency $\omega_1$
(S. A. Akhmanov and R. V. Khokhlov, 1962; R. H. Kingston, 1962).

We should first of all emphasize the following difference between this process and
second-harmonic generation. The latter can begin from zero intensity of the harmonic. The
amplification of the fundamental, however, depends on a non-zero initial intensity of it,
however small: if $\rho_1(0) = 0$ at the starting-point, then it will remain so, for equations
(110.10) show that the derivatives of $\rho_1$ and $\rho_2$ of all orders vanish with $\rho_1$.

Let us again consider the case of exact synchronism, and in addition let the initial value
of the phase variable be $\theta(0) = -\tfrac{1}{2}\pi$; with exact synchronism, this value will be maintained.
Then $\delta = 0$ since $\cos\theta = 0$, even though the initial values of $\rho_1$ and $\rho_2$ are not zero. The
solution of equations (110.10) in this case is

$$\rho_1 = \mathrm{sech}(\zeta - \zeta_0), \qquad \rho_2 = -\tanh(\zeta - \zeta_0), \qquad (110.22)$$

where $\zeta_0 > 0$ is a constant. When $\zeta_0$ is large, the initial value $\rho_1(0) = \mathrm{sech}\,\zeta_0$ is small. We see
that the fundamental frequency waves are amplified in the $z$-direction into the crystal, at
the expense of the harmonic intensity. The latter decreases to zero at $\zeta = \zeta_0$ and then
increases again until asymptotically the whole intensity is concentrated in the harmonic.†


§111. Strong electromagnetic waves 


The formulation of the problem in §110 in terms of the generation of just one particular
harmonic was dependent on the presence of dispersion. Let us now consider the opposite
case, where dispersion may be regarded as absent throughout the relevant range of
frequencies, so that the induction $\mathbf{D}(t)$ at every point is determined by the value of $\mathbf{E}(t)$ at
the same instant.‡ We shall regard the medium as isotropic; $\mathbf{D}$ and $\mathbf{E}$ are then parallel. In
the present section, the non-linearity is not assumed small, and the function $D(E)$ is
therefore arbitrary.

Neglecting absorption and dispersion is of fundamental significance, in the sense that
the field equations then involve no parameters having the dimension of frequency or
(equivalently) of length. This makes it possible to construct an exact solution which
generalizes the usual one-dimensional plane wave of the linear approximation (A. V.
Gaponov and G. I. Freĭdman, 1959).§

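Let the wave be propagated in the $x$-direction, the electric field be in the $y$-direction, and
the magnetic field be in the $z$-direction; $E_y$ and $H_z$ will be denoted simply by $E$ and $H$.
Maxwell’s equations

$$\mathrm{curl}\,\mathbf{H} = \frac{1}{c}\frac{\partial\mathbf{D}}{\partial t}, \qquad \mathrm{curl}\,\mathbf{E} = -\frac{1}{c}\frac{\partial\mathbf{H}}{\partial t}$$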

† When $\zeta > \zeta_0$, the phase variable is to be given the value $\theta = \tfrac{1}{2}\pi$ and the sign of the hyperbolic tangent in
$\rho_2(\zeta)$ changed.

‡ For uniformity of exposition in this chapter, we shall continue to refer to a non-linear relation between $\mathbf{D}$
and $\mathbf{E}$, assuming the medium non-magnetic. Actually, the phenomena in question usually relate to media with a
non-linear dependence of $\mathbf{B}$ on $\mathbf{H}$.

§ This solution is analogous to the simple waves in one-dimensional flow of a compressible fluid (see
FM, §94).
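become

$$\frac{\partial H}{\partial x} = -\frac{1}{c}\frac{\partial D}{\partial t} = -\frac{\varepsilon}{c}\frac{\partial E}{\partial t}, \qquad \frac{\partial E}{\partial x} = -\frac{1}{c}\frac{\partial H}{\partial t}, \qquad (111.1)$$

where by definition

$$\varepsilon(E) = dD/dE. \qquad (111.2)$$

As $E \to 0$, $\varepsilon(E)$ tends to the ordinary permittivity $\varepsilon_0$.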

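We shall seek a solution in which the functions $E(t, x)$ and $H(t, x)$ may be expressed as
functions of each other: $H = H(E)$. Equations (111.1) may then be written as

$$\frac{dH}{dE}\frac{\partial E}{\partial x} + \frac{\varepsilon}{c}\frac{\partial E}{\partial t} = 0, \qquad \frac{\partial E}{\partial x} + \frac{1}{c}\frac{dH}{dE}\frac{\partial E}{\partial t} = 0. \qquad (111.3)$$

In order for these two equations to be satisfied by non-zero values of the unknowns $\partial E/\partial t$
and $\partial E/\partial x$, their determinant must be zero. From this condition, $(dH/dE)^2 = \varepsilon(E)$, and so

$$H = \pm\int_0^E \sqrt{\varepsilon(E)}\,dE. \qquad (111.4)$$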


Now substituting $dH/dE$ from (111.4) in one of the equations (111.3), we have

$$\left(\frac{\partial x}{\partial t}\right)_E = -\frac{(\partial E/\partial t)_x}{(\partial E/\partial x)_t} = \pm\frac{c}{\sqrt{\varepsilon}}.$$

Hence $x \mp ct/\sqrt{\varepsilon}$ may be any function of $E$. Denoting the inverse function by $f$, we have

$$E = f(x \mp ct/\sqrt{\varepsilon(E)}); \qquad (111.5)$$

the two signs here correspond to the two directions of propagation of the wave. When the
function $f$ has been chosen, (111.5) determines implicitly the function $E(t, x)$. In weak
fields, when we can take $\varepsilon = \varepsilon_0$, (111.5) becomes an ordinary plane wave with phase velocity
$c/\sqrt{\varepsilon_0}$. The solution found exists only for $\varepsilon > 0$, in accordance with the stability condition
(18.8).†

As the wave is propagated, the profile which it has at the initial instant becomes
distorted, since different parts of it move at different velocities. Usually, $\varepsilon(E)$ decreases with
increasing $E$, and tends to saturation. Then the points on the profile where $E$ is greater will
travel at higher speeds, so that the slope of the forward front becomes steeper; Fig. 60
shows the profile at successive times. At some instant, the profile turns over and would then
have to become no longer one-valued. In reality, at this instant an electromagnetic shock



† The derivation given here assumes that the density, temperature, etc. of the medium are not affected by the
field oscillations. This is justified both by the smallness of the striction effects and by the largeness of the wave
propagation velocity relative to that of sound.
wave (a discontinuity of $E$ and $H$) is formed. The boundary conditions at the discontinuity
have the same form (76.13) as on any moving surface. For a transverse plane wave they are

$$H_2 - H_1 = v(D_2 - D_1)/c, \qquad E_2 - E_1 = v(H_2 - H_1)/c, \qquad (111.6)$$

where the suffixes 1 and 2 denote the values of quantities before and behind the front
respectively. Multiplication of these two equations gives for the shock wave velocity

$$v^2 = c^2(E_2 - E_1)/(D_2 - D_1). \qquad (111.7)$$


In a shock wave there is dissipation of energy. Let $Q$ be the rate of dissipation per unit
area of the discontinuity surface. To calculate $Q$, we write the law of energy conservation
for a cylindrical volume element in the medium with one end on each side of the
discontinuity:

$$c(E_2H_2 - E_1H_1)/4\pi = v(U_2 - U_1) + Q. \qquad (111.8)$$

The left-hand side is the difference of the energy fluxes through the two ends; the right-hand
side is the sum of the rate of change of the internal energy through the movement of
the boundary of regions 1 and 2, and the energy dissipated at the boundary. The difference
of the internal energies is (for constant density and temperature)

$$U_2 - U_1 = \frac{1}{4\pi}\int_1^2 E\,dD + \frac{1}{8\pi}(H_2^2 - H_1^2).$$

Using also equations (111.6) and (111.7), we can bring (111.8) to the form

$$Q = \frac{v}{4\pi}\left\{\tfrac{1}{2}(D_2 - D_1)(E_2 + E_1) - \int_1^2 E\,dD\right\}.$$

If the shock wave is weak, i.e. the discontinuities in it are small, in calculating $Q$ we can
express the relation between $D$ and $E$ as an expansion

$$D(E) = D_1 + \varepsilon(E_1)(E - E_1) + \tfrac{1}{2}\varepsilon'(E_1)(E - E_1)^2,$$

where $\varepsilon'(E) = d^2D/dE^2$. A straightforward calculation gives

$$Q = -v\varepsilon'(E_1)(E_2 - E_1)^3/48\pi. \qquad (111.9)$$

The energy dissipation in a weak electromagnetic shock wave is therefore of the third order
in the field discontinuity in the shock. Since $Q > 0$, when $\varepsilon' < 0$ we must have $E_2 > E_1$, as
shown in Fig. 60.
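
The third-order law (111.9) can be compared with the full expression for $Q$ for a definite model. The sketch below takes the hypothetical saturating relation $D(E) = \varepsilon_0E - \tfrac{1}{3}bE^3$, so that $\varepsilon(E) = \varepsilon_0 - bE^2$ and $\varepsilon' = -2bE$; the model, the numerical constants, the function names and the units with $c = 1$ are all assumptions made for the illustration.

```python
import math

EPS0, B = 4.0, 0.5   # hypothetical model parameters: eps(E) = EPS0 - B*E**2

def D(E):
    """Induction for the model D(E) = EPS0*E - B*E**3/3."""
    return EPS0 * E - B * E ** 3 / 3.0

def eps_prime(E):
    """d(eps)/dE = d^2 D / dE^2 for the model."""
    return -2.0 * B * E

def int_E_dD(E1, E2):
    """Integral of E dD = E*eps(E) dE between E1 and E2, done analytically."""
    F = lambda E: EPS0 * E * E / 2.0 - B * E ** 4 / 4.0
    return F(E2) - F(E1)

def shock_speed(E1, E2, c=1.0):
    """Velocity of the discontinuity from (111.7)."""
    return c * math.sqrt((E2 - E1) / (D(E2) - D(E1)))

def Q_exact(E1, E2, c=1.0):
    """Dissipation rate from the full expression following (111.8)."""
    v = shock_speed(E1, E2, c)
    bracket = 0.5 * (D(E2) - D(E1)) * (E2 + E1) - int_E_dD(E1, E2)
    return v / (4.0 * math.pi) * bracket

def Q_weak(E1, E2, c=1.0):
    """Weak-shock approximation (111.9)."""
    v = shock_speed(E1, E2, c)
    return -v * eps_prime(E1) * (E2 - E1) ** 3 / (48.0 * math.pi)

if __name__ == "__main__":
    for dE in (0.2, 0.02, 0.002):
        E1, E2 = 1.0, 1.0 + dE
        print(dE, Q_exact(E1, E2), Q_weak(E1, E2), Q_exact(E1, E2) / Q_weak(E1, E2))
```

For this model the ratio of the two expressions works out to $1 + (E_2 - E_1)/2E_1$, so the approximation improves linearly as the discontinuity becomes smaller, and $Q$ is positive for $E_2 > E_1$ as required when $\varepsilon' < 0$.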

The presence of the shock wave makes the solution derived above no longer valid: the
expressions (111.4), (111.5) for the field contradict the boundary conditions (111.6). It is
important to note, however, that the wave remains approximately (up to and including
second-order quantities) a simple wave so long as the shock wave may be regarded as
weak.† To this accuracy, the expression for the velocity of the discontinuity can be written
as

$$v = c\,[\tfrac{1}{2}\{\varepsilon(E_1) + \varepsilon(E_2)\}]^{-1/2}. \qquad (111.10)$$

In the same approximation, the position of the discontinuity on the wave profile is
determined by the equality of the two areas between the vertical line and the dashed curve
in Fig. 60.

† The situation here is exactly analogous to the formation of ordinary hydrodynamic shock waves in a strong
sound wave (see FM, §95), and the arguments will not be repeated in full.


PROBLEM 

A plane wave $E = f(t - x/c)$ is incident normally from a vacuum on to the surface of a medium. Determine the
reflected wave (L. J. F. Broer, 1963).

Solution. The field in the vacuum (the half-space $x < 0$) consists of the incident wave and the reflected wave
(suffix $r$):

$$E = f_i(t - x/c) + f_r(t + x/c), \qquad H = f_i(t - x/c) - f_r(t + x/c);$$

in the vacuum, the field equations are linear and the two solutions may be added. In the medium ($x > 0$) there is
only the transmitted wave, in which

$$E = f_t[t - (x/c)\sqrt{\varepsilon(E)}], \qquad H = \int_0^E \sqrt{\varepsilon(E)}\,dE.$$

The continuity condition on the electric field at $x = 0$ gives $f_t(t) = f_i(t) + f_r(t)$. The continuity of $H$ at $x = 0$ then
gives

$$f_i(t) - f_r(t) = \int_0^{f_i + f_r} \sqrt{\varepsilon(E)}\,dE,$$

and this implicitly determines the function $f_r$.
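
The implicit relation for the reflected wave is easily solved numerically at each instant. The following sketch finds $f_r$ for a given value of $f_i$ by bisection, the integral being evaluated by Simpson's rule. The function names and the model $\varepsilon(E) = \varepsilon_0 - bE^2$ used in the example are assumptions for illustration; the only requirement is that $\varepsilon(E) > 0$ over the range of fields involved.

```python
import math

def sqrt_eps_integral(E, eps, n=200):
    """Simpson's rule for the integral of sqrt(eps(E')) from 0 to E (n even)."""
    h = E / n
    total = math.sqrt(eps(0.0)) + math.sqrt(eps(E))
    for i in range(1, n):
        total += (4 if i % 2 else 2) * math.sqrt(eps(i * h))
    return total * h / 3.0

def reflected_amplitude(fi, eps, tol=1e-12):
    """Solve f_i - f_r = integral_0^{f_i + f_r} sqrt(eps) dE for f_r by bisection.

    The root is bracketed by |f_r| <= |f_i|, which holds for eps > 0.
    """
    g = lambda fr: fi - fr - sqrt_eps_integral(fi + fr, eps)
    lo, hi = -abs(fi), abs(fi)
    g_lo = g(lo)
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        g_mid = g(mid)
        if (g_mid > 0) == (g_lo > 0):
            lo, g_lo = mid, g_mid      # root lies in the upper half
        else:
            hi = mid                   # root lies in the lower half
        if hi - lo < tol:
            break
    return 0.5 * (lo + hi)

if __name__ == "__main__":
    linear = lambda E: 4.0                    # constant permittivity
    model = lambda E: 4.0 - 0.5 * E * E       # hypothetical non-linear medium
    print(reflected_amplitude(0.3, linear))   # linear limit of the relation
    print(reflected_amplitude(0.3, model))
```

For a constant permittivity $\varepsilon_0$ the relation reduces to $f_r = f_i(1 - \sqrt{\varepsilon_0})/(1 + \sqrt{\varepsilon_0})$, the ordinary reflection formula for normal incidence, which provides a check on the routine.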

§112. Stimulated Raman scattering 

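The third-order non-linear effects include the influence of radiation with some
frequency $\omega_1$ (the pump wave) on the propagation of a wave with a different frequency $\omega_2$ in
the same medium. Such effects are contained in the non-linear permittivity

$$\varepsilon_{i,klm}(\omega_2, \omega_1, -\omega_1) \qquad (112.1)$$

which contributes to the induction with frequency $\omega_2$.†

In an isotropic medium, the induction $\mathbf{D}_2$ at the frequency $\omega_2$, including the
contribution mentioned, is given by

$$\mathbf{D}_2 = \varepsilon_2\mathbf{E}_2 + \alpha_2(\mathbf{E}_1\cdot\mathbf{E}_1^*)\mathbf{E}_2 + \beta_2\mathbf{E}_1(\mathbf{E}_1^*\cdot\mathbf{E}_2) + \gamma_2\mathbf{E}_1^*(\mathbf{E}_1\cdot\mathbf{E}_2), \qquad (112.2)$$

where

$$\mathbf{E}_1 = \mathbf{E}_{10}e^{i(\mathbf{k}_1\cdot\mathbf{r} - \omega_1t)}, \qquad \mathbf{E}_2 = \mathbf{E}_{20}e^{i(\mathbf{k}_2\cdot\mathbf{r} - \omega_2t)}. \qquad (112.3)$$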


† The condition that the third-order permittivity $\varepsilon_{i,klm}(\omega_1, \omega_2, \omega_3)$ be real requires that not only the
frequencies $\omega_1$, $\omega_2$, $\omega_3$ and their sum $\omega_4$ but also their sums in pairs should not coincide with differences of energy
levels in the system; for the susceptibility (112.1), these sums are $\omega_1 + \omega_2$ and $|\omega_1 - \omega_2|$. This may be seen by
tracing the origin of the energy denominators in the expression mentioned in §108 for the non-linear
susceptibility in terms of the matrix elements of the interaction between the medium and the field; this expression
has been given by J. A. Armstrong, N. Bloembergen, J. Ducuing and P. S. Pershan, Physical Review 127, 1918,
1962.
In the first term in (112.2), $\varepsilon_2 \equiv \varepsilon(\omega_2)$ is the ordinary linear permittivity; in the other terms,
$\alpha_2$, $\beta_2$ and $\gamma_2$ are three independent components of the tensor (112.1), their number being
obvious from the way in which (112.2) is constructed from the three vectors $\mathbf{E}_1$, $\mathbf{E}_1^*$ and $\mathbf{E}_2$.
We see that the non-linear action of the field $\mathbf{E}_1$ on the field with frequency $\omega_2$ can be
described by means of an anisotropic permittivity

$$\varepsilon_{2ik} = (\varepsilon_2 + \alpha_2\mathbf{E}_1\cdot\mathbf{E}_1^*)\delta_{ik} + \beta_2E_{1i}E_{1k}^* + \gamma_2E_{1i}^*E_{1k}. \qquad (112.4)$$

In a non-dissipative medium, the coefficients $\alpha_2$, $\beta_2$ and $\gamma_2$ are, like $\varepsilon_2$, real, and the tensor
(112.4) is Hermitian. When $\omega_1 \to 0$ and accordingly $\mathbf{E}_1$ is real, it includes as a particular
case double refraction in a static electric field, described by (100.1). When $\omega_1 \ne 0$, (112.4)
also describes the gyrotropy of the medium induced by the field $\mathbf{E}_1$. Comparison of (112.2)
with (101.8) gives the gyration vector

$$\mathbf{g} = \tfrac{1}{2}i(\beta_2 - \gamma_2)\,\mathbf{E}_1^*\times\mathbf{E}_1. \qquad (112.5)$$

This is zero if the field $\mathbf{E}_1$ is linearly polarized.

A greater variety of phenomena can occur if the non-linear interactions of the field with
the medium are accompanied by dissipation. In that case, the coefficients $\alpha_2$, $\beta_2$ and $\gamma_2$ are
complex; the linear permittivity will again be supposed real. It is found that such
dissipation can cause either attenuation or amplification of the field $\mathbf{E}_2$. In the latter case
we have stimulated Raman scattering.†

Since the linear permittivities $\varepsilon(\omega_1)$ and $\varepsilon(\omega_2)$ are real, there is no dissipation in the
medium at the frequencies $\omega_1$ and $\omega_2$: the quanta $\hbar\omega_1$ and $\hbar\omega_2$ themselves are not
absorbed. Let the difference $\omega_1 - \omega_2$, but not the sum $\omega_1 + \omega_2$, lie in the frequency range
where the medium is capable of absorption. The dissipation takes place only by the
conversion of quanta of higher energy into quanta of lower energy, the excess released
being transferred to the medium. When $\omega_1 > \omega_2$, therefore, the pump wave amplifies the
wave with the lower frequency $\omega_2$. The time-averaged energy gained by the field $\mathbf{E}_2$ per unit
time and volume through the weak non-linear effects is just minus the expression (96.5):


dU2 

dt 


'"2 , * 


hki)^2i^2k* 


= -^|a2"|EiP|E2p + ^2"|Ei-E2p+T2"|Ei-E2*|"|; » (112.6) 

cf. the derivation of equations (108.9). A similar expression gives the change in energy of the
field $\mathbf{E}_1$:

-^|ar|Eil^|E2p + ^i"|Ei-E2p+y."|E.-E2*|"|, (112-7) 

where $\alpha_1$, $\beta_1$ and $\gamma_1$ are independent components of the permittivity tensor $\varepsilon_{i,klm}(\omega_1, \omega_2, -\omega_2)$,
which describes the effect of a field with frequency $\omega_2$ on one with frequency $\omega_1$.


† From the quantum microscopic standpoint, this refers to the emission of a photon $\hbar\omega_2$ when a photon $\hbar\omega_1$ is
incident on atoms in a field of photons $\hbar\omega_2$. The energy $\hbar(\omega_1 - \omega_2)$ is transferred to the medium with the
formation of an elementary excitation of the medium such as a phonon or an exciton. The literature of the subject
uses specific names for various types of scattering process. In our purely phenomenological description, we use the
term mentioned, as a conventional general name.



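Arguments similar to those used in §107 to derive the Manley-Rowe theorem show that

$$\frac{1}{\omega_1}\frac{dU_1}{dt} = -\frac{1}{\omega_2}\frac{dU_2}{dt}: \qquad (112.8)$$

for each quantum $\hbar\omega_2$ created, one quantum $\hbar\omega_1$ disappears. Hence it follows that

$$\alpha_1'' = -\alpha_2'', \qquad \beta_1'' = -\beta_2'', \qquad \gamma_1'' = -\gamma_2''. \qquad (112.9)$$

The energy dissipated is given by the decrease in the total energy of the two fields:

$$Q = -\frac{dU_1}{dt} - \frac{dU_2}{dt} = \frac{\omega_1 - \omega_2}{\omega_2}\frac{dU_2}{dt}. \qquad (112.10)$$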


When $\omega_1 > \omega_2$, the condition $Q > 0$ shows that $dU_2/dt > 0$; that is, the wave with the
lower frequency is amplified, as stated above. The condition that the expression (112.6) be
positive is given by the inequalities†

$$\alpha_2'' < 0, \qquad \alpha_2'' + \beta_2'' < 0, \qquad \alpha_2'' + \gamma_2'' < 0, \qquad \alpha_2'' + \beta_2'' + \gamma_2'' < 0. \qquad (112.11)$$

This effect is independent of the phase relations between the fields, because the pump
wave field appears in the equations as expressions bilinear in $\mathbf{E}_1$ and $\mathbf{E}_1^*$, from which the
phase factors vanish. The ultimate result is that field synchronism is not necessary for
amplification of the field $\omega_2$, in contrast to the harmonic generation and parametric signal
amplification effects discussed in §110.

It is possible to relate the characteristics of the stimulated Raman scattering to those of
the ordinary (spontaneous) scattering to be discussed in Chapter XV. The relevant
calculations are given in §118, Problem.

The above relations are valid, as already mentioned, if energy is absorbed by the medium
only at the difference frequency $\omega_1 - \omega_2$. There is a different situation if the sum frequency
$\omega_1 + \omega_2$, not the difference, lies in the absorption range. In this case, for each quantum $\hbar\omega_2$
absorbed, a quantum $\hbar\omega_1$ is also absorbed, and an energy $\hbar(\omega_1 + \omega_2)$ is transferred to the
medium (two-photon absorption). In this case, of course, the waves of both frequencies are
attenuated.


† This may be seen by considering the values of (112.6) for various polarizations of the fields $\mathbf{E}_1 = \mathbf{e}_1E_1$ and
$\mathbf{E}_2 = \mathbf{e}_2E_2$: linear polarizations in parallel or perpendicular directions, circular polarizations with the same or
opposite signs. In the first two cases $\mathbf{e}_1$ and $\mathbf{e}_2$ are real, with $\mathbf{e}_1\cdot\mathbf{e}_2 = 1$ or 0; in the second two they are complex.



CHAPTER XIV 


THE PASSAGE OF FAST PARTICLES 
THROUGH MATTER 


§113. Ionization losses by fast particles in matter; the non-relativistic case 

A FAST charged particle, in passing through matter, ionizes the atoms and thereby loses 
energy.† In gases, the ionization losses can be regarded as being due to collisions between
the fast particle and individual atoms. In a solid or liquid medium, however, several atoms 
may interact simultaneously with the particle. The effect of this on the energy loss by the 
particle can be macroscopically regarded as resulting from the dielectric polarization of 
the medium by the charge. Let us first consider this effect for non-relativistic velocities of 
the particle. We shall see that the polarization of the medium then has only a slight effect 
on the losses. The derivation of this result is of interest because the method can be 
extended to other cases. 

Let us first of all ascertain the conditions under which the phenomenon can be
macroscopically considered. The spectral resolution of the field produced at a distance $r$
from the path of a particle moving with velocity $v$ consists chiefly of terms whose frequency
is of the order $v/r$ (the reciprocal of the “collision time”). The ionization of an atom can be
effected by field components of frequency $\omega \gtrsim \omega_0$, where $\omega_0$ is some mean frequency
corresponding to the motion of the majority of the electrons in the atom. The particle
therefore interacts simultaneously with many atoms if $v/\omega_0$ is large compared with the
interatomic distances. In solids and liquids these distances are of the same order of
magnitude as the dimension $a$ of the atoms themselves. Thus we obtain the condition
$v \gg a\omega_0$, i.e. the velocity of the ionizing particle must be large compared with the velocities
of the atomic electrons (or at least of the majority of them).‡

Let us now determine the field produced by a charged particle moving through matter.
In the non-relativistic case it is sufficient to consider only the electric field, defined by the
scalar potential $\phi$. This potential satisfies Poisson’s equation

$$\hat\varepsilon\,\Delta\phi = -4\pi e\,\delta(\mathbf{r} - \mathbf{v}t), \qquad (113.1)$$

in which the permittivity is written as an operator, and the expression $e\delta(\mathbf{r} - \mathbf{v}t)$ on the
right-hand side is the density due to a point charge $e$ moving with constant velocity $\mathbf{v}$.§


† We speak, as is customary, of “ionization losses”; these are, of course, understood to include losses due to
the excitation of atoms to discrete energy levels.

‡ The corresponding condition for the energy $E$ of the particle is $E \gg MI/m$, where $M$ is the mass of the
particle, $m$ that of the electron, and $I$ some mean ionization energy for the majority of the electrons in the atom.

§ We assume that the particle moves in a straight line, and thereby neglect scattering, as is always permissible in
problems of this type.

If the charge on the particle is $ze$, the formulae pertaining to the energy loss in this and the following
sections should be multiplied by $z^2$.
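We expand $\phi$ as a Fourier space integral:

$$\phi = \int \phi_{\mathbf{k}}\,e^{i\mathbf{k}\cdot\mathbf{r}}\,\frac{d^3k}{(2\pi)^3}.$$

Taking the Laplacian of this equation, we find that the Fourier component of $\Delta\phi$ is
$-k^2\phi_{\mathbf{k}}$.

Taking the Fourier component of equation (113.1) gives

$$\hat\varepsilon(\Delta\phi)_{\mathbf{k}} = -\int 4\pi e\,\delta(\mathbf{r} - \mathbf{v}t)\exp(-i\mathbf{k}\cdot\mathbf{r})\,dV = -4\pi e\exp(-it\mathbf{v}\cdot\mathbf{k}).$$

Thus $\hat\varepsilon\phi_{\mathbf{k}} = (4\pi e/k^2)\exp(-it\mathbf{v}\cdot\mathbf{k})$, and $\phi_{\mathbf{k}}$ therefore depends on time through a factor
$\exp(-it\mathbf{v}\cdot\mathbf{k})$. The operator $\hat\varepsilon$ acting on a function $\exp(-i\omega t)$ multiplies it by $\varepsilon(\omega)$.
Hence

$$\phi_{\mathbf{k}} = \frac{4\pi e}{k^2\varepsilon(\mathbf{k}\cdot\mathbf{v})}\exp(-it\mathbf{k}\cdot\mathbf{v}).$$

The Fourier components of the field and of the potential are related by $\mathbf{E}_{\mathbf{k}}\exp(i\mathbf{k}\cdot\mathbf{r}) =
-\mathrm{grad}\,[\phi_{\mathbf{k}}\exp(i\mathbf{k}\cdot\mathbf{r})] = -i\mathbf{k}\phi_{\mathbf{k}}\exp(i\mathbf{k}\cdot\mathbf{r})$. Thus

$$\mathbf{E}_{\mathbf{k}} = -\frac{4\pi ie\,\mathbf{k}}{k^2\varepsilon(\mathbf{k}\cdot\mathbf{v})}\exp(-it\mathbf{k}\cdot\mathbf{v}). \qquad (113.3)$$

The total field strength is obtained by inverting the Fourier transform:

$$\mathbf{E} = \int \mathbf{E}_{\mathbf{k}}\,e^{i\mathbf{k}\cdot\mathbf{r}}\,\frac{d^3k}{(2\pi)^3}. \qquad (113.4)$$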
The energy loss by the moving particle is just the work done by the force cE exerted on 
the particle by the field which it produces. Taking the value of the field at the point 
ccupied by the particle, namely r = vt, we obtain in the integrand in (113.4) a factor 
sxp (itv • k) which cancels with the factor exp (— itv • k) in the expression (113.3) for E^. 
ience the force F is 


It is evident that the direction of the force F is opposite to that of the velocity v; let the 
ter be the x-direction. Putting k^v = m, q = -^{ky^ + k^^) and replacing dk^dk^ by 
inq dq, we can write the magnitude of F as 


'The choice of qo is discussed below. 
The following remark should be made concerning the integration with respect to $\omega$ in
formula (113.5). As $\omega \to \infty$ the function $\varepsilon(\omega) \to 1$, and the integral is logarithmically
divergent. This happens because we ought to have subtracted from the field $\mathbf E$ the field
which would be present if the particle were moving in a vacuum (i.e. if $\varepsilon = 1$); this field
evidently does not affect the energy lost by the particle in matter.

If this subtraction were effected, $1/\varepsilon$ in the integrand of (113.5) would become $1/\varepsilon - 1$,
and the integral would converge. The same result can be obtained by taking the integration
from $-\Omega$ to $+\Omega$ and then letting $\Omega$ tend to infinity. Since the function $\varepsilon'(\omega)$ is even, the real
part of the integrand is an odd function of the frequency, and gives zero. The integral of the
imaginary part of the integrand converges.

In what follows we shall sometimes find it convenient to use the notation 

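$$1/\varepsilon(\omega) = \eta(\omega) = \eta' + i\eta'', \tag{113.6}$$

with $\eta'(\omega)$ and $\eta''(\omega)$ respectively even and odd functions, and $\eta'' = -\varepsilon''/|\varepsilon|^2 < 0$.
Formula (113.5) can be rewritten in the explicitly real form

$$F = \frac{2e^2}{\pi}\int_0^\infty\int_0^{q_0}\omega\,|\eta''(\omega)|\,\frac{q\,dq\,d\omega}{q^2v^2 + \omega^2}. \tag{113.7}$$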

The energy loss per unit path length is the work done by the force over that distance, which 
is just F; it is called the stopping power of the substance with respect to the particle. 

According to the general rules of quantum mechanics, the Fourier component of the
field whose wave vector is $\mathbf k$ transmits to the $\delta$-electron released in ionization a momentum
$\hbar\mathbf k$. For sufficiently large $q$ ($\gg \omega_0/v$) we have $k^2 = q^2 + \omega^2/v^2 \approx q^2$, so that the momentum
transferred is approximately $\hbar q$. A given value of $q$ corresponds to collisions with impact
parameter $\sim 1/q$. Hence the condition for the macroscopic treatment to be valid is $1/q \gg a$.
Accordingly, we take as the upper limit of integration a value $q_0$ such that $\omega_0/v \ll q_0 \ll 1/a$.

The quantity $F(q_0)$ is the energy loss of a fast particle with transfer of momentum not
exceeding $\hbar q_0$ to the atomic electron.

Integrating with respect to $q$ in (113.7), we obtain

$$F(q_0) = \frac{2e^2}{\pi v^2}\int_0^\infty\omega\,|\eta''(\omega)|\log\frac{q_0v}{\omega}\,d\omega. \tag{113.8}$$

This formula cannot be further transformed in a general manner, but it can be written in a
more convenient form as follows. We first calculate the integral

$$\int_0^\infty\omega\,\eta''(\omega)\,d\omega = -\tfrac12 i\int_{-\infty}^{\infty}(\omega/\varepsilon)\,d\omega.$$

To do so, we notice that, if the integration is taken in the complex $\omega$-plane along a contour
consisting of the real axis and a very large semicircle in the upper half-plane, the integral is
zero, since the integrand has no poles in the upper half-plane. For large values of $\omega$, the
function $\varepsilon(\omega)$ is given by formula (78.1):

$$\varepsilon(\omega) = 1 - 4\pi Ne^2/m\omega^2. \tag{113.9}$$

The integration along the large semicircle can be carried out by using this formula, and
the result is†

$$\int_0^\infty\omega\,|\eta''(\omega)|\,d\omega = -\frac{2\pi iNe^2}{m}\int\frac{d\omega}{\omega} = 2\pi^2Ne^2/m. \tag{113.10}$$

We define a mean frequency $\bar\omega$ of the motion of the atomic electrons by

$$\log\bar\omega = \frac{\int_0^\infty\omega\,\eta''(\omega)\log\omega\,d\omega}{\int_0^\infty\omega\,\eta''(\omega)\,d\omega} = \frac{m}{2\pi^2Ne^2}\int_0^\infty\omega\,|\eta''(\omega)|\log\omega\,d\omega. \tag{113.11}$$

Then formula (113.8) can be written

$$F(q_0) = (4\pi Ne^4/mv^2)\log(q_0v/\bar\omega). \tag{113.12}$$

The following remark should be made here. It might seem from the form of (113.7) or
(113.11) that the main contribution to the ionization losses (113.12) comes from
frequencies at which there is considerable absorption. This is not so; these formulae may
contain a considerable contribution from ranges in which $\varepsilon''$ is small. The reason is that in
such ranges the function $\varepsilon(\omega) \approx \varepsilon'(\omega)$ may pass through zero. It is seen from formula
(113.5) that the zeros of $\varepsilon(\omega)$ are poles of the integrand. In reality, of course, $\varepsilon''(\omega)$ is not
exactly zero, and so the zeros of $\varepsilon(\omega)$ are not on the real axis but just below it. Hence, when
the expression used for $\varepsilon(\omega)$ is real and passes through zero, the contour must be indented
upwards at the pole of the integrand, and so a contribution to the integral occurs. For
example, if the function $\varepsilon(\omega)$ is given by (84.5), the contribution to the energy loss (113.12)
from the poles $\pm\omega_1$ (where $\varepsilon(\omega_1) = 0$) is easily seen, by direct calculation from (113.7), to
be $(4\pi Ne^4/mv^2a^2)\log(q_0v/\omega_1)$.

In order to find the energy loss $F(q_1)$ with transfer of momentum not exceeding
some value $\hbar q_1 > \hbar q_0$, we must “join” formula (113.12) to that given by the quantum
theory of collisions, corresponding to energy loss by collisions with single atoms. This
can be done by using the fact that the ranges of applicability of the two formulae overlap.
As we know from the theory of collisions, the energy loss with transfer of momentum in a
range of $\hbar\,dq$ is

$$dF = (4\pi Ne^4/mv^2)\,dq/q, \tag{113.13}$$

and this formula is applicable (in the non-relativistic case) for any value of $q \gg \omega_0/v$ which
is compatible with the laws of conservation of momentum and energy, provided that the
energy transferred is small compared with the initial energy of the fast particle.‡ The
energy loss with all values of $q$ between $q_0$ and $q_1$ is accordingly $(4\pi Ne^4/mv^2)\log(q_1/q_0)$.
When this quantity is added to formula (113.12), $q_0$ is replaced by $q_1$, so that

$$F(q_1) = (4\pi Ne^4/mv^2)\log(q_1v/\bar\omega). \tag{113.14}$$

† This is the same as (82.12), as it should be, since, as $|\omega| \to \infty$, $|\varepsilon| \to 1$ and $\eta'' \to -\varepsilon''$.

‡ See QM, §149. The “effective retardation” used there differs from $F$ by a factor $N_a = N/Z$, the number
density of atoms.

The range of applicability as hitherto determined ($q \gg \omega_0/v$), however, extends to values of $q$ for which
the atomic electrons cannot be regarded as free. The condition for them to be so regarded is $q \gg mv_0/\hbar$,
where $v_0$ is of the order of the velocity of the majority of the atomic electrons; the energy of the
$\delta$-electron is then large compared with atomic energies.


If a momentum $\hbar q_1$ large compared with the atomic momenta is given to an atomic
electron, its energy is $E_1 = \hbar^2q_1^2/2m$. Thus we can write

$$F(E_1) = (2\pi Ne^4/mv^2)\log(2mv^2E_1/\hbar^2\bar\omega^2). \tag{113.15}$$

Formulae (113.14) and (113.15) give the energy loss of a fast particle by ionization with a
transfer of energy not exceeding a value $E_1$ that is small compared with the original energy
of the particle. It must be emphasized that with this condition the formulae are equally
valid for fast electrons and fast heavy particles. Formula (113.15) differs from the formula
derived from a microscopic discussion, neglecting interactions between atoms (QM,
(149.14)) only by the definition of the “ionization energy” $I$, which is here represented by
$\hbar\bar\omega$. The mean (with respect to the electrons) ionization energy of an atom is usually almost
independent of its interaction with other atoms, being determined mainly by the electrons 
of the inner shells, which are almost unaffected by that interaction. Moreover, this quantity 
appears here only in a logarithm, and so the exact definition of it has even less effect on the 
magnitude of the energy loss. 

In a collision between a heavy particle and an electron, even the maximum transferable
momentum $\hbar q_{\max}$ is small compared with the momentum $Mv$ of the particle. The change in
the energy of the heavy particle is therefore $\mathbf v\cdot\hbar\mathbf q$; equating this to the energy of the electron
gives $\hbar^2q^2/2m = \hbar\mathbf q\cdot\mathbf v \le \hbar qv$, whence $\hbar q_{\max} = 2mv$, and $E_{1,\max} = 2mv^2$. Substituting for $E_1$
in (113.15), we obtain as the total ionization energy loss by the fast particle

$$F = \frac{4\pi Ne^4}{mv^2}\log\frac{2mv^2}{\hbar\bar\omega}. \tag{113.16}$$

This differs from the usual expression (QM, (150.10)) only in the definition of the
ionization energy $\hbar\bar\omega$.
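As a small consistency check of the two formulae, the sketch below evaluates (113.15) and (113.16) in Gaussian units and confirms that inserting $E_{1,\max} = 2mv^2$ into (113.15) reproduces (113.16). The function names and the sample values of $v$, $N$ and $\bar\omega$ are illustrative assumptions only; they do not describe any particular substance.

```python
import math

# Gaussian (CGS) constants
E_CHARGE = 4.80320e-10     # electron charge, esu
M_ELECTRON = 9.10938e-28   # electron mass, g
HBAR = 1.05457e-27         # erg s

def loss_up_to_energy(v, e1, n_e, w_bar):
    """(113.15): energy loss per unit path (erg/cm) with energy transfer not exceeding e1."""
    pref = 2 * math.pi * n_e * E_CHARGE**4 / (M_ELECTRON * v**2)
    return pref * math.log(2 * M_ELECTRON * v**2 * e1 / (HBAR * w_bar) ** 2)

def total_loss_heavy(v, n_e, w_bar):
    """(113.16): total non-relativistic ionization loss of a heavy particle (erg/cm)."""
    pref = 4 * math.pi * n_e * E_CHARGE**4 / (M_ELECTRON * v**2)
    return pref * math.log(2 * M_ELECTRON * v**2 / (HBAR * w_bar))

# Hypothetical sample values: v = 0.1c, N = 3e23 electrons/cm^3, mean frequency 1e17 1/s.
v, n_e, w_bar = 3.0e9, 3.0e23, 1.0e17
e1_max = 2 * M_ELECTRON * v**2
print(total_loss_heavy(v, n_e, w_bar))
print(loss_up_to_energy(v, e1_max, n_e, w_bar))
```

The two printed values agree to rounding error, because $\log[(2mv^2)^2/\hbar^2\bar\omega^2] = 2\log(2mv^2/\hbar\bar\omega)$. For a particle of charge $ze$ both results would be multiplied by $z^2$.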

We can see how $\hbar\bar\omega$ defined by (113.11) becomes, in a rarefied medium, the mean
ionization energy of a single atom given by QM, (149.11). To do so, we note that in a
rarefied gas, which for simplicity we suppose to consist of uniform atoms, the permittivity
is $\varepsilon = 1 + 4\pi N_a\alpha(\omega)$, where $N_a$ is the number of atoms per unit volume, $\alpha(\omega)$ the
polarizability of one atom; here $|\varepsilon - 1| \ll 1$. The imaginary part of $\eta = 1/\varepsilon$ is
$\eta'' \approx -\varepsilon'' = -4\pi N_a\alpha''(\omega)$. The polarizability of the atom is given by QED, (85.13); separating the
imaginary part by means of QED (75.19), we have when $\omega > 0$

$$\alpha'' = \tfrac13\pi\sum_n|\mathbf d_{0n}|^2\,\delta(E_n - E_0 - \hbar\omega),$$

where $E_0$ and $E_n$ are the energies of the ground state and excited states of the atom.
Substitution of this expression in (113.11) and carrying out the integration, with $N = N_aZ$,
gives the definition in QM, (149.11).
§114. Ionization losses by fast particles in matter: the relativistic case

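At velocities comparable with that of light, the effect of the polarization of the medium
on its stopping power with respect to a fast particle may become very important even in
gases.†

To derive the appropriate formulae, we use a method analogous to that used in §113, but
it is now necessary to begin from the complete Maxwell’s equations. When extraneous
charges are present with volume density $\rho_{\rm ex}$, and extraneous currents with density $\mathbf j_{\rm ex}$, these
equations are‡

$$\operatorname{div}\mathbf H = 0, \qquad \operatorname{curl}\mathbf E = -\frac1c\frac{\partial\mathbf H}{\partial t}, \tag{114.1}$$

$$\operatorname{div}\hat\varepsilon\mathbf E = 4\pi\rho_{\rm ex}, \qquad \operatorname{curl}\mathbf H = \frac1c\frac{\partial\hat\varepsilon\mathbf E}{\partial t} + \frac{4\pi}{c}\mathbf j_{\rm ex}. \tag{114.2}$$

In the present case the extraneous charge and current distribution are given by

$$\rho_{\rm ex} = e\,\delta(\mathbf r - \mathbf vt), \qquad \mathbf j_{\rm ex} = e\mathbf v\,\delta(\mathbf r - \mathbf vt). \tag{114.3}$$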

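We introduce scalar and vector potentials, with the usual definitions:

$$\mathbf H = \operatorname{curl}\mathbf A, \qquad \mathbf E = -\frac1c\frac{\partial\mathbf A}{\partial t} - \operatorname{grad}\phi, \tag{114.4}$$

so that equations (114.1) are satisfied identically. The additional condition

$$\operatorname{div}\mathbf A + \frac{\hat\varepsilon}{c}\frac{\partial\phi}{\partial t} = 0 \tag{114.5}$$

is imposed on the potentials $\mathbf A$ and $\phi$; this is a generalization of the usual Lorentz condition
in the theory of radiation. Then, substituting (114.4) in (114.2), we obtain the following
equations for the potentials:

$$\Delta\mathbf A - \frac{\hat\varepsilon}{c^2}\frac{\partial^2\mathbf A}{\partial t^2} = -\frac{4\pi}{c}e\mathbf v\,\delta(\mathbf r - \mathbf vt),$$

$$\hat\varepsilon\left(\Delta\phi - \frac{\hat\varepsilon}{c^2}\frac{\partial^2\phi}{\partial t^2}\right) = -4\pi e\,\delta(\mathbf r - \mathbf vt). \tag{114.6}$$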


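We expand $\mathbf A$ and $\phi$ as Fourier space integrals. Taking the Fourier components of
equations (114.6), we have

$$k^2\mathbf A_{\mathbf k} + \frac{\hat\varepsilon}{c^2}\frac{\partial^2\mathbf A_{\mathbf k}}{\partial t^2} = \frac{4\pi e\mathbf v}{c}\,e^{-it\mathbf v\cdot\mathbf k},$$

$$\hat\varepsilon\left(k^2\phi_{\mathbf k} + \frac{\hat\varepsilon}{c^2}\frac{\partial^2\phi_{\mathbf k}}{\partial t^2}\right) = 4\pi e\,e^{-it\mathbf v\cdot\mathbf k}.$$

† This effect was pointed out by E. Fermi (1940), who performed the calculation for the particular case of a gas
whose atoms are regarded as harmonic oscillators. The general derivation given here is due to L. Landau.

‡ We put $\mu(\omega) = 1$, since matter does not exhibit magnetic properties at the frequencies important as regards
ionization losses.

Hence we see that $\mathbf A_{\mathbf k}$ and $\phi_{\mathbf k}$ depend on time through a factor $\exp(-it\mathbf v\cdot\mathbf k)$. We again put
$\omega = \mathbf k\cdot\mathbf v = k_xv$, and obtain

$$\mathbf A_{\mathbf k} = \frac{4\pi e}{c}\,\frac{\mathbf v\,e^{-i\omega t}}{k^2 - \omega^2\varepsilon(\omega)/c^2}, \qquad \phi_{\mathbf k} = \frac{4\pi e}{\varepsilon(\omega)}\,\frac{e^{-i\omega t}}{k^2 - \omega^2\varepsilon(\omega)/c^2}. \tag{114.7}$$

The Fourier component of the electric field is

$$\mathbf E_{\mathbf k} = i\omega\mathbf A_{\mathbf k}/c - i\mathbf k\phi_{\mathbf k}. \tag{114.8}$$

From these formulae the force $\mathbf F = e\mathbf E$ acting on the particle is found in the same way as
in §113.† Using the same notation, we now have

$$F = \frac{ie^2}{\pi}\int_{-\infty}^{\infty}\int_0^{q_0}\frac{\left(\dfrac{1}{v^2} - \dfrac{\varepsilon}{c^2}\right)\omega\,q\,dq\,d\omega}{\varepsilon\left[q^2 - \omega^2\left(\dfrac{\varepsilon}{c^2} - \dfrac{1}{v^2}\right)\right]}. \tag{114.9}$$

As $c \to \infty$ this formula tends, of course, to (113.5).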

Let us first carry out the integration with respect to frequency. In order to effect an
integration in the complex $\omega$-plane, we first ascertain the poles of the integrand in the
upper half-plane. The function $\varepsilon(\omega)$ has no singularity and no zero in this half-plane, and so
the required poles can only be the zeros of the expression

$$q^2 - \omega^2\left(\frac{\varepsilon}{c^2} - \frac{1}{v^2}\right).$$

We shall show that, for any value of the positive real quantity $q^2$, this expression vanishes
for only one value of $\omega$.

To prove this,‡ we use a theorem in the theory of functions of a complex variable: the
integral

$$\frac{1}{2\pi i}\oint_C\frac{df(\omega)}{d\omega}\,\frac{d\omega}{f(\omega) - a}, \tag{114.10}$$

taken along a closed contour $C$, is equal to the difference between the numbers of zeros and
poles of $f(\omega) - a$ in the region bounded by $C$. Let

$$f(\omega) = \omega^2\left[\frac{\varepsilon(\omega)}{c^2} - \frac{1}{v^2}\right],$$

$a = q^2$ be a positive real number, and $C$ be a contour consisting of the real axis and
a very large semicircle (Fig. 61). The function $f(\omega)$ has no pole in the upper half-plane
or on the real axis; the integral (114.10) therefore gives the number of zeros of the function
$f(\omega) - a$ in the upper half-plane. To calculate its value, we write it as

$$\frac{1}{2\pi i}\oint_{C'}\frac{df}{f - a}, \tag{114.11}$$

the integration being taken along a contour $C'$ in the plane of the complex variable $f$
which is the map of the contour $C$ from the $\omega$-plane. For $\omega = 0$, $f = 0$. For positive real $\omega$ we have
$\operatorname{im} f > 0$, and for negative real $\omega$, $\operatorname{im} f < 0$. At infinity $f \to -\omega^2[(1/v^2) - (1/c^2)]$, and
therefore $f$ goes round a large circle when $\omega$ goes round the large semicircle. Hence we see
that the path of integration $C'$ in the $f$-plane is of the kind shown schematically in Fig. 61.
When $a$ is real and positive, as in Fig. 61, in going round $C'$ the argument of the complex
number $f - a$ changes by $2\pi$, and the integral (114.11) is equal to unity. This completes the
proof.

† The magnetic force $e\mathbf v\times\mathbf H/c$ is seen by symmetry to be zero, and in any case is perpendicular to the velocity of
the particle and so does no work on it.

‡ The following argument is analogous to the proof (SP 1, §123) that $\varepsilon(\omega)$ has no zero in the upper half-plane.

Furthermore, it is easy to see that this single root of the equation $f(\omega) - q^2 = 0$ lies on
the imaginary $\omega$-axis; for purely imaginary $\omega$ the function $f(\omega)$, like $\varepsilon(\omega)$, is real and takes
all values from 0 to $\infty$, including $q^2$.

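Let us now return to the integral with respect to $\omega$ in (114.9):

$$\int_{-\infty}^{\infty}\frac{\left(\dfrac{1}{v^2} - \dfrac{\varepsilon}{c^2}\right)\omega\,d\omega}{\varepsilon\left[q^2 - \omega^2\left(\dfrac{\varepsilon}{c^2} - \dfrac{1}{v^2}\right)\right]}.$$

This can be written as the difference between the integral along the contour $C$ and that
along the large semicircle. The latter is $\int d\omega/\omega = i\pi$, and the former is $2\pi i$ times the residue
of the integrand at its only pole. Let $\omega(q)$ be the function defined by the equation

$$\omega^2\left(\frac{\varepsilon(\omega)}{c^2} - \frac{1}{v^2}\right) = q^2. \tag{114.12}$$

Then, since the residue of an expression $f(z)/\phi(z)$ at a pole $z = z_0$ is $f(z_0)/\phi'(z_0)$, the
integral along $C$ is

$$\frac{i\pi q}{\omega\,\varepsilon(\omega)\,dq/d\omega}.$$

Collecting these expressions and substituting in (114.9), we have

$$F = e^2\int_0^{q_0}\left[-\frac{q}{\omega\,\varepsilon(\omega)\,dq/d\omega} + 1\right]q\,dq$$

or, replacing the integration with respect to $q$ in the first term by one with respect to $\omega$,

$$F = e^2\int_{\omega(0)}^{\omega(q_0)}\left[\frac{1}{v^2\varepsilon} - \frac{1}{c^2}\right]\omega\,d\omega + \tfrac12e^2q_0^2. \tag{114.13}$$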


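Large values of $q$ correspond to large absolute values $|\omega|$ of the root of equation (114.12).
Using therefore the expression (113.9) for $\varepsilon(\omega)$, we find

$$\omega^2(q_0) = -v^2\gamma^2\left(q_0^2 + \frac{4\pi Ne^2}{mc^2}\right),$$

where we have put $\gamma = 1/\sqrt{1 - v^2/c^2}$. Substitution in (114.13) gives

$$F = \frac{e^2}{v^2}\int_{\omega(0)}^{ivq_0\gamma}\left(\frac{1}{\varepsilon} - 1\right)\omega\,d\omega - \frac{2\pi Ne^4}{mc^2} - \frac{e^2}{2v^2\gamma^2}\,\omega^2(0); \tag{114.14}$$

in the integral, only the leading term $ivq_0\gamma$ need be retained in $\omega(q_0)$.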

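The integration in (114.14) is over purely imaginary values of $\omega$. We use the real variable
$\omega'' = \omega/i$, with the lower limit $\xi = \omega(0)/i$, and again put $1/\varepsilon = \eta$ (113.6). The required
integral is

$$-\int_\xi^{vq_0\gamma}[\eta(i\omega'') - 1]\,\omega''\,d\omega''.$$

The values of the function $\eta(\omega)$ on the imaginary axis can be expressed in terms of its
imaginary part on the real axis:

$$\eta(i\omega'') - 1 = \frac{2}{\pi}\int_0^\infty\frac{x\,\eta''(x)}{x^2 + \omega''^2}\,dx$$

(cf. (82.15)). Hence the integral is (if we neglect $x$ in comparison with $vq_0\gamma$)

$$\frac{2}{\pi}\int_0^\infty\int_\xi^{vq_0\gamma}\frac{x\,|\eta''(x)|\,\omega''\,d\omega''\,dx}{x^2 + \omega''^2} = \frac{1}{\pi}\int_0^\infty x\,|\eta''(x)|\log\frac{v^2q_0^2\gamma^2}{x^2 + \xi^2}\,dx.$$

We substitute this result in (114.14), and for simplicity put

$$\log\Omega = \tfrac12\,\overline{\log(\omega^2 + \xi^2)}, \tag{114.15}$$

where the bar denotes an averaging with weight $\omega|\eta''(\omega)|$, as in (113.11). Then

$$F(q_0) = \frac{4\pi Ne^4}{mv^2}\log\frac{vq_0\gamma}{\Omega} - \frac{2\pi Ne^4}{mc^2} + \frac{e^2}{2v^2\gamma^2}\,\xi^2. \tag{114.16}$$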

Two cases must be considered in the further examination of this formula. Let us first
suppose that the medium is a dielectric, and that the velocity of the particle satisfies the
condition

$$v^2 < c^2/\varepsilon_0, \tag{114.17}$$

where $\varepsilon_0 = \varepsilon(0)$ is the electrostatic value of the permittivity. On the imaginary axis the
function $\varepsilon(\omega)$ decreases monotonically from $\varepsilon_0 > 1$ for $\omega = 0$ to 1 for $\omega = i\infty$. The
expression on the left-hand side of equation (114.12) therefore increases monotonically
from 0 to $\infty$, and for $q = 0$ (114.12) gives $\omega = 0$. Thus we must put $\xi = 0$ in (114.16); then $\Omega$
becomes the mean atomic frequency $\bar\omega$ (113.11), and

$$F(q_0) = \frac{4\pi Ne^4}{mv^2}\left[\log\frac{vq_0\gamma}{\bar\omega} - \frac{v^2}{2c^2}\right]. \tag{114.18}$$

For $v \ll c$ this formula becomes (113.12), as it should.

The value of $q_0$ is such that $q_0 \ll 1/a$, where $a$ is the order of magnitude of the
interatomic distances (in solids and liquids equal to the dimension of the atoms). In order
to extend the formula to higher values of the transferred momentum and energy, it must be
joined to the formulae of the ordinary theory of collisions, as in §113, but the joining
must now be carried out in two stages. First, using formula (113.13), we enter the range of $q$
corresponding to energy transfers large compared with atomic energies but not yet
relativistic. Formula (114.18) is unchanged in form, but may now involve the $\delta$-electron
energy $\hbar^2q^2/2m$. Calling this $E_1$, we have

$$F(E_1) = \frac{2\pi Ne^4}{mv^2}\left[\log\frac{2mv^2\gamma^2E_1}{\hbar^2\bar\omega^2} - \frac{v^2}{c^2}\right]. \tag{114.19}$$

We can now go on to the relativistic values of $E_1$ by using a formula of relativistic collision
theory, according to which the stopping power with energy transfer between $E'$ and
$E' + dE'$ is

$$(2\pi Ne^4/mv^2)\,dE'/E' \tag{114.20}$$

if $E'$ is small compared with the maximum transfer compatible with the laws of
conservation of momentum and energy for a collision between the fast particle concerned
and a free electron.† Since the integration of (114.20) gives a term in $\log E'$, it is clear that
formula (114.19) is unchanged in form, and it is therefore valid for all $E_1 \ll E_{1,\max}$.
In the retardation of a fast heavy particle (with mass $M \gg m$ and energy $E$ which,
though relativistic, is such that $E \ll M^2c^2/m$), the maximum energy transfer to an electron
is $E_{1,\max} = 2mv^2\gamma^2$ and is still small in comparison with $E$ (see QED, (82.23)). For such
particles the differential expression for the energy lost to free electrons is

$$\frac{2\pi Ne^4}{mv^2}\left(\frac{1}{E'} - \frac{1}{2mc^2\gamma^2}\right)dE'$$

for all $E'$; see QED, (82.24). The energy loss additional to (114.19), with energy transfer
from $E_1$ to $E_{1,\max}$ (with $E_1 \ll E_{1,\max}$) is then

$$\frac{2\pi Ne^4}{mv^2}\left(\log\frac{E_{1,\max}}{E_1} - \frac{E_{1,\max}}{2mc^2\gamma^2}\right) = \frac{2\pi Ne^4}{mv^2}\left(\log\frac{2mv^2\gamma^2}{E_1} - \frac{v^2}{c^2}\right). \tag{114.21}$$

Adding this to (114.19), we find the total stopping power with respect to the heavy particle:

$$F = \frac{4\pi Ne^4}{mv^2}\left(\log\frac{2mv^2\gamma^2}{\hbar\bar\omega} - \frac{v^2}{c^2}\right). \tag{114.22}$$

Formula (114.22) differs from that of the usual theory only in that the “ionization energy”
is $\hbar\bar\omega$; cf. QED, (82.26).

Let us now turn to the second case, namely that where

$$v^2 > c^2/\varepsilon_0, \tag{114.23}$$

which, in particular, always holds for metals, where $\varepsilon(0) = \infty$. The expression
$\omega^2(\varepsilon/c^2 - 1/v^2)$ on the left-hand side of equation (114.12) then has two zeros on the
imaginary $\omega$-axis, one at $\omega = 0$ and the other at $\omega = i\xi$, where $\xi$ is defined by

$$\varepsilon(i\xi) = c^2/v^2. \tag{114.24}$$

In the range from 0 to $i\xi$ the expression $\omega^2(\varepsilon/c^2 - 1/v^2)$ is negative, and for $|\omega| > \xi$ it takes
all positive values from 0 to $\infty$. As $q \to 0$, therefore, the root of equation (114.12) in this case
tends to $i\xi$, which is the value to be substituted in (114.15) and (114.16).

Two limiting cases may be considered. If $\xi$ is small compared with the atomic frequencies
$\omega_0$, then the last term in (114.16) may be neglected, and $\Omega \approx \bar\omega$. Thus we return to formula
(114.18). The opposite limiting case, where $\xi \gg \omega_0$, is of particular interest. Since, for large
$\xi$, the function $\varepsilon(i\xi)$ tends to 1, it is evident from (114.24) that this case corresponds to ultra-
relativistic velocities of the particle. Using formula (113.9) for $\varepsilon(\omega)$, we can write from
equation (114.24)

$$\xi^2 = 4\pi Ne^2v^2\gamma^2/mc^2 \approx 4\pi Ne^2\gamma^2/m.$$

As the velocity of the particle increases, the condition $\xi \gg \omega_0$ is ultimately fulfilled in any
medium, i.e. whatever the electron density $N$ (even in a gas). The velocity required is,
however, the greater, the smaller $N$, i.e. the more rarefied the medium.

† See QED, (81.15) and (82.24). The stopping power $F$ is obtained on multiplying these expressions for the
cross-section by the energy loss $m\Delta$ and by $N$.
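The two cases (114.17) and (114.23) can be illustrated by solving (114.24) numerically. The sketch below assumes an undamped single-oscillator model on the imaginary axis, $\varepsilon(i\xi) = 1 + \omega_p^2/(\omega_0^2 + \xi^2)$ with $\omega_p^2 = 4\pi Ne^2/m$; the model and all names are illustrative assumptions. For this model the root is known in closed form, $\xi^2 = \omega_p^2\beta^2\gamma^2 - \omega_0^2$ with $\beta = v/c$, which reduces to the expression above when $\xi \gg \omega_0$.

```python
import math

def eps_imag_axis(x, w0, wp):
    """Assumed model permittivity on the imaginary axis: eps(i x) = 1 + wp^2/(w0^2 + x^2)."""
    return 1.0 + wp**2 / (w0**2 + x**2)

def solve_xi(beta, w0, wp, tol=1e-12):
    """Solve eps(i xi) = 1/beta^2 (114.24) by bisection; return 0.0 in the case (114.17)."""
    if not 0.0 < beta < 1.0:
        raise ValueError("beta must lie strictly between 0 and 1")
    target = 1.0 / beta**2
    if eps_imag_axis(0.0, w0, wp) <= target:
        return 0.0                      # v^2 < c^2/eps_0: the only root is omega = 0
    lo, hi = 0.0, 1.0
    while eps_imag_axis(hi, w0, wp) > target:
        hi *= 2.0                       # eps decreases monotonically to 1, so this terminates
    while hi - lo > tol * hi:
        mid = 0.5 * (lo + hi)
        if eps_imag_axis(mid, w0, wp) > target:
            lo = mid                    # root lies at larger xi
        else:
            hi = mid
    return 0.5 * (lo + hi)

xi_slow = solve_xi(beta=0.5, w0=1.0, wp=0.5)     # eps_0 = 1.25 < 4, first case
xi_fast = solve_xi(beta=0.999, w0=1.0, wp=0.5)   # ultra-relativistic, second case
gamma2 = 1.0 / (1.0 - 0.999**2)
print(xi_slow, xi_fast, math.sqrt(0.5**2 * 0.999**2 * gamma2 - 1.0))
```

The last two printed numbers coincide, and lowering `wp` (a more rarefied medium) raises the velocity at which a non-zero root first appears, in line with the statement above.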

From (114.15) we then have simply $\Omega = \xi$. Putting also $v \approx c$, we find that the last two
terms in (114.16) cancel, leaving

$$F(q_0) = (2\pi Ne^4/mc^2)\log(mc^2q_0^2/4\pi Ne^2).$$

Extending this formula, in the same manner as above, to large values of the momentum and
energy transfer, we find the following expression for the energy loss of an ultra-relativistic
particle with an energy transfer not exceeding $E_1$ ($\ll E_{1,\max}$):

$$F(E_1) = (2\pi Ne^4/mc^2)\log(m^2c^2E_1/2\pi Ne^2\hbar^2). \tag{114.25}$$

This result is considerably different from that obtained in the ordinary theory, which
neglects the polarization of the medium. According to that theory (see QED, §82), in the
ultra-relativistic range the stopping power $F(E_1)$ continues to increase (though only
logarithmically) with the energy of the particle:†

$$F(E_1) = \frac{2\pi Ne^4}{mc^2}\left[\log\frac{2mc^2\gamma^2E_1}{I^2} - 1\right].$$

The polarization of the medium results in a screening of the charge, and the increase in the
losses is thereby finally stopped; it tends to the constant value (independent of $\gamma$) given by
formula (114.25).

For heavy particles a formula can also be derived for the total stopping power with any
energy transfer up to $E_{1,\max}$ (if the latter is small compared with the energy of the particle
itself). Again using the expression (114.21), in which we can now put $v = c$, we find

$$F = \frac{2\pi Ne^4}{mc^2}\left[\log\frac{m^3c^4\gamma^2}{\pi Ne^2\hbar^2} - 1\right]. \tag{114.26}$$

We see that the total stopping power continues to increase with the velocity of the particle,
owing to close collisions with a large energy transfer, for which the polarization of the
medium has no screening effect. This increase, however, is rather slower than that given by
the theory when the polarization is neglected. According to that theory,

$$F = \frac{4\pi Ne^4}{mc^2}\left[\log\frac{2mc^2\gamma^2}{I} - 1\right];$$

see QED, (82.28). The coefficient of the $\log\gamma$ term here is twice that in (114.26).

It may also be noted that the presence of the electron density $N$ in the argument of the
logarithm in formulae (114.25) and (114.26) results in the following property of energy
losses of ultra-relativistic particles: when such a particle passes through layers of different
substances containing the same number of electrons per unit surface area, the losses are
smaller in media with larger $N$.

† This formula is obtained by adding QED (82.20) and (82.25), with $E_1$ for $m\Delta_{\max}$ in the latter. For a small
energy transfer the formulae apply to both fast electrons and fast heavy particles.
§115. Cherenkov radiation

A charged particle moving in a transparent medium emits, in certain circumstances, an 
unusual type of radiation, first observed by P. A. Cherenkov and S. I. Vavilov, and 
theoretically interpreted by I. E. Tamm and I. M. Frank (1937). It must be emphasized that 
this radiation is entirely unrelated to the bremsstrahlung which is almost always emitted by 
a rapidly moving electron. The latter radiation is emitted by the moving electron itself 
when it collides with atoms. The Cherenkov effect, however, involves radiation emitted by 
the medium under the action of the field of the particle moving in it. The distinction 
between the two types of radiation appears with particular clarity when the particle has a 
very large mass: the bremsstrahlung disappears, but the Cherenkov radiation is unaffected. 

The wave number and frequency of an electromagnetic wave propagated in a
transparent medium are related by $k = n\omega/c$, where $n = \sqrt{\varepsilon}$ is the refractive index, which is
real. We again suppose the medium isotropic and non-magnetic. We have seen that the
frequency of the Fourier component of the field of a particle moving uniformly in the
$x$-direction in a medium is related to the $x$-component of the wave vector by $\omega = k_xv$. If
this component is a freely propagated wave, these two relations must be consistent. Since
$k > k_x$, it follows that we must have

$$v > c/n(\omega). \tag{115.1}$$

Thus radiation of frequency $\omega$ occurs if the velocity of the particle exceeds the phase
velocity of waves of that frequency in the medium concerned.†

Let $\theta$ be the angle between the direction of motion of the particle and the direction of
emission. We have $k_x = k\cos\theta = (n\omega/c)\cos\theta$ and, since $k_x = \omega/v$, we find that

$$\cos\theta = c/nv. \tag{115.2}$$

Thus a definite value of the angle $\theta$ corresponds to radiation of a given frequency. That is,
the radiation of each frequency is emitted forwards, and is distributed over the surface of a
cone with vertical angle $2\theta$, where $\theta$ is given by (115.2). The distributions of the radiation in
angle and in frequency are thus related in a definite manner.
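The threshold condition (115.1) and the angle relation (115.2) translate directly into a few lines of code. The sketch below is illustrative only: the function name and the sample refractive index are assumptions, and the velocity is given as $\beta = v/c$.

```python
import math

def cherenkov_angle(n, beta):
    """Return theta in radians from cos(theta) = c/(n v), or None if v <= c/n (no emission)."""
    x = 1.0 / (n * beta)
    if x >= 1.0:
        return None          # condition (115.1) is not satisfied
    return math.acos(x)

theta = cherenkov_angle(n=1.33, beta=0.99)      # hypothetical medium with n = 1.33
print(math.degrees(theta))
print(cherenkov_angle(n=1.33, beta=0.70))       # below threshold: None
```

Since $n$ depends on $\omega$, calling the function with $n(\omega)$ for each frequency gives the one-to-one correspondence between frequency and emission angle described above.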

The emission of electromagnetic waves, if it occurs, involves a loss of energy by the
moving particle. This loss forms part, though a small part, of the total losses calculated in
§114. (The bremsstrahlung is not included therein.) In this sense the term “ionization
losses” is not quite accurate. We shall now find the corresponding part of the total losses,
and thus determine the intensity of the Cherenkov radiation.

According to (114.9), the energy loss in the frequency interval $d\omega$ is

$$dF = -d\omega\,\frac{ie^2}{\pi}\sum\omega\left(\frac{1}{c^2} - \frac{1}{\varepsilon v^2}\right)\int\frac{q\,dq}{q^2 - \omega^2\left(\dfrac{\varepsilon}{c^2} - \dfrac{1}{v^2}\right)},$$

where the summation is over terms with $\omega = \pm|\omega|$. We introduce as a new
variable

$$\xi = q^2 - \omega^2\left(\frac{\varepsilon}{c^2} - \frac{1}{v^2}\right).$$

Then

$$dF = -d\omega\,\frac{ie^2}{2\pi}\sum\omega\left(\frac{1}{c^2} - \frac{1}{\varepsilon v^2}\right)\int\frac{d\xi}{\xi}.$$

† The problem of radiation from an electron moving uniformly in a vacuum at a velocity $v > c$ was discussed
by A. Sommerfeld (1904) before the theory of relativity became known.
In integrating along the real $\xi$-axis we must pass round the singular point $\xi = 0$ (for which
$q^2 + k_x^2 = k^2$) in some manner, which is determined by the fact that, although we suppose
$\varepsilon(\omega)$ real (the medium being transparent), it actually has a small imaginary part, which is
positive for $\omega > 0$ and negative for $\omega < 0$. Accordingly, $\xi$ has a small negative or positive
imaginary part, and the path of integration ought to pass below or above the real axis
respectively. This means that, when the path of integration is displaced to the real axis, we
must pass below or above the singular point respectively. This gives a contribution to $dF$,
and the real parts cancel in the sum. Indenting the path of integration with infinitesimal
semicircles, we find

$$\sum\omega\int\frac{d\xi}{\xi} = |\omega|\left\{\int_{\text{below}}\frac{d\xi}{\xi} - \int_{\text{above}}\frac{d\xi}{\xi}\right\} = 2i\pi|\omega|.$$

Thus the final formula is

$$dF = \frac{e^2}{c^2}\left(1 - \frac{c^2}{v^2n^2}\right)\omega\,d\omega, \tag{115.3}$$

which gives the intensity of the radiation in a frequency interval $d\omega$. According to (115.2),
this radiation is emitted in an angle interval

$$d\theta = \frac{c}{vn^2\sin\theta}\,\frac{dn}{d\omega}\,d\omega. \tag{115.4}$$

The total intensity of the radiation is obtained by integrating (115.3) over all frequencies 
for which the medium is transparent. 
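As an illustration of this integration, the sketch below integrates (115.3) numerically over a transparent band and compares the result with the closed form that holds for a frequency-independent refractive index, $\tfrac12(e^2/c^2)(1 - 1/\beta^2n^2)(\omega_2^2 - \omega_1^2)$. The function names, the units $e = c = 1$ and the sample values are illustrative assumptions.

```python
import math

def dF_domega(w, n, beta, e=1.0, c=1.0):
    """(115.3): energy radiated per unit path and unit frequency; zero below threshold."""
    factor = 1.0 - 1.0 / (beta * n) ** 2
    return (e**2 / c**2) * factor * w if factor > 0.0 else 0.0

def total_intensity(w1, w2, n_of_w, beta, steps=10_000):
    """Midpoint-rule integral of (115.3) over the band w1..w2 for a given n(omega)."""
    h = (w2 - w1) / steps
    total = 0.0
    for i in range(steps):
        w = w1 + (i + 0.5) * h
        total += dF_domega(w, n_of_w(w), beta) * h
    return total

beta, n_const, w1, w2 = 0.9, 1.5, 1.0, 3.0     # hypothetical values
numeric = total_intensity(w1, w2, lambda w: n_const, beta)
closed = 0.5 * (1.0 - 1.0 / (beta * n_const) ** 2) * (w2**2 - w1**2)
print(numeric, closed)
```

Passing a dispersive `n_of_w` instead of a constant restricts the emission automatically to those frequencies for which (115.1) holds, since the integrand is set to zero elsewhere.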

It is easy to determine the polarization of the Cherenkov radiation. As we see from
(114.7) the vector potential of the radiation field is parallel to the velocity $\mathbf v$. The magnetic
field $\mathbf H_{\mathbf k} = i\mathbf k\times\mathbf A_{\mathbf k}$ is therefore perpendicular to the plane containing $\mathbf v$ and the ray
direction $\mathbf k$. The electric field (in the “wave region”) is perpendicular to the magnetic field,
and therefore lies in that plane.


PROBLEM 


Find the cone of Cherenkov radiation wave vectors for a particle moving uniformly in a uniaxial non-magnetic
crystal: (a) along the optical axis, (b) at right angles to the optical axis (V. L. Ginzburg, 1940).

Solution. (a) When a charge moves in a uniaxial crystal, the Cherenkov radiation is in general on two cones
corresponding to the ordinary and extraordinary waves. In motion along the optical axis, however, the ordinary
wave is not emitted, even though a condition such as (115.1) may be satisfied: this wave always has linear
polarization with the vector $\mathbf E$ perpendicular to the principal cross-section (that is, the plane through the optical
axis, which we take as the $z$-axis, and the direction of any given $\mathbf k$), and the emission of such a wave in the case
concerned is evidently impossible, since the work $e\mathbf E\cdot\mathbf v = 0$ and the particle does not lose energy. The
extraordinary radiation cone is found by substituting in (98.5) the value of $n$ from (115.2), which is valid even if the
medium is not isotropic; in the present case, the angle $\theta$ between $\mathbf k$ and $\mathbf v$ is the same as the angle between $\mathbf k$ and the
optical axis. The result is

$$\tan^2\theta = (\varepsilon_\parallel/\varepsilon_\perp)\left[(v^2/c^2)\,\varepsilon_\perp - 1\right],$$

and we must have $v > c/\sqrt{\varepsilon_\perp}$. This is a circular cone on which the intensity distribution is uniform over the
generators (as is in any case obvious from symmetry). The vertical angle $2\theta'$ of the ray vector cone is related to $\theta$ by
$\tan\theta' = (\varepsilon_\perp/\varepsilon_\parallel)\tan\theta$.

(b) In this case, there are two Cherenkov cones. We take the direction of $\mathbf v$ as the $x$-axis, and the optical axis as
the $z$-axis; $\theta$ is the angle between $\mathbf k$ and the $x$-axis, and $\phi$ the azimuth of $\mathbf k$ measured from the $xy$-plane (Fig. 62).
The cone angle for the ordinary waves is given by

$$\cos\theta = c/v\sqrt{\varepsilon_\perp},$$

and we must have $v > c/\sqrt{\varepsilon_\perp}$. This is a circular cone, but the radiation intensity depends on the azimuth†: in
particular, there is no radiation in the $xz$-plane ($\phi = \tfrac12\pi$), since $\mathbf v\cdot\mathbf E = 0$. The extraordinary wave cone is not
circular; its vertical angle depends on $\phi$:

$$\cos^2\theta = \frac{\varepsilon_\perp + (\varepsilon_\parallel - \varepsilon_\perp)\sin^2\phi}{(\varepsilon_\parallel - \varepsilon_\perp)\sin^2\phi + \varepsilon_\perp\varepsilon_\parallel v^2/c^2},$$

and we must have $v > c/\sqrt{\varepsilon_\parallel}$. The extraordinary radiation is polarized with the vector $\mathbf D$ in the principal cross-
section, perpendicular to $\mathbf k$. If $\mathbf k$ lies in the $xy$-plane ($\phi = 0$), then $\mathbf D$ and also $\mathbf E$ are in the $z$-direction: here
$\mathbf v\cdot\mathbf E = 0$, so that the intensity of the extraordinary radiation is zero in the $xy$-plane.
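The extraordinary-cone formula of part (b) can be checked against the dispersion relation from which it is derived. The sketch below assumes that relation in the form $1/n^2 = \sin^2\vartheta/\varepsilon_\parallel + \cos^2\vartheta/\varepsilon_\perp$, with $\vartheta$ the angle between $\mathbf k$ and the optical axis, together with $n = c/(v\cos\theta)$ from (115.2) and $\cos\vartheta = \sin\theta\sin\phi$ for the geometry of Fig. 62. The function names and the permittivity values are illustrative assumptions.

```python
import math

def extraordinary_cos2(phi, eps_par, eps_perp, beta):
    """cos^2(theta) on the extraordinary cone for v at right angles to the optical axis."""
    d = (eps_par - eps_perp) * math.sin(phi) ** 2
    return (eps_perp + d) / (d + eps_perp * eps_par * beta**2)

def dispersion_residual(theta, phi, eps_par, eps_perp, beta):
    """1/n^2 minus the assumed dispersion expression; vanishes on the cone."""
    cos_a = math.sin(theta) * math.sin(phi)        # cosine of angle between k and z-axis
    inv_n2 = (beta * math.cos(theta)) ** 2         # 1/n^2 with n = 1/(beta cos theta)
    return inv_n2 - ((1.0 - cos_a**2) / eps_par + cos_a**2 / eps_perp)

eps_par, eps_perp, beta = 2.5, 2.2, 0.9            # hypothetical crystal, v > c/sqrt(eps_par)
residuals = []
for phi in (0.0, 0.3, 0.8, math.pi / 2):
    theta = math.acos(math.sqrt(extraordinary_cos2(phi, eps_par, eps_perp, beta)))
    residuals.append(dispersion_residual(theta, phi, eps_par, eps_perp, beta))
print(residuals)
```

All residuals vanish to rounding error, which confirms that the quoted expression for $\cos^2\theta$ satisfies the assumed dispersion relation at every azimuth.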


§116. Transition radiation 

Cherenkov radiation has the property that it occurs with uniform motion of a charged 
particle (whereas a charge moving uniformly in a vacuum does not radiate). A different 
class of phenomena that are similar in this respect is represented by transition radiation, 
which occurs with uniform motion of a charged particle in a spatially inhomogeneous 
medium, for example when passing from one medium to another (V. L. Ginzburg and 
I. M. Frank, 1945). It differs in principle from the Cherenkov radiation, in that it occurs for 
any velocity of the particle, not necessarily exceeding the phase velocity of light in the 
medium. Like the Cherenkov radiation, it is unrelated to the bremsstrahlung which also 
occurs when charged particles are incident on a surface separating two media, and the 
distinction is particularly clear in the limit of a particle with infinite mass, for which the 
bremsstrahlung is zero but the transition radiation is not. 

Let us consider the transition radiation when a charged particle passes (with constant 
velocity v) across the boundary between a vacuum and a dielectric (non-magnetic) medium 
with complex permittivity e. The motion takes place along the x-axis, at right angles to the 
interface plane (x = 0; Fig. 63). 


t The determination of the intensity distribution would need a calculation of the stopping power, similar to 
that given in §114 for an isotropic medium. These calculations and certain other topics relating to Cherenkov 
radiation are described in the review articles by B. M. Bolotovskiĭ, Uspekhi fizicheskikh nauk 62, 201, 1957; Soviet
Physics Uspekhi 4, 781, 1962.
The electromagnetic field is determined by the equations (114.1)-(114.3) or by the
equivalent ones (114.6). All quantities in these equations can be expanded as Fourier
integrals with respect to the time and the coordinates y and z, for which the medium is
homogeneous:

$$\mathbf{E} = \int \mathbf{E}_{\omega\mathbf{q}}(x)\, e^{i(\mathbf{q}\cdot\mathbf{r} - \omega t)}\, \frac{d\omega\, d^2q}{(2\pi)^3}, \tag{116.1}$$

etc., where q is the wave vector in the yz-plane. In each of the two half-spaces, we seek the
field as the sum of a particular solution of the inhomogeneous equations (114.6) (the field
of the charge, denoted by the index (e)), and the general solution of the homogeneous
equations (114.6) with zero on the right (the free radiation field, denoted by the index (r)).
The former is given by formulae analogous to (114.7):


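$$\phi^{(e)}_{\omega\mathbf{q}} = \frac{4\pi e}{\varepsilon v}\, e^{i\omega x/v} \left[ q^2 + \frac{\omega^2}{v^2} - \frac{\varepsilon\omega^2}{c^2} \right]^{-1}, \qquad \mathbf{A}^{(e)}_{\omega\mathbf{q}} = \frac{\varepsilon\mathbf{v}}{c}\,\phi^{(e)}_{\omega\mathbf{q}}. \tag{116.2}$$

The electric field is therefore

$$\mathbf{E}^{(e)}_{\omega\mathbf{q}} = i\phi^{(e)}_{\omega\mathbf{q}} \left[ \omega\left( \frac{\varepsilon}{c^2} - \frac{1}{v^2} \right)\mathbf{v} - \mathbf{q} \right]; \tag{116.3}$$

the expression for $\mathbf{H}^{(e)}_{\omega\mathbf{q}}$ will not be needed, and will not be written out.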

The second part of the solution, corresponding to the free radiation field, may be put
in the form of the electric field immediately. The longitudinal component of $\mathbf{E}^{(r)}_{\omega\mathbf{q}}$ is
$E^{(r)}_{x\,\omega\mathbf{q}} = i a\, e^{\pm i k_x x}$ with a coefficient a as yet unknown; then

$$k_x^2 + q^2 = \varepsilon\omega^2/c^2. \tag{116.4}$$

The transverse component, which must obviously be along q, the only distinctive direction
in this plane, is then determined by the equation div D = 0, i.e. $\varepsilon(\mathbf{q} \pm k_x\mathbf{n})\cdot\mathbf{E} = 0$ (where
n is a unit vector along the x-axis):

$$\mathbf{E}^{(r)}_{\omega\mathbf{q}} = i a\, e^{\pm i k_x x} \left( \mathbf{n} \mp \frac{k_x}{q^2}\,\mathbf{q} \right). \tag{116.5}$$

The plus and minus signs in the exponent refer to the half-spaces x > 0 and x < 0
respectively: the waves are propagated away from the interface.†


† In equations (116.2)-(116.5), $\varepsilon$ is to be taken as $\varepsilon_1 = 1$ in x < 0 and as $\varepsilon_2 = \varepsilon$ in x > 0. In the
formulae below, $\varepsilon$ everywhere means $\varepsilon_2$.
The constants $a_1$ and $a_2$ in the two half-spaces are determined from the conditions of
continuity at the interface for the normal component $\varepsilon\mathbf{n}\cdot\mathbf{E}_{\omega\mathbf{q}}$ of the electric induction and
the tangential component $\mathbf{q}\cdot\mathbf{E}_{\omega\mathbf{q}}$ of the electric field; the continuity of the magnetic field
provides no further information, of course. The result for the coefficient in the vacuum is

$$a_1 = \frac{4\pi e\beta\kappa^2(\varepsilon - 1)\left[ 1 - \beta^2 + \beta\sqrt{\varepsilon - \kappa^2} \right]}{\omega(1 - \beta^2 + \beta^2\kappa^2)\left[ 1 + \beta\sqrt{\varepsilon - \kappa^2} \right]\left[ \sqrt{\varepsilon - \kappa^2} + \varepsilon\sqrt{1 - \kappa^2} \right]}, \tag{116.6}$$

where $\beta = v/c$, $\kappa = qc/\omega$.

Let us now calculate the total energy radiated by the particle into the vacuum, i.e.
backwards relative to its motion. A simple way of doing this is to consider the emitted
wave-train for long times t, when it is already at a great distance to the left; the radiation
field is then separated from the intrinsic field of the charge. The energy is found by
integrating the radiation field energy density over all space. If the origin is moved along the
axis into the region of the wave-train, the integration with respect to x can be taken from
$-\infty$ to $\infty$, because the field is attenuated in both directions.

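In the wave region, the electric and magnetic energy densities are equal. Hence

$$U_1 = \frac{1}{4\pi} \int \mathbf{E}^2\, dx\, dy\, dz.$$

Substituting E as the expansion (116.1), we write the square of the integral as a double
integral:

$$\mathbf{E}^2(t, \mathbf{r}) = \int \mathbf{E}_{\omega\mathbf{q}}(x)\cdot\mathbf{E}^*_{\omega'\mathbf{q}'}(x)\, \exp\{ i[\mathbf{r}\cdot(\mathbf{q} - \mathbf{q}') - t(\omega - \omega')] \}\, \frac{d\omega\, d\omega'\, d^2q\, d^2q'}{(2\pi)^6}.$$

Integration over dy dz gives the delta function $(2\pi)^2\delta(\mathbf{q} - \mathbf{q}')$, which is then eliminated by
the integration over $d^2q'$. Thus

$$U_1 = \frac{1}{4\pi} \int \mathbf{E}_{\omega\mathbf{q}}(x)\cdot\mathbf{E}^*_{\omega'\mathbf{q}}(x)\, e^{-i(\omega - \omega')t}\, dx\, \frac{d\omega\, d\omega'\, d^2q}{(2\pi)^4}. \tag{116.7}$$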

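Substituting (116.5) (with $\varepsilon = 1$, $a = a_1$) for $\mathbf{E}_{\omega\mathbf{q}}$, and carrying out the integration with
respect to x, we find

$$U_1 = \frac{1}{2} \int a_1(\omega, \mathbf{q})\, a_1^*(\omega', \mathbf{q})\, \frac{\omega^2}{c^2q^2}\, \delta(k_x - k_x')\, \frac{d\omega\, d\omega'\, d^2q}{(2\pi)^4};$$

here we have also used the fact that, because of the delta function, $\omega = \omega'$; for the same
reason, the phase factors which arise in $a_1$ when the origin is moved in (116.5) disappear in
the product $a_1(\omega, \mathbf{q})a_1^*(\omega', \mathbf{q})$. The integration with respect to $\omega$ and $\omega'$ is taken from $-\infty$
to $\infty$; eliminating the delta function by the second of these leaves

$$U_1 = \int_0^\infty\!\!\int |a_1(\omega, \mathbf{q})|^2\, \frac{\omega k_x}{q^2}\, \frac{d\omega\, d^2q}{(2\pi)^4}; \tag{116.8}$$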
since the integrand is an even function of $\omega$, the integral with respect to $\omega$ has been written
as twice the integral from 0 to $\infty$.

The integration over $d^2q$ is to be taken over the range $q^2 < \omega^2/c^2$ in which $k_x$ is real, so
that the field (116.5) actually represents the wave propagated.† We use the angle $\theta$ between
the radiation wave vector $\mathbf{k} = (k_x, \mathbf{q})$ and the direction of the vector -v, so that $\theta = 0$
corresponds to radiation in the direction exactly opposite to the motion of the particle.
Then $q = (\omega/c)\sin\theta$, and a change from integration over $d^2q$ to one over $2\pi q\,dq$
$= (2\pi\omega^2/c^2)\sin\theta\cos\theta\,d\theta$ gives

$$U_1 = \frac{1}{(2\pi)^3 c} \int_0^\infty\!\!\int_0^{\pi/2} |a_1|^2\, \omega^2\, \frac{\cos^2\theta}{\sin\theta}\, d\theta\, d\omega \equiv \int_0^\infty\!\!\int_0^{\pi/2} I_1(\omega, \theta)\cdot 2\pi\sin\theta\, d\theta\, d\omega.$$

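The function $I_1(\omega, \theta)$ gives the distribution of the radiation in frequency and angle.
With $a_1$ from (116.6), we have finally

$$I_1(\omega, \theta) = \frac{e^2\beta^2\sin^2\theta\cos^2\theta}{\pi^2 c\,(1 - \beta^2\cos^2\theta)^2} \left| \frac{(\varepsilon - 1)\left[ 1 - \beta^2 + \beta\sqrt{\varepsilon - \sin^2\theta} \right]}{\left[ 1 + \beta\sqrt{\varepsilon - \sin^2\theta} \right]\left[ \varepsilon\cos\theta + \sqrt{\varepsilon - \sin^2\theta} \right]} \right|^2 \tag{116.9}$$

(V. L. Ginzburg and I. M. Frank, 1945). The transition radiation is linearly polarized, and
the electric vector is seen from (116.5) to be coplanar with k and v. For non-relativistic
velocities, the radiation intensity is proportional to $v^2$, i.e. to the energy of the particle.

At an interface with an ideal conductor ($\varepsilon = \infty$), formula (116.9) becomes

$$I_1(\omega, \theta) = \frac{e^2\beta^2\sin^2\theta}{\pi^2 c\,(1 - \beta^2\cos^2\theta)^2}.$$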
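As a numerical illustration, formula (116.9) can be evaluated directly and compared with its ideal-conductor limit. The following is only a sketch: the function names are hypothetical, the intensity is expressed in units of $e^2/\pi^2 c$, and the principal branch of the complex square root is assumed to be the appropriate one for an absorbing medium.

```python
import cmath
import math


def backward_intensity(theta, beta, eps):
    """Formula (116.9) in units of e^2 / (pi^2 c).

    theta: angle from the direction of -v (radians); beta = v/c;
    eps: permittivity of the medium (real or complex).
    """
    s2 = math.sin(theta) ** 2
    c = math.cos(theta)
    # sqrt(eps - sin^2 theta); principal branch (an assumption of this sketch)
    root = cmath.sqrt(eps - s2)
    prefactor = beta**2 * s2 * c**2 / (1.0 - beta**2 * c**2) ** 2
    amplitude = (eps - 1.0) * (1.0 - beta**2 + beta * root) / (
        (1.0 + beta * root) * (eps * c + root)
    )
    return prefactor * abs(amplitude) ** 2


def conductor_intensity(theta, beta):
    """Ideal-conductor limit (eps -> infinity) of (116.9), same units."""
    return beta**2 * math.sin(theta) ** 2 / (1.0 - beta**2 * math.cos(theta) ** 2) ** 2
```

For a very large permittivity, `backward_intensity` approaches `conductor_intensity`, and for $\beta \ll 1$ doubling $\beta$ multiplies the result by very nearly four, in accordance with the proportionality to $v^2$.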

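In the ultra-relativistic case ($\beta \approx 1$), the radiation has a maximum at small angles
$\theta \sim \sqrt{1 - \beta^2}$; formula (116.9) then becomes

$$I_1(\omega, \theta) = \frac{e^2}{\pi^2 c} \left| \frac{\sqrt{\varepsilon} - 1}{\sqrt{\varepsilon} + 1} \right|^2 \frac{\theta^2}{[\theta^2 + (1 - \beta^2)]^2}.$$

This gives as the frequency distribution of the total radiation, with logarithmic accuracy,

$$I_1(\omega) = \int I_1(\omega, \theta)\, 2\pi\theta\, d\theta = \frac{e^2}{\pi c} \left| \frac{\sqrt{\varepsilon} - 1}{\sqrt{\varepsilon} + 1} \right|^2 \log\frac{1}{1 - \beta^2}. \tag{116.10}$$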
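The meaning of "logarithmic accuracy" in (116.10) can be checked numerically. The sketch below integrates the angular factor $\theta^2/(\theta^2 + \delta)^2$, with $\delta = 1 - \beta^2$, over $2\pi\theta\,d\theta$ up to a cut-off angle and compares it with the elementary closed form and with the leading term $\pi\log(1/\delta)$. The cut-off `theta_max` and the function names are assumptions made for illustration only.

```python
import math


def angular_integral_numeric(delta, theta_max=1.0, steps=200_000):
    """Midpoint-rule value of the integral of theta^2/(theta^2+delta)^2 * 2*pi*theta."""
    h = theta_max / steps
    total = 0.0
    for i in range(steps):
        t = (i + 0.5) * h
        total += t**3 / (t * t + delta) ** 2
    return 2.0 * math.pi * h * total


def angular_integral_closed(delta, theta_max=1.0):
    """The same integral in closed form (substitute u = theta^2)."""
    u = theta_max**2
    return math.pi * (math.log((u + delta) / delta) + delta / (u + delta) - 1.0)


def leading_log(delta):
    """Leading logarithmic term, pi * log(1/delta), as used in (116.10)."""
    return math.pi * math.log(1.0 / delta)
```

The closed form differs from the leading term only by a quantity of order unity inside the bracket, which is negligible in comparison with the large logarithm when $\delta$ is sufficiently small.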


† When $q^2 > \omega^2/c^2$, the exponential factors in (116.5) must be written as $\exp[\mp x\sqrt{q^2 - \omega^2/c^2}]$,
corresponding to surface waves attenuated away from the interface. Such waves are necessarily present with
transition radiation; they will not be discussed here.
Using, for sufficiently high frequencies, the limiting expression (78.1) for the permittivity,

$$\varepsilon = 1 - \omega_0^2/\omega^2, \qquad \omega_0^2 = 4\pi Ne^2/m, \tag{116.11}$$

we find that for $\omega \gg \omega_0$ the intensity decreases as $1/\omega^4$. Thus the frequencies $\omega \lesssim \omega_0$
do in fact give the main contribution to the transition radiation (in the backward
direction).
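The $1/\omega^4$ decrease can be seen by evaluating the factor $|(\sqrt{\varepsilon} - 1)/(\sqrt{\varepsilon} + 1)|^2$ of (116.10) with the permittivity (116.11). The sketch below does this with frequencies measured in units of $\omega_0$. The function names are hypothetical, and applying (116.11) below $\omega_0$ is an extrapolation made purely for illustration, since that expression is a high-frequency limit.

```python
import cmath


def reflection_factor(omega, omega0=1.0):
    """|(sqrt(eps) - 1)/(sqrt(eps) + 1)|^2 with eps = 1 - omega0^2/omega^2."""
    eps = 1.0 - (omega0 / omega) ** 2
    root = cmath.sqrt(eps)  # purely imaginary when eps < 0
    return abs((root - 1.0) / (root + 1.0)) ** 2


def high_frequency_asymptote(omega, omega0=1.0):
    """Leading term for omega >> omega0: (omega0 / 2 omega)^4."""
    return (omega0 / (2.0 * omega)) ** 4
```

For $\omega \gg \omega_0$ the two functions agree closely, while for negative $\varepsilon$ the factor is equal to unity.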

Lastly, let us consider the transition radiation when a particle goes from a medium into a
vacuum. This problem differs from the previous one only by a change in the sign of the
velocity v. The differential intensity of radiation into the vacuum when a charged particle
leaves the medium is therefore given by an expression derived from (116.9) by putting -v
for v; $\theta$ is now the angle between the directions of k and v (and so $\theta = 0$ corresponds to
radiation in exactly the direction of motion of the particle):†

$$I_1(\omega, \theta) = \frac{e^2\beta^2\sin^2\theta\cos^2\theta}{\pi^2 c\,(1 - \beta^2\cos^2\theta)^2} \left| \frac{(\varepsilon - 1)\left[ 1 - \beta^2 - \beta\sqrt{\varepsilon - \sin^2\theta} \right]}{\left[ 1 - \beta\sqrt{\varepsilon - \sin^2\theta} \right]\left[ \varepsilon\cos\theta + \sqrt{\varepsilon - \sin^2\theta} \right]} \right|^2. \tag{116.12}$$


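In the ultra-relativistic case, the radiation has a maximum intensity at small angles
$\theta \sim \sqrt{1 - \beta^2}$. Formula (116.12) then becomes

$$I_1(\omega, \theta) = \frac{e^2\theta^2}{\pi^2 c}\, \frac{|\sqrt{\varepsilon} - 1|^2}{(1 - \beta^2 + \theta^2)^2\, |1 - \beta\sqrt{\varepsilon - \theta^2}|^2}. \tag{116.13}$$

If $\varepsilon$ is not too close to unity, the last factor in the denominator can be replaced by
$|1 - \sqrt{\varepsilon}|^2$, and the frequency distribution is, with logarithmic accuracy,

$$I_1(\omega) = \frac{e^2}{\pi c} \log\frac{1}{1 - \beta^2}.$$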


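For high frequencies, we again use (116.11) and put, in (116.13),

$$1 - \beta\sqrt{\varepsilon - \theta^2} \approx \tfrac{1}{2}\left( 1 - \beta^2 + \theta^2 + \omega_0^2/\omega^2 \right), \qquad \sqrt{\varepsilon} - 1 \approx -\omega_0^2/2\omega^2.$$

The integration over angles gives

$$I_1(\omega) = \frac{e^2}{\pi c} \log\frac{\omega_0^2}{\omega^2(1 - \beta^2)} \quad \text{for} \quad \omega_0 \ll \omega \ll \omega_0/\sqrt{1 - \beta^2},$$

$$I_1(\omega) = \frac{e^2}{6\pi c} \left( \frac{\omega_0}{\omega\sqrt{1 - \beta^2}} \right)^4 \quad \text{for} \quad \omega \gg \omega_0/\sqrt{1 - \beta^2}.$$

These expressions show that the main contribution to the forward radiation comes from
high frequencies, $\omega \sim \omega_0/\sqrt{1 - \beta^2}$ (G. M. Garibyan, 1959). The radiation energy
integrated over all frequencies is then proportional to the particle energy:‡

$$U_1 = \frac{e^2\omega_0}{3c\sqrt{1 - \beta^2}}. \tag{116.14}$$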
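To get a feeling for the magnitude of (116.14), it may be rewritten as $U_1 = \tfrac{1}{3}\alpha\,\hbar\omega_0\gamma$, where $\alpha = e^2/\hbar c$ is the fine-structure constant and $\gamma = 1/\sqrt{1 - \beta^2}$. The sketch below evaluates this for a particle of unit charge e. The function names and the sample values (a plasma energy $\hbar\omega_0$ of 30 eV and $\gamma = 10^4$) are assumptions chosen only for illustration.

```python
ALPHA = 1.0 / 137.035999  # fine-structure constant e^2 / (hbar c)


def forward_energy_ev(gamma, plasma_energy_ev):
    """Total forward transition-radiation energy (116.14), in eV.

    Uses U = alpha * (hbar * omega_0) * gamma / 3 for a particle of charge e.
    """
    return ALPHA * plasma_energy_ev * gamma / 3.0


def characteristic_photon_energy_ev(gamma, plasma_energy_ev):
    """Photon energy hbar*omega_0*gamma, near which the spectrum is concentrated."""
    return gamma * plasma_energy_ev


energy = forward_energy_ev(1.0e4, 30.0)            # about 730 eV per interface
photon = characteristic_photon_energy_ev(1.0e4, 30.0)  # 3e5 eV
```

With these sample values the energy radiated per interface is of the order of that of a single quantum divided by a few hundred, so that on average much less than one such quantum is emitted at each crossing.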

† For a transparent medium ($\varepsilon$ may be taken as real), we consider only velocities $v < c/\sqrt{\varepsilon}$. Otherwise, there
would be the question of separating the contribution of the Cherenkov radiation emitted forwards by the particle
in the medium and passing through the interface into the vacuum. This process corresponds to the pole of the
intensity (116.12) where $\beta\sqrt{\varepsilon - \sin^2\theta} = 1$. The angle $\theta$ given by this equation is just the angle of emergence of
the Cherenkov cone ray in the medium after refraction at the interface.

‡ A more detailed account of topics related to transition radiation is to be found in the review articles by F. G.
Bass and V. M. Yakovenko, Soviet Physics Uspekhi 8, 420, 1965; V. L. Ginzburg and V. N. Tsytovich, Physics
Reports 49, 1, 1979.
SCATTERING OF ELECTROMAGNETIC WAVES


§117. The general theory of scattering in isotropic media 

In the theory of propagation of electromagnetic waves in transparent media discussed in
previous chapters, a phenomenon has been neglected which, though not prominent, is of
fundamental importance: scattering. Scattering results in the appearance of scattered waves
of small intensity, whose frequencies and directions are not those of the main wave.
Scattering is ultimately due to the change in the motion of the charges in the medium
under the influence of the field of the incident wave, resulting in the emission of the
scattered waves. The microscopic mechanism of scattering must be investigated by
quantum methods, but this investigation is not needed in developing the macroscopic
theory described below. We shall therefore give only some brief remarks on the nature of
the processes which cause the change in the wave frequency on scattering.

The basic scattering process consists in the absorption of the original quantum $\hbar\omega$ by the
scattering system and the simultaneous emission by that system of another quantum $\hbar\omega'$.
The frequency $\omega'$ of the scattered quantum may be either less or greater than $\omega$; these two
cases are called respectively Stokes scattering and anti-Stokes scattering. In the former case
the system absorbs an amount of energy $\hbar(\omega - \omega')$; in the latter case it emits $\hbar(\omega' - \omega)$ and
makes a transition to a state of lower energy. In the simple case of a gas, for example,
scattering takes place at individual molecules, and the change in frequency may be due
either to a transition of the molecule to another energy level or to a change in the kinetic
energy of its motion.

Another kind of process occurs when the primary quantum $\hbar\omega$ remains unchanged but
causes the scattering system to emit two quanta: one of energy $\hbar\omega$, with the same frequency
and direction, and a “scattered” quantum $\hbar\omega'$. The energy $\hbar(\omega + \omega')$ is obtained from the
scattering system. Processes of this type, however, are, under ordinary conditions, very rare
in comparison with those of the first type.†

Proceeding now to consider the macroscopic theory of scattering, we must first make
precise the meaning of the averaging processes performed in that theory. The averaging of 
quantities in macroscopic electrodynamics can be regarded as comprising two operations. 
If, for clarity, we take the classical view, then we can distinguish the averaging over a 
physically infinitesimal volume with a given position of all the particles in it, and the 
averaging of the result with respect to the motion of the particles. In the theory of 
scattering, however, this procedure is impossible, because the averaging with respect to the 
motion of the particles annuls the very phenomenon which is to be discussed. Thus, for 
example, the field and induction of the scattered wave which appear in the theory of 


† We shall see in §118 that this stimulated emission is unimportant at all temperatures $T \ll \hbar(\omega + \omega')$. It may
become significant for radio waves.
scattering must be taken to be those resulting from the first averaging only. It should be
noted that in the quantum treatment one can of course refer to averaging over a volume
only for an operator of a physical quantity, not for the quantity itself. The second stage of
the averaging consists in determining the expectation value of the operator by means of the
quantum probabilities. Hence the electromagnetic quantities mentioned below should,
strictly speaking, be understood as quantum operators. This, however, does not affect the
final results of the theory given in the present section, and to simplify the formulae all
quantities will be treated as classical.

The monochromatic components of the fields in the scattered wave, taken in this sense,
will be denoted in this section by E', H', D' and B'. The fields in the incident wave will be
denoted by the unprimed letters E, H. In the present chapter we always suppose the
incident wave to be monochromatic with frequency $\omega$.

In the propagation of the scattered wave we have the relation $\mathbf{D}' = \varepsilon(\omega')\mathbf{E}'$ between the
electric induction and field (the scattering medium being assumed isotropic), but this
relation does not reveal the phenomenon of scattering, i.e. the formation of the scattered
wave from the incident wave. To describe this, additional small terms must be included in
the expression for D'. In the first approximation, these terms must be linear in the field of
the incident wave. The most general form of the relation is then

$$D'_i = \varepsilon' E'_i + \alpha_{ik}E_k + \beta_{ik}E_k^*. \tag{117.1}$$

Here $\varepsilon'$ denotes $\varepsilon(\omega')$; $\alpha_{ik}$ and $\beta_{ik}$ are tensors which characterize the scattering properties of
the medium. In general they are not symmetrical, and their components are functions both
of the frequency $\omega'$ of the scattered wave and of the primary frequency $\omega$. The fact that $\alpha$
and $\beta$ are tensors does not, of course, contradict the assumed isotropy of the medium. Only
the fully averaged properties of the medium are isotropic; the local deviations from the
average properties, which include the additional terms in (117.1), need not be isotropic.

The last term in (117.1) pertains to the part of the scattering which results from processes
of stimulated emission. All the terms on the right-hand side of equation (117.1) must
correspond to the same frequency $\omega'$ as D' on the left-hand side. Since E* has the frequency
$-\omega$, the frequency of the quantities $\beta_{ik}$ must be $\omega + \omega'$ to make the frequency of the
products $\beta_{ik}E_k^*$ equal to $\omega'$. But $\omega + \omega'$ is the frequency which characterizes processes of
stimulated emission. Because this effect is small, as mentioned above, we can neglect the
corresponding term in (117.1), and in what follows we shall write

$$D'_i = \varepsilon' E'_i + \alpha_{ik}E_k. \tag{117.2}$$

Similar formulae give the relation between B' and H'. We shall, however, neglect the 
magnetic properties of the medium, which are usually of no importance as regards the 
scattering of light, and therefore put B' = H'. 

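Maxwell’s equations for the field in the scattered wave are curl E' = $i\omega'\mathbf{H}'/c$, curl H' =
$-i\omega'\mathbf{D}'/c$. Eliminating H' from these equations, we find curl curl E' = $\omega'^2\mathbf{D}'/c^2$.
Substituting from (117.2) $\mathbf{E}' = \mathbf{D}'/\varepsilon' - \alpha\cdot\mathbf{E}/\varepsilon'$, where $\alpha\cdot\mathbf{E}$ denotes the vector whose
components are $\alpha_{ik}E_k$, and using the equation div D' = 0, we obtain for D' the equation

$$\Delta\mathbf{D}' + k'^2\mathbf{D}' = -\operatorname{curl}\operatorname{curl}(\alpha\cdot\mathbf{E}), \tag{117.3}$$

where $k' = \omega'\sqrt{\varepsilon'}/c$ is the wave number of the scattered wave.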

For an exact formulation of the conditions under which equation (117.3) is to be solved, 
we divide the scattering medium into small regions (whose dimensions are still large 
compared with molecular distances). On account of the molecular nature of the scattering
processes, their correlation at different points in the medium (assumed non-crystalline)
extends in general only to molecular distances.† Hence the scattered light from the various
regions is non-coherent. We can therefore treat scattering from one region as if the light
were not scattered at all in the remainder of the medium. In this way we calculate the field of
the scattered wave at a large distance from the scattering region. Using a well-known
approximation for the retarded potentials at a large distance from the source (see Fields,
§66), we can immediately derive the required solution of equation (117.3):

$$\mathbf{D}' = \frac{1}{4\pi}\operatorname{curl}\operatorname{curl}\left[ \frac{e^{ik'R_0}}{R_0} \int \alpha\cdot\mathbf{E}\, \exp(-i\mathbf{k}'\cdot\mathbf{r})\, dV \right]. \tag{117.4}$$

Here $\mathbf{R}_0$ is the radius vector from some point within the scattering volume (the integration
being over that volume) to the point where the field is to be calculated; the vector k' is in the
direction of $\mathbf{R}_0$. The integral in (117.4) is independent of the coordinates of the point
considered; retaining in the differentiation, as usual, only terms in $1/R_0$, we obtain

$$\mathbf{D}' = -\frac{e^{ik'R_0}}{4\pi R_0}\, \mathbf{k}' \times \left( \mathbf{k}' \times \int \alpha\cdot\mathbf{E}\, e^{-i\mathbf{k}'\cdot\mathbf{r}}\, dV \right).$$

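Since, at the point considered, the medium is regarded as not scattering, the relation
between D' and E' there is given by $\mathbf{D}' = \varepsilon'\mathbf{E}'$ simply. In the field of the incident wave E we
separate a factor periodic in space, putting

$$\mathbf{E} = \mathbf{E}_0 e^{i\mathbf{k}\cdot\mathbf{r}} = E_0\mathbf{e}\, e^{i\mathbf{k}\cdot\mathbf{r}}; \tag{117.5}$$

in the second equation, the complex amplitude $\mathbf{E}_0$ is written as $E_0\mathbf{e}$, where $E_0$ is a real
quantity ($E_0^2 = |\mathbf{E}_0|^2$), and e is a complex unit vector ($\mathbf{e}\cdot\mathbf{e}^* = 1$) which defines the
polarization of the wave. With the notation

$$\mathbf{G} = \int (\alpha\cdot\mathbf{e})\, e^{-i\mathbf{q}\cdot\mathbf{r}}\, dV, \qquad \mathbf{q} = \mathbf{k}' - \mathbf{k}, \tag{117.6}$$

we then have

$$\mathbf{E}' = -\frac{e^{ik'R_0}E_0}{4\pi R_0\varepsilon'}\, \mathbf{k}' \times (\mathbf{k}' \times \mathbf{G}) = \frac{e^{ik'R_0}k'^2E_0}{4\pi R_0\varepsilon'}\, \mathbf{G}_\perp. \tag{117.7}$$

The vector E' is perpendicular to the direction k' of the scattered wave, and is given by the
component $\mathbf{G}_\perp$ perpendicular to k'.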

Having thus determined the non-averaged field in the scattered wave, we can now 
investigate the intensity and polarization of the scattered light. To do so, we form the 
tensor 

$$I_{ik} = \langle E'_i E'^*_k \rangle, \tag{117.8}$$

where the brackets denote the final averaging over the motion of the particles, which so far 
has not been carried out. The averaging of a quadratic expression gives, of course, a result 


† Exceptions may occur for particular cases of scattering, which will be discussed in §120. In such cases the
dimensions of the scattering regions must be supposed large in comparison with the wavelength of the light.
which is not zero. Since E' is perpendicular to k', the tensor $I_{ik}$ has non-zero components
only in the plane perpendicular to k'. These components form a two-dimensional tensor
$I_{\alpha\beta}$ in that plane (Greek suffixes take two values). The tensor $I_{\alpha\beta}$ is, by definition, Hermitian:
$I_{\alpha\beta} = I^*_{\beta\alpha}$. It can be diagonalized, and the ratio of its two principal values gives the degree
of depolarization, while their sum is proportional to the total intensity.†

The products $E'_iE'^*_k$ involve products of integrals $G_i$, which must also be averaged.
Writing the product as a double integral, we have

$$\langle G_iG_k^* \rangle = e_le_m^* \iint \langle \alpha^{(1)}_{il}\alpha^{(2)*}_{km} \rangle \exp[-i\mathbf{q}\cdot(\mathbf{r}_1 - \mathbf{r}_2)]\, dV_1\, dV_2. \tag{117.9}$$

The indices (1) and (2) mean that the values of $\alpha$ are taken at two different points in space.
In averaging the integrand it must be remembered that the correlation between the
values of $\alpha$ at different points in the body extends in general only over molecular distances.
After averaging, therefore, the integrand will be appreciably different from zero only for
$|\mathbf{r}_2 - \mathbf{r}_1| \sim a$, where a is of the order of molecular distances. The exponent is $\sim a/\lambda$, where
$\lambda$ is the wavelength of the scattered wave; but $a/\lambda \ll 1$ if the macroscopic theory is
applicable, and so we can replace the exponential factor by unity.‡

Next, the integration with respect to the coordinates $\mathbf{r}_1$ and $\mathbf{r}_2$ can be replaced by one
with respect to $\tfrac{1}{2}(\mathbf{r}_1 + \mathbf{r}_2)$ and $\mathbf{r} = \mathbf{r}_1 - \mathbf{r}_2$. Since the integrand depends, after averaging, on r
only, we have

$$\langle G_iG_k^* \rangle = V e_le_m^* \int \langle \alpha^{(1)}_{il}\alpha^{(2)*}_{km} \rangle\, dV, \tag{117.10}$$

where V is the volume of the scattering region. It is evident a priori that the scattering must
be proportional to V. It should be noted that the direction of the wave vector k in the
incident wave appears neither in (117.10) nor, consequently, in the following formulae.

The integrals in (117.10) form a tensor of rank four, which depends only on the
properties of the scattering medium. Since the medium is isotropic, this tensor must be
expressible in terms of the unit tensor $\delta_{ik}$ (and scalar constants). Before giving the
appropriate expression, we should note that the tensor $\alpha_{ik}$, like any tensor of rank two, can
in the general case be represented as the sum of three independent parts:

$$\alpha_{ik} = \alpha\delta_{ik} + s_{ik} + a_{ik}, \tag{117.11}$$

where $\alpha$ is a scalar, $s_{ik}$ is an irreducible (i.e. with zero trace) symmetrical tensor, and $a_{ik}$ is an
antisymmetrical tensor:

$$\alpha = \tfrac{1}{3}\alpha_{ii}, \qquad s_{ik} = \tfrac{1}{2}(\alpha_{ik} + \alpha_{ki}) - \tfrac{1}{3}\alpha_{ll}\delta_{ik}, \qquad a_{ik} = \tfrac{1}{2}(\alpha_{ik} - \alpha_{ki}). \tag{117.12}$$
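The decomposition (117.11), (117.12) is easily carried out numerically. The following sketch, with hypothetical function names, splits an arbitrary 3 x 3 tensor (given as nested lists) into its scalar, irreducible symmetrical and antisymmetrical parts and recombines them.

```python
def decompose(alpha):
    """Return (scalar, s, a) for a 3x3 tensor alpha, following (117.12)."""
    scalar = sum(alpha[i][i] for i in range(3)) / 3.0
    # Symmetrical part with the trace removed
    s = [[0.5 * (alpha[i][k] + alpha[k][i]) - (scalar if i == k else 0.0)
          for k in range(3)] for i in range(3)]
    # Antisymmetrical part
    a = [[0.5 * (alpha[i][k] - alpha[k][i]) for k in range(3)] for i in range(3)]
    return scalar, s, a


def recombine(scalar, s, a):
    """Rebuild alpha_ik = scalar * delta_ik + s_ik + a_ik, as in (117.11)."""
    return [[(scalar if i == k else 0.0) + s[i][k] + a[i][k]
             for k in range(3)] for i in range(3)]


sample = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0], [7.0, 8.0, 10.0]]
scalar, s, a = decompose(sample)
```

The three parts contain respectively one, five and three independent components, making up the nine components of the original tensor.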

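On averaging the product $\alpha^{(1)}_{ik}\alpha^{(2)*}_{lm}$, a non-zero result can occur only for products of
components of each of the three parts of $\alpha_{ik}$ separately; clearly, the unit tensor cannot be
used to form an expression whose symmetry properties correspond to those of cross-products.
It follows that the tensor of rank four can be written as

$$\int \langle \alpha^{(1)}_{ik}\alpha^{(2)*}_{lm} \rangle\, dV = G_0\delta_{ik}\delta_{lm} + \tfrac{1}{10}G_s(\delta_{il}\delta_{km} + \delta_{im}\delta_{kl} - \tfrac{2}{3}\delta_{ik}\delta_{lm}) + \tfrac{1}{6}G_a(\delta_{il}\delta_{km} - \delta_{im}\delta_{kl}), \tag{117.13}$$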


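† See Fields, §50. The diagonalization of an Hermitian tensor means putting it in the form
$I_{\alpha\beta} = \lambda_1 n_{1\alpha}n^*_{1\beta} + \lambda_2 n_{2\alpha}n^*_{2\beta}$, where $\mathbf{n}_1$, $\mathbf{n}_2$ are, in general, perpendicular complex unit vectors:
$\mathbf{n}_1\cdot\mathbf{n}_1^* = \mathbf{n}_2\cdot\mathbf{n}_2^* = 1$, $\mathbf{n}_1\cdot\mathbf{n}_2^* = 0$. The principal values $\lambda_1$, $\lambda_2$ of an Hermitian tensor are real.

‡ This procedure requires further discussion in the case of Rayleigh scattering (§120).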
where the symmetry of the three terms corresponds to the products of the scalar,
symmetrical and antisymmetrical parts of the tensors $\alpha^{(1)}_{ik}$ and $\alpha^{(2)}_{lm}$. Contracting this
expression with respect to various pairs of suffixes, we obtain three equations, from which
the significance of the coefficients in (117.13) becomes evident:

$$G_0 = \int \langle \alpha^{(1)}\alpha^{(2)*} \rangle\, dV, \qquad G_s = \int \langle s^{(1)}_{ik}s^{(2)*}_{ik} \rangle\, dV, \qquad G_a = \int \langle a^{(1)}_{ik}a^{(2)*}_{ik} \rangle\, dV. \tag{117.14}$$

These quantities are real and positive.†

The tensor $I_{ik}$ thus becomes

$$I_{ik} = \text{constant} \times [G_0e_ie_k^* + \tfrac{1}{10}G_s(\delta_{ik} + e_i^*e_k - \tfrac{2}{3}e_ie_k^*) + \tfrac{1}{6}G_a(\delta_{ik} - e_i^*e_k)], \tag{117.15}$$

the constant being independent of the direction of scattering and of the incident wave
polarization. This tensor is, of course, not yet transverse to k'. The required tensor is
obtained by projecting the tensor (117.15) on a plane perpendicular to k'; to do this, it is
sufficient to take a coordinate system with one axis in the direction of k' and find the
components of the tensor along the other two axes.

In the general case, scattering can be regarded as a superposition of three independent
processes of scalar, symmetric and antisymmetric scattering, corresponding to the three
terms in (117.15).‡

If this separation is not important, it may be convenient to put (117.15) in another form,
combining similar terms:

$$I_{ik} = \text{constant} \times \{ \tfrac{1}{2}(a + c)e_ie_k^* + \tfrac{1}{2}(a - c)e_i^*e_k + b\delta_{ik} \}, \tag{117.16}$$

$$a = G_0 + \tfrac{1}{30}G_s - \tfrac{1}{6}G_a, \qquad b = \tfrac{1}{10}G_s + \tfrac{1}{6}G_a, \qquad c = G_0 - \tfrac{1}{6}G_s + \tfrac{1}{6}G_a. \tag{117.17}$$

Formula (117.15) or (117.16) determines the angular distribution and polarization
properties of the scattered light. In particular, on projecting this tensor on some
polarization vector e' (which defines the direction of E'), we obtain the intensity of the
scattered light component polarized in a particular way, which could be distinguished by
means of a suitable analyser:

$$I_{ik}e'^*_ie'_k = \text{constant} \times \{ G_0|\mathbf{e}\cdot\mathbf{e}'^*|^2 + \tfrac{1}{10}G_s(1 + |\mathbf{e}\cdot\mathbf{e}'|^2 - \tfrac{2}{3}|\mathbf{e}\cdot\mathbf{e}'^*|^2) + \tfrac{1}{6}G_a(1 - |\mathbf{e}\cdot\mathbf{e}'|^2) \} \tag{117.18}$$




† The fact that the tensor (117.13) is real, and therefore so are the coefficients in it, is evident from the
automatic symmetry of this tensor with respect to interchange of the pairs of suffixes ik and lm, this interchange
being equivalent to taking the complex conjugate (since the points 1 and 2 are equivalent). The coefficients are
positive because they can be expressed as the square of a modulus (or as the sum of such squares) by a
transformation opposite to that used in going from (117.9) to (117.10). For example,

$$G_0 = \frac{1}{V} \left\langle \left| \int \alpha\, dV \right|^2 \right\rangle.$$

‡ Formula (117.15) and the subsequent deductions made from it differ only in the definition of the coefficients $G$
from those in the Placzek quantum theory of scattering by separate freely oriented molecules (QED, §60).
or

$$I_{ik}e'^*_ie'_k = \text{constant} \times \{ \tfrac{1}{2}(a + c)|\mathbf{e}\cdot\mathbf{e}'^*|^2 + \tfrac{1}{2}(a - c)|\mathbf{e}\cdot\mathbf{e}'|^2 + b \}. \tag{117.19}$$
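The equivalence of the two forms (117.18) and (117.19) can be checked for arbitrary complex polarization vectors. The sketch below is illustrative only: the function names are hypothetical, the common constant is set equal to unity, and the vectors are represented as lists of three complex numbers.

```python
def abc_from_g(g0, gs, ga):
    """Coefficients a, b, c of (117.17)."""
    a = g0 + gs / 30.0 - ga / 6.0
    b = gs / 10.0 + ga / 6.0
    c = g0 - gs / 6.0 + ga / 6.0
    return a, b, c


def dot(u, v):
    """Plain (non-conjugated) scalar product of two 3-component vectors."""
    return sum(x * y for x, y in zip(u, v))


def conj(u):
    return [complex(x).conjugate() for x in u]


def intensity_117_18(e, e1, g0, gs, ga):
    p = abs(dot(e, conj(e1))) ** 2   # |e . e'*|^2
    m = abs(dot(e, e1)) ** 2         # |e . e'|^2
    return g0 * p + gs / 10.0 * (1.0 + m - 2.0 * p / 3.0) + ga / 6.0 * (1.0 - m)


def intensity_117_19(e, e1, g0, gs, ga):
    a, b, c = abc_from_g(g0, gs, ga)
    p = abs(dot(e, conj(e1))) ** 2
    m = abs(dot(e, e1)) ** 2
    return 0.5 * (a + c) * p + 0.5 * (a - c) * m + b
```

For a real vector e the quantities $|\mathbf{e}\cdot\mathbf{e}'^*|^2$ and $|\mathbf{e}\cdot\mathbf{e}'|^2$ coincide, and the coefficient c then drops out of the result.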

Let us consider the scattering of a linearly polarized wave. This corresponds to a real
vector e; see Fields, §§48, 50. The components of the scattered light tensor $I_{ik}$ are therefore
all real also. This means that the scattered light is partially polarized, and can be divided
into two independent (non-coherent) waves, each of which is linearly polarized. Since there
is only one distinctive direction in the plane perpendicular to k', given by the projection of
the vector e on that plane, it is evident that one of these waves is polarized with the vector e'
in the plane of e and k', with intensity $I_1$ say, and the other is polarized perpendicular to
that plane, with intensity $I_2$, say.†

When e is real, the expression (117.16) reduces to

$$I_{ik} = \text{constant} \times (ae_ie_k + b\delta_{ik}). \tag{117.20}$$

This contains only two independent constants, not three. Accordingly,

$$I_{ik}e'_ie'_k = \text{constant} \times [a(\mathbf{e}\cdot\mathbf{e}')^2 + b]$$

and, taking e' in the two directions mentioned above, we find the angular distributions of
the two non-coherent components of the scattered light:

$$I_1 = \text{constant} \times (a\sin^2\theta + b), \qquad I_2 = \text{constant} \times b, \tag{117.21}$$

where $\theta$ is the angle between e and the direction of scattering k'. The second distribution is
isotropic.

When natural light traverses the medium, the scattered light is partially polarized. The
corresponding tensor $I_{ik}$ is obtained from (117.16) by averaging over all directions of e in
the plane perpendicular to k. This averaging is effected by means of the formula

$$\overline{e_ie_k^*} = \tfrac{1}{2}(\delta_{ik} - n_in_k) \tag{117.22}$$

(where $\mathbf{n} = \mathbf{k}/k$), which represents a tensor of rank two depending only on the direction of
n, giving unity on contraction, and satisfying the condition

$$n_i\overline{e_ie_k^*} = \overline{(\mathbf{n}\cdot\mathbf{e})e_k^*} = 0.$$

In the scattering of natural light, therefore,

$$I_{ik} = \text{constant} \times \{ \tfrac{1}{2}a(\delta_{ik} - n_in_k) + b\delta_{ik} \}. \tag{117.23}$$

It is evident from symmetry that the two non-coherent components of the scattered light
are linearly polarized, with the vector e' in the plane of k and k' (the scattering plane) and
perpendicular to it; let the intensities of these components be $I_\parallel$ and $I_\perp$ respectively. The
formula

$$I_{ik}e'_ie'_k = \text{constant} \times \{ \tfrac{1}{2}a[1 - (\mathbf{n}\cdot\mathbf{e}')^2] + b \}$$

gives

$$I_\parallel = \text{constant} \times (\tfrac{1}{2}a\cos^2\vartheta + b), \qquad I_\perp = \text{constant} \times (\tfrac{1}{2}a + b), \tag{117.24}$$

where $\vartheta$ is the scattering angle (between k and k').
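The averaging formula (117.22) can be verified by a direct numerical average over real unit vectors e lying in the plane perpendicular to n. This is a sketch only: the function names are hypothetical, and natural light is modelled simply as a uniform mixture of linear polarizations.

```python
import math


def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def unit(a):
    norm = math.sqrt(sum(x * x for x in a))
    return tuple(x / norm for x in a)


def averaged_polarization_tensor(n, samples=720):
    """Average of e_i e_k over real unit vectors e perpendicular to n."""
    n = unit(n)
    helper = (1.0, 0.0, 0.0) if abs(n[0]) < 0.9 else (0.0, 1.0, 0.0)
    u = unit(cross(n, helper))   # first unit vector in the transverse plane
    w = cross(n, u)              # second one, perpendicular to both n and u
    avg = [[0.0] * 3 for _ in range(3)]
    for s in range(samples):
        phi = 2.0 * math.pi * s / samples
        e = [math.cos(phi) * u[i] + math.sin(phi) * w[i] for i in range(3)]
        for i in range(3):
            for k in range(3):
                avg[i][k] += e[i] * e[k] / samples
    return avg


def formula_117_22(n):
    """Right-hand side of (117.22): (delta_ik - n_i n_k) / 2."""
    n = unit(n)
    return [[0.5 * ((1.0 if i == k else 0.0) - n[i] * n[k]) for k in range(3)]
            for i in range(3)]
```

The two tensors agree to within rounding error for any direction of n, and each has unit trace.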


† We must again emphasize the necessity of distinguishing the polarization state of the scattered light as
such from the polarization e' that is registered by a detector.
We shall also give expressions for the angular distribution and polarization properties of
each of the three types of scattering separately. These are found from (117.21) and (117.24)
by simply substituting for a and b the corresponding terms in (117.17).

In scalar scattering of linearly polarized light, the scattered light too is completely
polarized, and its angular intensity distribution is given by

$$I = \tfrac{3}{2}\sin^2\theta; \tag{117.25}$$

here and henceforward, the expressions for I are normalized so as to give unity on
averaging over directions. In scattering of natural light, however, the angular distribution
of the intensity and the degree of depolarization (the ratio of the smaller and larger of $I_\parallel$
and $I_\perp$) of the scattered light are given by†

$$I = I_\perp + I_\parallel = \tfrac{3}{4}(1 + \cos^2\vartheta), \qquad I_\parallel/I_\perp = \cos^2\vartheta. \tag{117.26}$$

For symmetric scattering of polarized light, we have

$$I = I_1 + I_2 = \tfrac{3}{20}(6 + \sin^2\theta), \qquad I_2/I_1 = 3/(3 + \sin^2\theta), \tag{117.27}$$

and for scattering of natural light

$$I = \tfrac{3}{40}(13 + \cos^2\vartheta), \qquad I_\parallel/I_\perp = \tfrac{1}{7}(6 + \cos^2\vartheta). \tag{117.28}$$

Lastly, for antisymmetric scattering of polarized light,

$$I = \tfrac{3}{4}(1 + \cos^2\theta), \qquad I_1/I_2 = \cos^2\theta, \tag{117.29}$$

and for scattering of natural light

$$I = \tfrac{3}{8}(2 + \sin^2\vartheta), \qquad I_\perp/I_\parallel = 1/(1 + \sin^2\vartheta).$$
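The normalized distributions given above can be reproduced from (117.17), (117.21) and (117.24) by dividing the total intensity by its average over directions. In the sketch below the pairs (a, b) for each type of scattering are obtained from (117.17) by retaining only one of the coefficients $G_0$, $G_s$, $G_a$ and putting it equal to unity. The function names and the midpoint integration are assumptions of the illustration.

```python
import math

# (a, b) for each pure type of scattering, from (117.17) with one G equal to 1
TYPES = {
    "scalar": (1.0, 0.0),
    "symmetric": (1.0 / 30.0, 1.0 / 10.0),
    "antisymmetric": (-1.0 / 6.0, 1.0 / 6.0),
}


def polarized_components(theta, a, b):
    """(I_1, I_2) of (117.21), up to the common constant."""
    return a * math.sin(theta) ** 2 + b, b


def natural_components(vartheta, a, b):
    """(I_parallel, I_perp) of (117.24), up to the common constant."""
    return 0.5 * a * math.cos(vartheta) ** 2 + b, 0.5 * a + b


def direction_average(total, steps=20000):
    """Average over all directions: half the integral of total*sin over [0, pi]."""
    h = math.pi / steps
    return 0.5 * h * sum(
        total((i + 0.5) * h) * math.sin((i + 0.5) * h) for i in range(steps)
    )


def normalized_total(components, kind, angle):
    """Total intensity at the given angle, normalized to unit average."""
    a, b = TYPES[kind]
    mean = direction_average(lambda x: sum(components(x, a, b)))
    return sum(components(angle, a, b)) / mean
```

For example, `normalized_total(natural_components, "symmetric", x)` reproduces $\tfrac{3}{40}(13 + \cos^2 x)$ to the accuracy of the numerical integration.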


§118. The principle of detailed balancing applied to scattering 

The general principle of detailed balancing in quantum mechanics (see QM, §144) can be 
used to obtain a relation between the intensities in various scattering processes. 

Let $dw_{21}$ be the probability that a quantum $\hbar\omega_1$ is scattered (on a path of unit length) and
gives rise to a quantum $\hbar\omega_2$ in the solid angle element $do_2$;‡ let $dw_{12}$ be the probability of
the converse process, in which a quantum $\hbar\omega_2$ yields a quantum $\hbar\omega_1$ in the solid angle
element $do_1$. According to the principle of detailed balancing we have $dw_{21}/k_2^2\,do_2$
$= dw_{12}/k_1^2\,do_1$, where $k_1$ and $k_2$ are the wave numbers of the two quanta. Substituting $k_1^2$
$= \varepsilon_1\omega_1^2/c^2$, $k_2^2 = \varepsilon_2\omega_2^2/c^2$ (where $\varepsilon_1 = \varepsilon(\omega_1)$, $\varepsilon_2 = \varepsilon(\omega_2)$), we obtain

$$\varepsilon_1\omega_1^2\, dw_{21}/do_2 = \varepsilon_2\omega_2^2\, dw_{12}/do_1. \tag{118.1}$$

Here it is assumed that the initial and final states of the scattering system correspond to
discrete energy levels $E_1$ and $E_2$, related by $E_1 + \hbar\omega_1 = E_2 + \hbar\omega_2$. This statement of the


† The passage from the formulae for $I(\theta)$ to those for $I(\vartheta)$ corresponds to averaging in accordance with
$\overline{\sin^2\theta} = \tfrac{1}{2}(1 + \cos^2\vartheta)$; compare the derivation of (92.6) from (92.3).

‡ The two quanta also have definite polarizations, but for brevity these are not shown. The order of suffixes in
$dw_{21}$ corresponds to that which is usual in quantum mechanics, with the initial state on the right and the final
state on the left.


problem is not quite true to reality, since the energy levels of a macroscopic body are
extremely closely spaced and can be regarded as quasi-continuous.

Instead of the scattering probability $dw_{21}$ with an exactly determined frequency change,
we must therefore use the probability of scattering into a frequency range $d\omega_2$, i.e. of the
body’s entering a state whose energy lies in a range $dE_2 = \hbar\,d\omega_2$. Denoting this probability
(again per unit path length) by $dh_{21}$, we have $dh_{21} = dw_{21}\,d\Gamma_2 = dw_{21}(d\Gamma_2/dE_2)\hbar\,d\omega_2$,
where $d\Gamma_2$ is the number of quantum states of the body in the energy range $dE_2$. Instead of
(118.1), we therefore have

$$\varepsilon_1\omega_1^2\, \frac{d\Gamma_1}{dE_1}\, \frac{dh_{21}}{do_2\,d\omega_2} = \varepsilon_2\omega_2^2\, \frac{d\Gamma_2}{dE_2}\, \frac{dh_{12}}{do_1\,d\omega_1}.$$

According to a well-known relation between the statistical weight of a macroscopic state
of a body and its entropy $\mathcal{S}$, the derivative $d\Gamma/dE$ is essentially $\exp\mathcal{S}$, so that
$(d\Gamma_1/dE_1):(d\Gamma_2/dE_2) = \exp(\mathcal{S}_1 - \mathcal{S}_2)$. Since the relative change in the energy of the body
resulting from the scattering of one quantum is negligible, the change in entropy is
also small, and can be taken as $\mathcal{S}_1 - \mathcal{S}_2 = (d\mathcal{S}/dE)(E_1 - E_2) = \hbar(\omega_2 - \omega_1)/T$. Using this
result, we can write the final expression of the principle of detailed balancing for scattering
in the form

$$\varepsilon_1\omega_1^2\, e^{-\hbar\omega_1/T}\, \frac{dh_{21}}{do_2\,d\omega_2} = \varepsilon_2\omega_2^2\, e^{-\hbar\omega_2/T}\, \frac{dh_{12}}{do_1\,d\omega_1}. \tag{118.2}$$


The quantity $dh_{21}$, whose dimensions are cm$^{-1}$, is called the differential extinction
coefficient for scattering of light. It can also be defined as follows: $dh_{21}$ is the ratio of the
number of quanta scattered in the direction $do_2$ and the frequency range $d\omega_2$ per unit time
and volume to the incident photon flux density. By integrating $dh_{21}$ over all directions and
frequencies of the scattered light, we obtain the total extinction coefficient, which represents
the damping decrement of the photon flux density as the light passes through the scattering
medium.

Let $\omega_2 < \omega_1$. The relation (118.2) connects the intensities (extinction coefficients) of
Stokes ($1 \to 2$) and anti-Stokes ($2 \to 1$) scattering. We see that the latter is in general less
than the former by approximately the factor $e^{-\hbar(\omega_1 - \omega_2)/T}$. This is a very general result, and
corresponds to the fact that the transfer of energy from the body to the electromagnetic
field reduces the probability of the process by a factor $e^{-\Delta E/T}$, where $\Delta E$ is the energy
transferred. In particular, the stimulated emission, in which the body gives up an energy
$\hbar(\omega_1 + \omega_2)$ in each scattering process, is therefore usually very weak. The probability of
such a process, when $\hbar(\omega_1 + \omega_2) \gg T$, contains the small factor $e^{-\hbar(\omega_1 + \omega_2)/T}$.

The general relation (118.2) is much simplified in the important case of scattering with a
relatively small change in frequency. We shall denote $\omega_1$ by $\omega$ simply, and the small
difference $\omega_2 - \omega_1$ by $\Omega$ ($|\Omega| \ll \omega$), and put for brevity

$$dh_{21}/do_2\,d\omega_2 = I(\omega, \Omega). \tag{118.3}$$

In the non-exponential factors $\varepsilon\omega^2$ in (118.2) we can neglect the difference $\Omega$; these factors
then cancel, leaving

$$I(\omega, \Omega) = I(\omega + \Omega, -\Omega)\, e^{-\hbar\Omega/T}.$$

In the first argument of the function $I(\omega + \Omega, -\Omega)$, which gives the initial frequency of the
light, we can neglect $\Omega$, i.e. refer the scattered intensity to the slightly different frequency
of the incident light. Then

$$I(\omega, \Omega) = I(\omega, -\Omega)\, e^{-\hbar\Omega/T}. \tag{118.4}$$

In this approximation I on each side of the equation refers to the same frequency of the
incident light. In other words, the relation (118.4) gives a simple relation between Stokes
and anti-Stokes scattering of the same light with the same magnitude of the frequency
change $\Omega$.
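The relation (118.4) is easily evaluated for a given frequency shift. In the sketch below the temperature is given in kelvin and the shift as a wave number in cm$^{-1}$, so that $\hbar\Omega/T$ becomes $hc\tilde{\nu}/k_BT$. The function name and the sample values are assumptions made only for illustration.

```python
import math

HC_OVER_KB_CM_K = 1.438776877  # h c / k_B in cm K (second radiation constant)


def anti_stokes_to_stokes_ratio(shift_cm1, temperature_k):
    """Ratio I(omega, |Omega|) / I(omega, -|Omega|) according to (118.4).

    shift_cm1: magnitude of the frequency change as a wave number (cm^-1);
    temperature_k: temperature in kelvin.
    """
    return math.exp(-HC_OVER_KB_CM_K * shift_cm1 / temperature_k)


ratio_room = anti_stokes_to_stokes_ratio(1000.0, 300.0)   # less than one per cent
ratio_hot = anti_stokes_to_stokes_ratio(1000.0, 1000.0)   # considerably larger
```

For a shift of 1000 cm$^{-1}$ at 300 K the anti-Stokes intensity is thus less than one per cent of the Stokes intensity, and the ratio tends to unity when $\hbar|\Omega| \ll T$.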


PROBLEM 


Determine the relation between the intensities of stimulated Raman scattering (§112) and ordinary 
(spontaneous) scattering. 


Solution. The probability of stimulated scattering is found from that of spontaneous scattering by
multiplying by the number of photons $N_{\mathbf k_2}$ in the quantum state with wave vector $\mathbf k_2$. In order to relate this
number to the field $\mathbf E_2$ of the scattered wave, we must regard the latter as almost monochromatic, and equate the
expressions for the field energy (per unit volume) in terms of the number of quanta and in terms of the field:

$$\int \hbar\omega_2 N_{\mathbf k_2}\,\frac{d^3k_2}{(2\pi)^3} = \frac{1}{16\pi\omega_2}\,\frac{d(\omega_2^2\varepsilon_2)}{d\omega_2}\,|\mathbf E_2|^2, \tag{1}$$

the right-hand side being written in accordance with (83.9). On the left-hand side, the factors which vary
only slightly over a narrow frequency range can be taken outside the integral after substituting

$$d^3k_2 = k_2^2\,(dk_2/d\omega_2)\,d\omega_2\,do_2 = (\varepsilon_2\omega_2^2/c^2)(dk_2/d\omega_2)\,d\omega_2\,do_2.$$

Since $d(\omega_2^2\varepsilon_2)/d\omega_2 = 2c^2k_2\,dk_2/d\omega_2$, this gives

$$\int N_{\mathbf k_2}\,d\omega_2\,do_2 = \frac{\pi^2c^3}{\hbar\omega_2^3\sqrt{\varepsilon_2}}\,|\mathbf E_2|^2.$$

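The incident photon flux density is, from (83.11),

$$\frac{S_1}{\hbar\omega_1} = \frac{c\sqrt{\varepsilon_1}}{8\pi\hbar\omega_1}\,|\mathbf E_1|^2.$$

Thus the energy transferred to the field $\mathbf E_2$ by stimulated scattering of photons $\hbar\omega_1$ is, with the notation (118.3),

$$\frac{dU_2}{dt} = \hbar\omega_2\,\frac{S_1}{\hbar\omega_1}\,I(\omega_1, \Omega)\int N_{\mathbf k_2}\,d\omega_2\,do_2 \tag{2}$$

with $\Omega = \omega_2 - \omega_1$. The field $\mathbf E_2$ also loses energy by the stimulated scattering of photons $\hbar\omega_2$ with their
conversion into photons $\hbar\omega_1$. The energy thus gained by the field $\mathbf E_1$ is expressed by formula (2) with the suffixes 1
and 2 interchanged. The energy lost by the field $\mathbf E_2$ is then found by multiplying by $-\omega_2/\omega_1$; see (112.8). It is

$$-\frac{\pi c^4\sqrt{\varepsilon_2}}{8\hbar\omega_1^3\sqrt{\varepsilon_1}}\,|\mathbf E_1|^2|\mathbf E_2|^2\,I(\omega_2, -\Omega). \tag{3}$$

Adding the two expressions, and expressing $I(\omega_2, -\Omega)$ in terms of $I(\omega_1, \Omega)$ by means of (118.2), we find the total
energy change of the field with frequency $\omega_2$:

$$\frac{dU_2}{dt} = \frac{\pi c^4\sqrt{\varepsilon_1}}{8\hbar\omega_1\omega_2^2\sqrt{\varepsilon_2}}\,|\mathbf E_1|^2|\mathbf E_2|^2\,I(\omega_1, \Omega)\left(1 - e^{\hbar\Omega/T}\right). \tag{4}$$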
The growth rate (when $\omega_2 < \omega_1$) of the scattered radiation intensity at frequency $\omega_2$, per unit path, is
$g_2 = (1/S_2)\,dU_2/dt$, with dimensions 1/cm. The final form of the relation is then

$$g_2 = \frac{(2\pi)^3 S_1}{\hbar\omega_1 k_2^2}\,I(\omega_1, \Omega)\left(1 - e^{-\hbar|\Omega|/T}\right), \tag{5}$$

where $k_2 = \omega_2\sqrt{\varepsilon_2}/c$, and $S_1$ is the energy flux density of the incident light of frequency $\omega_1$.


§119. Scattering with small change of frequency 

The theory given in §117 is entirely general, and is applicable to all cases of scattering in 
an isotropic medium, whatever the mechanism of scattering. Such a general discussion, of 
course, cannot proceed very far, and a further investigation of the phenomenon of 
scattering requires some restrictive assumptions. 

Usually, the scattering of light involves only a relatively small change in frequency,
$\Omega = \omega' - \omega$. The calculations given below pertain to this case. Besides the condition
$|\Omega| \ll \omega$, we shall suppose that the relative change in the refractive index of the medium
over the frequency range $\Omega$ is small. This condition means that the frequency $\omega$ must not
lie close to a range or line in which the scattering medium is also absorbing.

If $\omega$ is in the optical range, the microscopic mechanism of scattering with small $\Omega$ may
involve various kinds of motion of atoms and molecules (i.e. of nuclei, as opposed to the
purely electronic motions which give rise to optical transitions), including intramolecular
vibrations of atoms, rotations or vibrations of molecules, etc.†

Let $q = q(t)$ denote the set of coordinates describing the motion which causes the
scattering. (For simplicity, we shall first give a classical discussion.) Since this motion is
relatively slow, the macroscopic description of scattering can be regarded from a different
standpoint by introducing the permittivity tensor $\varepsilon_{ik}(q)$, whose components at any instant
depend only on the values of the coordinates $q$ at that instant as parameters. This property
follows from the assumed slowness of the relative change in $\varepsilon$. The permittivity thus
defined pertains to the field averaged with respect to the electron motion for a given
position of the nuclei. When the averaging of the field (including that with respect to the
motion of the nuclei) is carried out, the permittivity reduces to the scalar $\varepsilon(\omega)$. Let the
deviation of $\varepsilon_{ik}$ from this value be $\delta\varepsilon_{ik}$:

$$\varepsilon_{ik}(q) = \varepsilon\,\delta_{ik} + \delta\varepsilon_{ik}(q). \tag{119.1}$$

The tensor $\varepsilon_{ik}$ gives the relation between the field and the induction as functions of time.
It should be emphasized that the incident wave is still assumed to be monochromatic with
frequency $\omega$, but the field $\mathbf E'$ in the scattered wave is now regarded as a function of time, not
resolved into monochromatic components. The total field consists of the field $\mathbf E$ in the
incident wave and the field $\mathbf E'$ in the scattered wave. Thus $D_i + D'_i = \varepsilon_{ik}(E_k + E'_k)$.
Cancelling $D_i = \varepsilon E_i$, and omitting the second-order term $\delta\varepsilon_{ik}E'_k$, we obtain

$$D'_i = \varepsilon E'_i + \delta\varepsilon_{ik}(q)E_k. \tag{119.2}$$

The relation (119.2) is of the same form as (117.2). There is a difference, however, in that
with this approach it is clear that the tensor $\alpha_{ik} = \delta\varepsilon_{ik}$ is symmetrical. This follows at once
from the general theorem concerning the symmetry of the permittivity tensor.
Furthermore, since this tensor is real for a transparent medium, the tensor $\alpha_{ik}$ is also real.


† Here it is assumed that the relevant values of $\Omega$ are much smaller than the electronic transition frequencies.
This condition may be violated in a gas consisting of molecules which have a degenerate electronic ground state.

Since the tensor $\alpha_{ik}$ has no antisymmetrical part, there is no antisymmetric scattering
(§117) with small change in frequency.

Let us calculate the total scattered intensity with all frequency changes $\Omega = \omega' - \omega \ll \omega$.
This can easily be done as follows. In equation (117.3) for the field in the scattered wave we
can replace $k'$ by $k = \omega\sqrt{\varepsilon}/c$ (and take the value of $\varepsilon$ for $\omega' = \omega$); this equation does
not then involve $\omega'$, i.e. it is the same for every component of the spectral resolution of the
field. The equation is therefore valid for the unresolved field in the scattered wave, which
we shall denote by the same letter $\mathbf E'$. Using the solution (117.7), we obtain

$$\langle|\mathbf E'|^2\rangle = \frac{k^4}{16\pi^2R_0^2\varepsilon^2}\,\langle|\mathbf G|^2\rangle\sin^2\theta = \frac{\omega^4}{16\pi^2R_0^2c^4}\,\langle|\mathbf G|^2\rangle\sin^2\theta,$$

where $\theta$ is the angle between $\mathbf k'$ and $\mathbf G$, and the brackets denote, as in §117, the final average
with respect to the motion of the particles.

We define the extinction coefficient $h$ as the ratio of the total intensity of light scattered
in all directions per unit volume of the scattering medium to the incident flux density:†

$$h = \frac{1}{V|\mathbf E|^2}\int\langle|\mathbf E'|^2\rangle R_0^2\,do', \tag{119.3}$$

in which we have put $\varepsilon(\omega') = \varepsilon(\omega)$.

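For the reason given in §117, in calculating the mean value $\langle|\mathbf G|^2\rangle$ the exponential
factor in the integrand of $\mathbf G$ is replaced by unity, so that

$$\langle|\mathbf G|^2\rangle = V E_iE_k^*\int\langle\delta\varepsilon_{il}^{(1)}\delta\varepsilon_{kl}^{(2)}\rangle\,dV;$$

cf. (117.9), (117.10). The expression in the angle brackets is a tensor of rank two, and, since
the medium is isotropic, it gives on averaging

$$\tfrac13\delta_{ik}\langle\delta\varepsilon_{lm}^{(1)}\delta\varepsilon_{lm}^{(2)}\rangle.$$

The mean square

$$\langle|\mathbf G|^2\rangle = \tfrac13V|\mathbf E|^2\int\langle\delta\varepsilon_{ik}^{(1)}\delta\varepsilon_{ik}^{(2)}\rangle\,dV$$

is then independent of the direction of scattering, and the result of the integration in
(119.3) is

$$h = \frac{\omega^4}{18\pi c^4}\int\langle\delta\varepsilon_{ik}^{(1)}\delta\varepsilon_{ik}^{(2)}\rangle\,dV. \tag{119.4}$$

The integrand here is the correlation function of the permittivity fluctuations at different
points $\mathbf r_1$ and $\mathbf r_2$ in the medium at the same time; the integration is taken with respect to the
difference of coordinates $\mathbf r = \mathbf r_1 - \mathbf r_2$. If we return to integration over $dV_1$ and $dV_2$,
formula (119.4) becomes

$$h = \frac{\omega^4V}{18\pi c^4}\,\langle(\delta\varepsilon_{ik})^2\rangle_V, \tag{119.5}$$

where $\langle\ldots\rangle_V$ denotes the mean square fluctuation in the volume $V$. It may be noted that
the total extinction coefficient is independent of the polarization of the incident light.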
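
The numerical coefficient in (119.4) can be checked independently. The sketch below (function names are illustrative, not taken from the text) integrates the dipole factor $\sin^2\theta$ over all directions of scattering and combines it with the factor $1/16\pi^2$ from the scattered field and the factor $1/3$ from the isotropic average. Under the assumption that these are the only numerical factors involved, the product should equal $1/18\pi$.

```python
import math


def solid_angle_integral_sin2(n_steps=20000):
    """Midpoint-rule value of the integral of sin^2(theta) over the full solid angle."""
    d_theta = math.pi / n_steps
    total = 0.0
    for i in range(n_steps):
        theta = (i + 0.5) * d_theta
        # do' = 2*pi*sin(theta)*d(theta) after integrating over the azimuth
        total += math.sin(theta) ** 2 * 2.0 * math.pi * math.sin(theta) * d_theta
    return total


def extinction_prefactor():
    """Numerical factor multiplying omega^4/c^4 times the correlation integral."""
    return solid_angle_integral_sin2() / (16.0 * math.pi ** 2) / 3.0


if __name__ == "__main__":
    print(solid_angle_integral_sin2(), 8.0 * math.pi / 3.0)
    print(extinction_prefactor(), 1.0 / (18.0 * math.pi))
```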


† This definition differs by a factor $\omega'/\omega$ from the definition (in terms of the number of scattered quanta) given
in §118. In the present case this factor may be taken as unity, and the two definitions are equivalent.
The formulae derived above enable us to consider scattering macroscopically as
occurring at fluctuational inhomogeneities in the medium. In this treatment, the angular
distribution and spectral composition of the scattered light are determined by the
space-time characteristics of the fluctuations, i.e. by the correlation function†

$$\langle\delta\varepsilon_{il}(t_1, \mathbf r_1)\,\delta\varepsilon_{km}(t_2, \mathbf r_2)\rangle$$

between the fluctuations at different points at different instants.

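In order to see this, we expand the time-dependent quantities $\delta\varepsilon_{ik}$ as a Fourier integral:

$$\delta\varepsilon_{ik}(t, \mathbf r) = \int\delta\varepsilon_{ik\Omega}(\mathbf r)\,e^{-i\Omega t}\,\frac{d\Omega}{2\pi}.$$

Each component $\delta\varepsilon_{ik\Omega}e^{-i\Omega t}$ then acts as $\alpha_{ik}$ in the relation (117.2) between the
monochromatic components of $\mathbf E$ and $\mathbf D'$, with $\omega' = \omega + \Omega$. Next, representing the
expressions quadratic in the field as double integrals, similarly to (117.9), we easily find the
differential extinction coefficient with respect to frequency and direction:

$$dh = \frac{\omega^4}{16\pi^2c^4}\,e'^*_ie'_ke_le^*_m\,(\delta\varepsilon_{il}\delta\varepsilon_{km})_{\Omega\mathbf q}\,\frac{d\Omega}{2\pi}\,do', \tag{119.6}$$

where we have used, in accordance with the notation of SP 2, Chapters VIII and IX, the
space-time Fourier expansion component of the correlation function:

$$(\delta\varepsilon_{il}\delta\varepsilon_{km})_{\Omega\mathbf q} = \iint\langle\delta\varepsilon_{il}(t_1, \mathbf r_1)\,\delta\varepsilon_{km}(t_2, \mathbf r_2)\rangle\,e^{i(\Omega t - \mathbf q\cdot\mathbf r)}\,dt\,dV \tag{119.7}$$

($t = t_1 - t_2$, $\mathbf r = \mathbf r_1 - \mathbf r_2$). Formula (119.6) relates to the scattered light component with
polarization $\mathbf e'$ recorded by an analyser. The term “spectral distribution” refers to the
strong dependence on $\Omega$ within the scattering line; the slowly varying factor $\omega'^4$ is replaced
by $\omega^4$. The factor $e^{-i\mathbf q\cdot\mathbf r}$ is retained in the integrand of (119.7). Its replacement by unity in the
expression for the spectral distribution may be inadmissible even if it can be done for the
scattering integrated over frequencies (see §120).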

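Hitherto, the discussion has been presented in terms of classical mechanics. In the
quantum description, the coordinates $q$, and therefore the $\delta\varepsilon_{ik}$, are replaced by the
corresponding quantum-mechanical operators in the Heisenberg representation. It can be
shown (see the end of this section) that formula (119.6) then remains valid if $(\delta\varepsilon_{il}\delta\varepsilon_{km})_{\Omega\mathbf q}$ is
taken as

$$(\delta\varepsilon_{il}\delta\varepsilon_{km})_{\Omega\mathbf q} = \iint\langle\delta\varepsilon_{km}(t_2, \mathbf r_2)\,\delta\varepsilon_{il}(t_1, \mathbf r_1)\rangle\,e^{i(\Omega t - \mathbf q\cdot\mathbf r)}\,dt\,dV. \tag{119.8}$$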

The angle brackets now denote complete (both quantum and statistical) averaging over
the state of the medium. Since the operators $\delta\varepsilon_{ik}(t, \mathbf r)$ at different times and positions do not
commute, the order of the operators in (119.8) is significant. This non-commutativity is the
reason for the dependence of the scattering intensity on the sign of $\Omega$ in (118.4); in the
classical limit, there is no such dependence. The quantum formula necessarily satisfies the
relation stated.

† This function depends, of course, only on the difference $t = t_1 - t_2$. The second stage of averaging (denoted
by the angle brackets), as applied to the product of fluctuations shown, may be regarded as an averaging over the

The extinction coefficient integrated over frequencies is obtained by integration with
respect to $\Omega$; in view of the rapid convergence outside the absorption line, the integration
may be extended from $-\infty$ to $\infty$ ($\Omega$ is the difference $\omega' - \omega$, and its positive and negative
values are therefore physically distinct). The integral

$$\int e^{i\Omega t}\,d\Omega/2\pi = \delta(t),$$

and the delta function is then eliminated by integrating with respect to $t$, so that the
time-difference correlation function becomes a single-time function. The differential extinction
coefficient with respect to directions is

$$dh = \frac{\omega^4}{16\pi^2c^4}\,e'^*_ie'_ke_le^*_m\,(\delta\varepsilon_{il}\delta\varepsilon_{km})_{\mathbf q}\,do', \tag{119.9}$$

where

$$(\delta\varepsilon_{il}\delta\varepsilon_{km})_{\mathbf q} = \int\langle\delta\varepsilon_{il}^{(1)}\delta\varepsilon_{km}^{(2)}\rangle\,e^{-i\mathbf q\cdot\mathbf r}\,dV \tag{119.10}$$

is the Fourier component of the single-time correlation function. When $\mathbf q = 0$, the angular
distribution depends only on the polarization factors, and we return to the formulae
derived previously.

The retention of the factor $e^{-i\mathbf q\cdot\mathbf r}$ (not equal to unity) in the integral (119.10)
considerably complicates the angular distribution and the polarization properties of the
scattered light.† In particular, it is not correct to divide the scattering into two parts (scalar
and symmetric) given by the first two terms in (117.18). An essential point here is that the
tensor $(\delta\varepsilon_{il}\delta\varepsilon_{km})_{\mathbf q}$ need not be a true tensor, as it must when $\mathbf q = 0$; it may contain
pseudotensor components (see Problem 1). These terms are allowable if the isotropic
medium consists of molecules with right-left asymmetry and is therefore not invariant
under inversion.

Finally, let us discuss briefly the quantum derivation of (119.6) and (119.8). 

The perturbation operator in the Hamiltonian of the system comprising the medium
and the field is the integral

$$\hat V = -\int\delta\varepsilon_{ik}\,\frac{\hat E_i\hat E_k}{8\pi}\,dV, \tag{119.11}$$

where $\hat{\mathbf E}$ is the quantized electromagnetic field operator. After averaging over the
stationary states of the system and statistical averaging over the Gibbs distribution, this
operator gives the change in the free energy when there is a slow change in the
permittivity; see (101.24).

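The operator $\hat{\mathbf E}$ is expressed in terms of the annihilation and creation operators for
photons in the state $\omega$, $\mathbf k$, $\mathbf e$:

$$\hat{\mathbf E} = \sum_{\mathbf k\alpha}\left(\frac{2\pi\hbar\omega u}{c\sqrt{\varepsilon}}\right)^{1/2}\left[\hat c_{\mathbf k\alpha}\mathbf e_\alpha e^{i(\mathbf k\cdot\mathbf r - \omega t)} + \hat c^+_{\mathbf k\alpha}\mathbf e^*_\alpha e^{-i(\mathbf k\cdot\mathbf r - \omega t)}\right], \tag{119.12}$$

where $u = d\omega/dk$ (the normalization volume is taken as unity). This expression differs
from the one for the field in a vacuum (see QED, §2) by the factor $(u/c\sqrt{\varepsilon})^{1/2}$ in the
normalization coefficients, which originates from the factor $c\sqrt{\varepsilon}/u$ in the energy density
(83.9) for a plane electromagnetic wave in the medium.†

The probability of a transition in which a photon $\mathbf k$ is absorbed, a photon $\mathbf k'$ is emitted,
and the medium goes from a specified initial state (suffix $n$) to any final state ($f$) is (see
QM, (40.5))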

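$$dw = \sum_f\left|\frac{1}{\hbar}\int\langle\mathbf k'f|\hat V|\mathbf kn\rangle\,dt\right|^2\frac{d^3k'}{(2\pi)^3} \tag{119.13}$$

with the matrix element

$$\langle\mathbf k'f|\hat V|\mathbf kn\rangle = -\frac{\hbar\omega u}{2c\sqrt{\varepsilon}}\,e'^*_ie_l\int(\delta\varepsilon_{il})_{fn}\,e^{i(\Omega t - \mathbf q\cdot\mathbf r)}\,dV,$$

$$\Omega = \omega' - \omega, \qquad \mathbf q = \mathbf k' - \mathbf k.$$

The integral in (119.13) is written as a double integral over $dV_1\,dV_2\,dt_1\,dt_2$, and we use the
fact that

$$\sum_f(\delta\varepsilon_{km}^{(2)})_{nf}(\delta\varepsilon_{il}^{(1)})_{fn} = (\delta\varepsilon_{km}^{(2)}\delta\varepsilon_{il}^{(1)})_{nn}.$$

The integrand depends only on the differences $\mathbf r_1 - \mathbf r_2$ and $t_1 - t_2$. The probability $dw$
therefore contains as a factor the total observation time $t$. The extinction coefficient sought
is defined as $dh = dw/tu$. The final statistical averaging over states of the medium gives the
required result.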


PROBLEMS 

Problem 1. Find the general form of the polarization dependence for scattering in an isotropic medium, 
taking into account the momentum q transferred to the medium (B. Ya. Zel’dovich, 1972). 

Solution. The problem amounts to that of finding all the independent rank-four tensor combinations
having the symmetry of the tensor $(\delta\varepsilon_{il}\delta\varepsilon_{km})_{\Omega\mathbf q}$, which can be formed from the unit tensor $\delta_{ik}$, the antisymmetric
unit tensor $e_{ikl}$ and the components of the vector $\boldsymbol\nu = \mathbf q/q$; they must be symmetrical in each pair of suffixes $il$ and
$km$, and invariant under interchange of these two pairs (which is equivalent to interchanging the points $\mathbf r_1$ and $\mathbf r_2$,
and therefore to changing the sign of $\mathbf r$) with a simultaneous change in the sign of $\boldsymbol\nu$. These conditions are satisfied
by the combinations

(1) $\delta_{il}\delta_{km}$,

(2) $\delta_{ik}\delta_{lm} + \delta_{im}\delta_{kl}$,

(3) $\delta_{il}\nu_k\nu_m + \delta_{km}\nu_i\nu_l$,

(4) $\delta_{ik}\nu_l\nu_m + \ldots$,

(5) $\nu_i\nu_k\nu_l\nu_m$,

(6) $\nu_pe_{ikp}\delta_{lm} + \ldots$,

(7) $\nu_pe_{ikp}\nu_l\nu_m + \ldots$;

† The normalization coefficient in (119.12) is obtained from the condition that the eigenvalues of the field
energy density operator should be $\sum(N_{\mathbf k\alpha} + \tfrac12)\hbar\omega$, where $N_{\mathbf k\alpha}$ are the photon quantum state occupation numbers.
The photon energy in the medium is $\hbar\omega$, and the momentum is $\hbar\mathbf k$ ($k = \omega\sqrt{\varepsilon}/c$); these are in the exponents in
(119.12). To avoid misunderstanding, it should be emphasized that the momentum $\hbar\mathbf k$ includes not only the
contribution from the field as such but also the momentum acquired by the medium in the photon emission
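three terms obtained by symmetrization have been omitted from (4), (6) and (7). These combinations together
correspond to an angular distribution

$$\begin{aligned}
& f_1|\mathbf e\cdot\mathbf e'^*|^2 + f_2\{1 + |\mathbf e\cdot\mathbf e'|^2\} + f_3\{(\mathbf e\cdot\mathbf e'^*)(\boldsymbol\nu\cdot\mathbf e^*)(\boldsymbol\nu\cdot\mathbf e') + \text{c.c.}\} \\
&+ f_4\{|\boldsymbol\nu\cdot\mathbf e|^2 + |\boldsymbol\nu\cdot\mathbf e'|^2 + [(\mathbf e\cdot\mathbf e')(\boldsymbol\nu\cdot\mathbf e^*)(\boldsymbol\nu\cdot\mathbf e'^*) + \text{c.c.}]\} + f_5|(\boldsymbol\nu\cdot\mathbf e)(\boldsymbol\nu\cdot\mathbf e')|^2 \\
&+ if_6\,\boldsymbol\nu\cdot\{\mathbf e\times\mathbf e^* + \mathbf e'^*\times\mathbf e' + [\mathbf e\times\mathbf e'(\mathbf e^*\cdot\mathbf e'^*) - \text{c.c.}]\} \\
&+ if_7\,\boldsymbol\nu\cdot\{\mathbf e'^*\times\mathbf e'(\boldsymbol\nu\cdot\mathbf e)(\boldsymbol\nu\cdot\mathbf e^*) + \mathbf e\times\mathbf e^*(\boldsymbol\nu\cdot\mathbf e')(\boldsymbol\nu\cdot\mathbf e'^*) + [\mathbf e\times\mathbf e'(\boldsymbol\nu\cdot\mathbf e^*)(\boldsymbol\nu\cdot\mathbf e'^*) - \text{c.c.}]\},
\end{aligned}$$

in which $f_1, \ldots, f_7$ are real functions of $\Omega$ and $q$. In a medium which allows inversion, only the first five terms
appear, and the first two are equivalent to the first two in (117.18). In a medium having no centre of symmetry, the
last two terms also occur, but they are zero if the polarizations $\mathbf e$ and $\mathbf e'$ are both linear. If the difference between
the frequencies $\omega$ and $\omega'$ is neglected in $\mathbf q$, then $\boldsymbol\nu = (\mathbf n' - \mathbf n)/|\mathbf n' - \mathbf n|$, where $\mathbf n = \mathbf k/k$, $\mathbf n' = \mathbf k'/k$. In this
approximation, the term in $f_7$ is identically zero, as may be shown by a somewhat lengthy calculation in which
each of the vectors $\mathbf e$ and $\mathbf e'$ is resolved into two components in the scattering plane and perpendicular to it.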

Problem 2. Determine the radiation from the motion of a fast particle with a speed below that of light in a 
light-scattering medium (S. P. Kapitza, 1960). 

Solution. The radiation in this case may be regarded as due to the scattering of the particle field by
fluctuations in the permittivity of the medium. The energy emitted per unit time from unit volume when the
field scattered is monochromatic may be written as $W = hS = hc\sqrt{\varepsilon}\,|\mathbf E|^2/8\pi$, where $S$ is given by (83.11) and $h$ is
the light extinction coefficient. In this form, the expression is valid for a field $\mathbf E$ of any origin.

The field of the moving particle has a continuous spectrum of frequencies. Hence, in order to obtain the
radiation in the frequency range $d\omega$ (from unit volume over the whole time of passage), we must replace $|\mathbf E|^2$ by
$2|\mathbf E_\omega(\mathbf r)|^2\,d\omega/2\pi$ (see Fields, §66), where $\mathbf E_\omega$ is a time Fourier component of the field. Integration over the volume
gives the frequency distribution of the total radiation:

$$\frac{hc\sqrt{\varepsilon}}{8\pi^2}\,d\omega\int|\mathbf E_\omega(\mathbf r)|^2\,dV.$$

For this expression to be valid, it is necessary that the field should change only slightly over atomic dimensions, or
more precisely over the correlation distance of the permittivity fluctuations in the medium. Moreover, in order to
neglect the frequency shift in scattering, the velocities of the molecules in the medium must be much less than the
particle velocity $v$.

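The field of the moving particle is given by (114.7) and (114.8). We have

$$\mathbf E = \int\mathbf E_{\mathbf k}\,e^{i(\mathbf k\cdot\mathbf r - \omega t)}\,d^3k/(2\pi)^3.$$

The field frequency in the particle motion is $\omega = vk_x$. Hence $d^3k = dk_x\,d^2q = v^{-1}\,d\omega\,d^2q$, so that

$$\mathbf E_\omega(\mathbf r) = \frac{1}{v}\int\mathbf E_{\mathbf k}\,e^{i\mathbf k\cdot\mathbf r}\,d^2q/(2\pi)^2,$$

with $\mathbf k\cdot\mathbf r = \omega x/v + \mathbf q\cdot\mathbf r$. From this,

$$\int|\mathbf E_\omega|^2\,dV = \frac{1}{v^2}\int\mathbf E_{\mathbf k}\cdot\mathbf E^*_{\mathbf k'}\,e^{i(\mathbf q - \mathbf q')\cdot\mathbf r}\,\frac{d^2q\,d^2q'\,dV}{(2\pi)^4}.$$

The integral over $dx$ gives just the particle path length $l$, and that over $dy\,dz$ gives the delta function $(2\pi)^2\delta(\mathbf q - \mathbf q')$.
Thus

$$\int|\mathbf E_\omega|^2\,dV = \frac{l}{v^2}\int|\mathbf E_{\mathbf k}|^2\,\frac{d^2q}{(2\pi)^2}.$$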

In this integral, the important values of $q$ are fairly large, such that

$$\omega/v \ll q \ll 1/a,$$

where $a$ denotes the atomic dimensions: in this range, the expression for $\mathbf E_{\mathbf k}$ reduces to

$$\mathbf E_{\mathbf k} \cong -i\mathbf q\,(4\pi e/\varepsilon q^2),$$

and the integral is logarithmically divergent. With logarithmic accuracy, the integral is to be cut off at limits
corresponding to those of the range mentioned. This gives the following final expression for the frequency
distribution of the radiation intensity per unit path:

$$dF = \frac{e^2ch}{\pi v^2\varepsilon^{3/2}}\,\ln\frac{v}{a\omega}\,d\omega, \tag{1}$$

with the extinction coefficient given by (119.5).

This radiation is analogous to transition radiation in being independent of the mass of the particle. For
comparable velocities (though these are on opposite sides of the limit $c/\sqrt{\varepsilon}$), the intensity is much less than that of
the Cherenkov radiation. For example, in gases, with $v \sim c$, a comparison of (1) and (115.3) shows that as a rough
estimate

$$dF/dF_{\rm Ch} \sim h\lambda \sim a^6/\lambda^3d^3,$$

where $d$ is the distance between molecules and $\lambda \sim c/\omega$; for $h$, we have used (120.4) with $n - 1 \sim N\alpha$, and $\alpha \sim a^3$ is
the polarizability of the molecule. An estimate for a liquid is obtained by putting $d \sim a$, which gives

$$dF/dF_{\rm Ch} \sim a^3/\lambda^3.$$


§120. Rayleigh scattering in gases and liquids 

Two types of scattering can be distinguished, depending on the change in frequency of
the light: (1) Raman scattering, which is the Raman-Landsberg-Mandel’shtam effect and
results in the appearance in the scattered light of lines whose frequency differs from that of
the incident light; (2) Rayleigh scattering, in which the frequency is essentially unchanged.

Raman scattering in gases results from a change, due to the incident light, in the 
vibrational, rotational or electronic state of the molecule. Rayleigh scattering, on the other 
hand, does not involve a change in the internal state of the molecule. In the limiting case of 
a rarefied gas, when the mean free path $l$ of the molecules is large compared with the
wavelength $\lambda$ of the light, scattering takes place independently at each molecule, and can be
discussed microscopically, using quantum mechanics.

Here we shall discuss the opposite limiting case, where $l \ll \lambda$,† and the Rayleigh
scattering in gases can be divided into two parts. One part is due to irregularities in the
orientation of the molecules (called fluctuations of anisotropy). The other part is scattering
by fluctuations in the gas density. The orientation of the molecules is entirely changed by a
few collisions, i.e. after a time of the order of the mean free time $\tau$. Hence the scattering by
fluctuations of anisotropy results in the appearance of a relatively broad line with its peak at
$\omega' = \omega$ and width $\sim \hbar/\tau$. The scattering by fluctuations of density gives a much sharper line
superposed on the other. As we shall see below, fluctuations of density in volumes $\sim \lambda^3$ are
of importance in the scattering of light with wavelength $\lambda$. Since these volumes are large,
the fluctuations in them occur comparatively slowly, and so the scattered line is narrow. In
what follows we shall regard this sharp line as being undisplaced.

The scattering by density fluctuations is scalar scattering; since the density $\rho$ is a scalar, so
is the change $\delta\varepsilon$ in the permittivity resulting from a change in $\rho$. The change in the
permittivity in fluctuations of anisotropy, on the other hand, is described by a symmetrical
tensor $\delta\varepsilon_{ik}$ with zero trace. The latter property follows from the fact that the effect must
vanish on averaging over all directions. Thus the scattering by anisotropy fluctuations is
symmetric scattering.

In liquids the situation is less simple. Raman scattering can arise only from a change in 
the vibrational or electronic state of the molecule; rotational Raman lines do not occur for 


† More precisely, the necessary condition is $l \ll \lambda/\sin\tfrac12\theta$, where $\theta$ is the scattering angle. This is because the
expression (119.7) involves the frequency only through the quantity $q$ (120.5).

scattering in liquids. The reason is that, because of the strong interaction between
molecules in a liquid, they cannot rotate freely so as to acquire discrete rotational energy
levels. The rotation of the molecules, therefore, like any motion in which their relative
position changes, contributes in a liquid only to the relatively broad scattering line at
$\omega' = \omega$, which in this case may be regarded as entirely the effect of Rayleigh scattering. The
relaxation time of such motions depends on the viscosity of the liquid. 

The possibility of separating from the total Rayleigh scattering in a liquid a part due to 
thermodynamic fluctuations (of density or temperature) depends on the magnitudes of the 
various relaxation times. It is necessary that the relaxation times of all processes of 
establishment of equilibrium in the liquid should be small in comparison with the times 
characterizing the fluctuations concerned. In such cases a narrow undisplaced line is 
observed against a less sharp background called the wing of the Rayleigh line. The 
undisplaced line is due to scalar scattering. The wing, however, does not in general 
correspond in liquids, as it does in gases, to purely symmetric scattering with no scalar part. 

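The angular distribution in the undisplaced line is given by the general expressions
(117.25) and (117.26) for scalar scattering. It is therefore sufficient to calculate the total
extinction coefficient. Substitution of $\delta\varepsilon_{ik} = \delta\varepsilon\,\delta_{ik}$ in (119.5) gives

$$h = \frac{\omega^4V}{6\pi c^4}\,\langle(\delta\varepsilon)^2\rangle_V. \tag{120.1}$$

If $\delta\rho$ and $\delta T$ are the changes in density and temperature, then

$$\delta\varepsilon = (\partial\varepsilon/\partial\rho)_T\,\delta\rho + (\partial\varepsilon/\partial T)_\rho\,\delta T.$$

According to the known results (see SP 1, §112), the fluctuations of density and
temperature are statistically independent ($\langle\delta\rho\,\delta T\rangle = 0$), and their mean squares are

$$\langle(\delta T)^2\rangle_V = T^2/\rho c_vV, \qquad \langle(\delta\rho)^2\rangle_V = (\rho T/V)(\partial\rho/\partial P)_T,$$

where $c_v$ is the specific heat per unit mass. Thus we have

$$h = \frac{\omega^4}{6\pi c^4}\left[\rho T\left(\frac{\partial\rho}{\partial P}\right)_T\left(\frac{\partial\varepsilon}{\partial\rho}\right)_T^2 + \frac{T^2}{\rho c_v}\left(\frac{\partial\varepsilon}{\partial T}\right)_\rho^2\right], \tag{120.2}$$

a formula first derived by A. Einstein (1910).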

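This can be expressed in terms of other thermodynamic derivatives. Taking as the
independent variables another pair of statistically independent quantities, the pressure $P$
and the entropy $s$ per unit mass, we write

$$\delta\varepsilon = (\partial\varepsilon/\partial P)_s\,\delta P + (\partial\varepsilon/\partial s)_P\,\delta s$$

and use the familiar expressions for the fluctuations:

$$\langle(\delta s)^2\rangle_V = c_p/\rho V, \qquad \langle(\delta P)^2\rangle_V = \rho Tu^2/V,$$

where $u$ is the adiabatic velocity of sound in the medium: $u^2 = (\partial P/\partial\rho)_s$. With the further
substitutions

$$\left(\frac{\partial\varepsilon}{\partial P}\right)_s = \frac{1}{u^2}\left(\frac{\partial\varepsilon}{\partial\rho}\right)_s, \qquad \left(\frac{\partial\varepsilon}{\partial s}\right)_P = \frac{T}{c_p}\left(\frac{\partial\varepsilon}{\partial T}\right)_P,$$

we obtain Einstein’s formula in the form

$$h = \frac{\omega^4}{6\pi c^4}\left[\frac{T^2}{\rho c_p}\left(\frac{\partial\varepsilon}{\partial T}\right)_P^2 + \frac{\rho T}{u^2}\left(\frac{\partial\varepsilon}{\partial\rho}\right)_s^2\right]. \tag{120.3}$$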


For gases formula (120.2) becomes much simpler. The permittivity of a gas (at optical
frequencies) is almost independent of temperature, and hence the term in $(\partial\varepsilon/\partial T)_\rho$ in the brackets
can be neglected. The density dependence is that $\varepsilon - 1$ is proportional to $\rho$, and hence

$$\rho(\partial\varepsilon/\partial\rho)_T = \varepsilon - 1 \cong 2(n - 1),$$

where $n = \sqrt{\varepsilon}$ is the refractive index. Since, from the equation of state of a perfect gas,
$(1/\rho)(\partial\rho/\partial P)_T = 1/NT$, where $N$ is the number of particles in unit volume, we find that

$$h = 2\omega^4(n - 1)^2/3\pi c^4N. \tag{120.4}$$


This formula was first derived by Rayleigh (1881). 
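
To get a feeling for the magnitudes involved, the following Python sketch evaluates Rayleigh's formula $h = 2\omega^4(n-1)^2/3\pi c^4N$ in CGS units. The function name and the sample values for air (refractivity, number density, wavelength) are illustrative assumptions, not data from the text.

```python
import math

C_LIGHT = 2.99792458e10  # cm/s


def rayleigh_extinction(wavelength_cm, n_minus_1, number_density_cm3):
    """Extinction coefficient h (1/cm) of a gas from h = 2 w^4 (n-1)^2 / (3 pi c^4 N)."""
    omega = 2.0 * math.pi * C_LIGHT / wavelength_cm
    return 2.0 * omega ** 4 * n_minus_1 ** 2 / (3.0 * math.pi * C_LIGHT ** 4 * number_density_cm3)


if __name__ == "__main__":
    # Assumed round values for air near normal conditions (illustrative only).
    h = rayleigh_extinction(wavelength_cm=5.0e-5, n_minus_1=2.9e-4, number_density_cm3=2.5e19)
    print("h =", h, "1/cm; attenuation length =", 1.0 / h / 1.0e5, "km")
```

The strong $\omega^4$ dependence means that halving the wavelength increases $h$ sixteenfold, other quantities being equal.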

Let us now examine the fine structure of the undisplaced line. This requires a 
consideration of the time variation of the fluctuations. In this respect, thermodynamic 
fluctuations fall into two classes. Adiabatic fluctuations of pressure in a fluid are 
propagated as undamped waves with the velocity of sound u; we here neglect the 
absorption of sound, since it causes only a broadening of the line (see below). Fluctuations 
of entropy at constant pressure, however, are not propagated relative to the fluid, and are 
damped only gradually as a result of thermal conduction. 

Because acoustic perturbations are propagated as waves, the time variation of the 
pressure fluctuations is correlated even over distances much greater than those between 
molecules. This fact was not important in calculating the total intensity (integrated over 
frequencies) of the scattering line, which is determined by the correlation between the 
fluctuations at different points at the same time, a correlation which extends only over 
short distances. The frequency distribution of the scattering intensity, however, is 
determined by the fluctuation correlation function for different times, and the presence of 
long-range correlation makes it necessary to retain the factor $e^{-i\mathbf q\cdot\mathbf r}$ in (119.7).

In an undamped sound wave, the frequency $\Omega$ and the wave vector $\mathbf q$ are related by $\Omega^2 = u^2q^2$.
Accordingly, the frequency resolution of the correlation function for pressure
fluctuations (and therefore for the corresponding permittivity fluctuations) consists of two
sharp lines at frequencies $\Omega = \pm qu$. The magnitude of the vector $\mathbf q = \mathbf k' - \mathbf k$ corresponding
to light scattering is related to the scattering angle $\theta$ (between $\mathbf k$ and $\mathbf k'$) by

$$q = |\mathbf k' - \mathbf k| \cong (2n\omega/c)\sin\tfrac12\theta; \tag{120.5}$$

since $\Omega = \omega' - \omega$ is small, we have put here $\omega' = \omega$. Denoting the corresponding value of $\Omega$
by $\Omega_0$, we therefore have

$$\Omega_0 = \pm(2n\omega u/c)\sin\tfrac12\theta. \tag{120.6}$$

Thus the scattering by pressure fluctuations results in the appearance of a doublet (called
the Mandel’shtam-Brillouin doublet) in which the distance $2|\Omega_0|$ between the components
depends on the angle of scattering (L. I. Mandel’shtam, 1918; L. Brillouin, 1922).†
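
The angular dependence of the doublet splitting can be sketched numerically. The Python fragment below evaluates $|\Omega_0| = (2n\omega u/c)\sin\tfrac12\theta$ for a given vacuum wavelength; the function names and the sample parameters (values of the order of those for water and green light) are assumptions chosen only for illustration.

```python
import math

C_LIGHT = 2.99792458e10  # cm/s


def brillouin_shift(wavelength_vac_cm, n, u_cm_s, theta_deg):
    """|Omega_0| in rad/s from Omega_0 = (2 n omega u / c) sin(theta/2)."""
    omega = 2.0 * math.pi * C_LIGHT / wavelength_vac_cm
    return (2.0 * n * omega * u_cm_s / C_LIGHT) * math.sin(math.radians(theta_deg) / 2.0)


def to_wavenumber(omega_rad_s):
    """Convert an angular frequency to a wavenumber Omega/(2 pi c) in 1/cm."""
    return omega_rad_s / (2.0 * math.pi * C_LIGHT)


if __name__ == "__main__":
    # Hypothetical liquid: n = 1.33, u = 1.48e5 cm/s, vacuum wavelength 5.32e-5 cm.
    for theta in (30.0, 90.0, 180.0):
        shift = brillouin_shift(5.32e-5, 1.33, 1.48e5, theta)
        print(theta, shift / (2.0 * math.pi), "Hz", to_wavenumber(shift), "1/cm")
```

The splitting vanishes for forward scattering and is greatest for backward scattering, in accordance with (120.6).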


† As an illustration, for the typical values $u = 1.5 \times 10^5$ cm/sec, $n = 1.5$, scattered light wavelength $\lambda \cong
5 \times 10^{-5}$ cm, scattering angle $\theta = 90°$, the width of the doublet is $\Omega_0/2\pi c = 0.05$ cm$^{-1}$. The Rayleigh line wing
width, however, may reach 200-150 cm$^{-1}$.

The fluctuations of entropy have zero frequency, as stated above, and so scattering by
them gives a further central line with $\Omega = 0$ (L. Landau and G. Placzek, 1933).†

Let us determine the intensity distribution of the undisplaced scattering between the
doublet and the central line. By the intensity of the doublet we mean the sum of those of its
components, i.e. twice that of either one separately.‡ The total extinction coefficient given
by (120.2) or (120.3) is $h = h_d + h_c$.

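Since the doublet lines are due to scattering by adiabatic pressure fluctuations, their
intensity is given by the second term in (120.3), which arises from these fluctuations. The
adiabatic derivative $(\partial\varepsilon/\partial\rho)_s$ can be related to the isothermal derivative by changing to the
variables $\rho$ and $T$:

$$\left(\frac{\partial\varepsilon}{\partial\rho}\right)_s = \left(\frac{\partial\varepsilon}{\partial\rho}\right)_T + \left(\frac{\partial\varepsilon}{\partial T}\right)_\rho\left(\frac{\partial T}{\partial\rho}\right)_s.$$

If the temperature dependence of $\varepsilon$ at constant density is neglected, then $(\partial\varepsilon/\partial\rho)_s
= (\partial\varepsilon/\partial\rho)_T$. To the same accuracy, in the total extinction coefficient written in the form (120.2) we may
neglect the term in $(\partial\varepsilon/\partial T)_\rho$; see Problem 1 for a calculation in which these approximations are
not made. Lastly, using a known thermodynamic formula for the ratio of the adiabatic
and isothermal compressibilities (see SP 1, (16.14)),

$$\left(\frac{\partial\rho}{\partial P}\right)_s = \frac{c_v}{c_p}\left(\frac{\partial\rho}{\partial P}\right)_T, \tag{120.7}$$

we obtain the Landau-Placzek formula for the doublet part of the total undisplaced line:

$$h_d/h = c_v/c_p. \tag{120.8}$$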
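
A minimal Python sketch of the Landau-Placzek result follows. It assumes only the relation stated above, that the doublet carries the fraction $c_v/c_p$ of the undisplaced line, so that the central line carries the remainder; the function names and the sample specific-heat ratio are illustrative.

```python
def doublet_fraction(c_p, c_v):
    """Fraction of the undisplaced line carried by the doublet, c_v/c_p."""
    return c_v / c_p


def central_to_doublet_ratio(c_p, c_v):
    """Ratio of central-line to doublet intensity, (c_p - c_v)/c_v."""
    return (c_p - c_v) / c_v


if __name__ == "__main__":
    # Hypothetical medium with c_p/c_v = 1.4 (only the ratio matters here).
    print(doublet_fraction(1.4, 1.0), central_to_doublet_ratio(1.4, 1.0))
```

Only the ratio of the specific heats enters, so any consistent units may be used for `c_p` and `c_v`.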


To determine the shape of the lines, it is necessary to consider the different-times
correlation function and take account of the dissipative processes which cause the damping
of the fluctuations. For the pressure fluctuations, these are viscosity and thermal
conduction. The Fourier components of the correlation function for adiabatic pressure
fluctuations are

$$(\delta P^2)_{\Omega\mathbf q} = \frac{\rho Tu^3\gamma}{(\Omega \mp qu)^2 + u^2\gamma^2}, \tag{120.9}$$

$$\gamma = \frac{q^2}{2\rho u}\left[\tfrac43\eta + \zeta + \kappa\left(\frac{1}{c_v} - \frac{1}{c_p}\right)\right]; \tag{120.10}$$

see SP 2, §89. The quantity $\gamma$ is the sound absorption coefficient per unit length; $\eta$ and $\zeta$ are
the viscosity coefficients and $\kappa$ the thermal conductivity of the medium (see FM, §77). The
intensity distribution in the line (in each doublet component), for a given direction of
scattering, is proportional to (120.9). Normalization to unity gives

$$\frac{\Gamma}{2\pi}\,\frac{d\Omega}{(\Omega - \Omega_0)^2 + \tfrac14\Gamma^2}, \tag{120.11}$$

† In superfluid liquid helium (the isotope $^4$He), entropy perturbations are propagated as weakly damped
vibrations called second sound waves, whose velocity $u_2$ is, however, much less than that of ordinary sound.

where $\Gamma = 2u\gamma$. This is called the dispersion form of the line, and $\Gamma$ is the line width. Taking $q$
from (120.5), we find

$$\Gamma = \frac{2n^2\omega^2}{\rho c^2}\,(1 - \cos\theta)\left[\tfrac43\eta + \zeta + \kappa\left(\frac{1}{c_v} - \frac{1}{c_p}\right)\right]. \tag{120.12}$$

Isobaric entropy fluctuations decay only by thermal conduction. Their correlation
function is

$$(\delta s^2)_{\Omega\mathbf q} = \frac{2c_p}{\rho}\,\frac{\chi q^2}{\Omega^2 + \chi^2q^4}, \tag{120.13}$$

where $\chi = \kappa/\rho c_p$ is the thermometric conductivity. The shape of the central line is given by
a similar dispersion formula (120.11) but with $\Omega_0 = 0$, and the line width is

$$\Gamma = 2\chi q^2 = (4\chi n^2\omega^2/c^2)(1 - \cos\theta). \tag{120.14}$$
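
The two line widths can be compared with a short Python sketch. It assumes the forms $\Gamma = 2\chi q^2$ for the central line and $\Gamma = 2u\gamma = (q^2/\rho)[\tfrac43\eta + \zeta + \kappa(1/c_v - 1/c_p)]$ for each doublet component, with $q$ taken from (120.5). The function names are illustrative, and the material constants must be supplied by the user in consistent CGS units (with specific heats per unit mass in the same temperature units as $\kappa$).

```python
import math

C_LIGHT = 2.99792458e10  # cm/s


def momentum_transfer(wavelength_vac_cm, n, theta_deg):
    """q = (2 n omega / c) sin(theta/2), in 1/cm."""
    omega = 2.0 * math.pi * C_LIGHT / wavelength_vac_cm
    return (2.0 * n * omega / C_LIGHT) * math.sin(math.radians(theta_deg) / 2.0)


def central_line_width(q, chi):
    """Width of the central line, Gamma = 2 chi q^2, in rad/s."""
    return 2.0 * chi * q ** 2


def doublet_line_width(q, rho, eta, zeta, kappa, c_v, c_p):
    """Width of each doublet component, Gamma = (q^2/rho)[4 eta/3 + zeta + kappa (1/c_v - 1/c_p)]."""
    return (q ** 2 / rho) * (4.0 * eta / 3.0 + zeta + kappa * (1.0 / c_v - 1.0 / c_p))


if __name__ == "__main__":
    q = momentum_transfer(5.0e-5, 1.5, 90.0)
    # chi = 1.0e-3 cm^2/s is a hypothetical value of the thermometric conductivity.
    print("q =", q, "1/cm; central width =", central_line_width(q, 1.0e-3), "rad/s")
```

Both widths are proportional to $q^2$, and therefore to $1 - \cos\theta$, so the lines narrow towards small scattering angles.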


As already mentioned at the beginning of this section, the above theory is applicable to 
scattering in a liquid if all the relaxation times in it are small compared with the times 
characterizing the fluctuations. It should be borne in mind that, in any liquid, there are 
relaxation times of various orders of magnitude. The most rapid relaxation process, 
apparently, is the decay of elastic stresses in the liquid. The corresponding Maxwellian
relaxation time is $\tau_M \sim \eta/G$, where $G$ is the modulus of rigidity. The reorientation of the
molecules, i.e. the decay of the anisotropy fluctuations, takes place less rapidly. The
corresponding Debye relaxation time is $\tau_D \sim \eta a^3/T$, where $a$ is the dimension of the
molecule; the difference between $\tau_M$ and $\tau_D$ is particularly large in liquids with large
molecules. Finally, various other slow relaxation processes leading to the dispersion of
sound are also possible (e.g. chemical reactions, slow transfer of energy to vibrational
degrees of freedom of the molecule). The important processes as regards scattering are
those for which $1/\tau$ is comparable with the frequency of the sound disturbances which
cause the scattering. We shall merely mention that, when the viscosity of the liquid is
sufficiently high, and so $\tau_M \gg 1/qu$, the liquid behaves as an amorphous solid with respect
to the scattering of light.


PROBLEMS 


Problem 1. Find an exact formula for the ratio of intensities of the central line and the doublet in the
undisplaced scattering line (I. L. Fabelinskiĭ, 1956).

Solution. As noted in the text, the second term in (120.3) gives the doublet intensity. The first term arises 
from isobaric entropy fluctuations, and therefore gives the intensity of the central line. Thus 



From the thermodynamic relations (120.7) and 



(see SP 1, (16.15)), we find 
In the Landau-Placzek approximation, the expression in the square brackets is unity.

Problem 2. Light is scattered in a gas whose molecules are linear, with polarizabilities $\alpha_\parallel$ and $\alpha_\perp$ along and
across the axis respectively. Determine the intensity resulting from the various types of scattering.

Solution. The total intensity of scattered light (for given vibrational and electronic states of the molecules)
includes the Rayleigh scattering and the rotational part of the Raman scattering. Since the scattering takes place
independently at the individual molecules of the gas, the total extinction coefficient is most simply obtained from
formula (92.4), by multiplying by the number of particles per unit volume $N$ and replacing $|\alpha V|^2$ by
$\tfrac{1}{3}\alpha_{ik}^2 = \tfrac{1}{3}(\alpha_\parallel^2 + 2\alpha_\perp^2)$:

$$h = \frac{8\pi\omega^4 N}{9c^4}(\alpha_\parallel^2 + 2\alpha_\perp^2); \tag{1}$$

the polarizabilities as defined here and in §92 differ by a factor $V$.

The undisplaced Rayleigh line is due to the scalar part of the polarizability, i.e. it is the same as if the
polarizability tensor of the molecule were $\tfrac{1}{3}(\alpha_\parallel + 2\alpha_\perp)\delta_{ik}$. The same formula, (92.4), therefore gives

$$h_{\mathrm{Ray}} = \frac{8\pi\omega^4 N}{27c^4}(\alpha_\parallel + 2\alpha_\perp)^2. \tag{2}$$

The difference $h - h_{\mathrm{Ray}}$ includes the background (scattering by anisotropy fluctuations) and the
rotational Raman scattering. In order to separate the former, we must first average the polarizability tensor of the
molecule with respect to rotation about some particular axis (perpendicular to the axis of the molecule). The
polarizability along the axis of rotation averaged in this way is evidently $\alpha_\perp$, and that along any direction in a
plane perpendicular to the axis of rotation is $\tfrac{1}{2}(\alpha_\parallel + \alpha_\perp)$. In other words, a molecule rotating about a given axis is
to be regarded as a particle for which the principal values of the polarizability tensor are $\alpha_\perp$, $\tfrac{1}{2}(\alpha_\parallel + \alpha_\perp)$, $\tfrac{1}{2}(\alpha_\parallel + \alpha_\perp)$.
Using these, we calculate the symmetrical tensor whose trace is zero, and then a procedure similar to
the derivation of formulae (1) and (2) gives

$$h_{\mathrm{backg}} = \frac{8\pi\omega^4 N}{27c^4}\,\frac{(\alpha_\parallel - \alpha_\perp)^2}{2}. \tag{3}$$

Finally, the intensity of the rotational Raman scattering is obtained by subtracting (2) and (3) from (1):

$$h_{\mathrm{Raman}} = \frac{8\pi\omega^4 N}{9c^4}\,\frac{(\alpha_\parallel - \alpha_\perp)^2}{2}.$$
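
The bookkeeping in this solution can be checked numerically. The sketch below is an illustration under stated assumptions, not part of the original solution: it measures every extinction coefficient in units of $(8\pi/3)(\omega/c)^4 N$, takes the orientation average to be one third of the sum of the squared principal values, and uses the hypothetical function names `mean_square` and `scattering_parts`.

```python
def mean_square(principal_values):
    """Orientation average (1/3) * sum(alpha_i**2) of a symmetric tensor,
    given its principal values (assumed replacement for |alpha V|^2)."""
    return sum(a * a for a in principal_values) / 3.0


def scattering_parts(a_par, a_perp):
    """Return (total, rayleigh, background, raman), each in units of
    (8*pi/3) * (omega/c)**4 * N."""
    total = mean_square([a_par, a_perp, a_perp])
    scalar = (a_par + 2.0 * a_perp) / 3.0           # scalar part of the tensor
    rayleigh = mean_square([scalar, scalar, scalar])
    # Molecule averaged over rotation about an axis perpendicular to its own axis.
    averaged = [a_perp, 0.5 * (a_par + a_perp), 0.5 * (a_par + a_perp)]
    traceless = [a - scalar for a in averaged]       # remove the scalar part
    background = mean_square(traceless)
    raman = total - rayleigh - background
    return total, rayleigh, background, raman


if __name__ == "__main__":
    print(scattering_parts(3.0, 1.0))
```

In these units the background and Raman parts come out as $(\alpha_\parallel - \alpha_\perp)^2/18$ and $(\alpha_\parallel - \alpha_\perp)^2/6$, in agreement with the coefficients of formula (3) and of the final expression.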


§121. Critical opalescence 

The isothermal compressibility $(\partial\rho/\partial P)_T$ increases without limit as the critical point is
approached. The expression (120.2) for the total intensity due to Rayleigh scattering
therefore increases also. This indicates a marked increase in scattering near the critical
point, called critical opalescence.† The formula (120.2) itself is, however, inapplicable,
because near the critical point the single-time correlation between the density fluctuations
(and therefore the permittivity fluctuations) at different points in space extends to a
distance of the order of the correlation radius $r_c$, which increases without limit as the
critical point is approached (see SP 1, §§152, 153). Here, therefore, we cannot in general
replace the factor $e^{-i\mathbf{q}\cdot\mathbf{r}}$ in (119.9) by unity, even when calculating the total scattered
intensity (and not just when calculating its spectral fine structure).


† The idea that this phenomenon is due to an increase in the density fluctuations was put forward by
M. Smoluchowski (1908). In relation to the van der Waals theory of the critical point (see SP 1, §152) it was
discussed by L. S. Ornstein and F. Zernike (1914).
This replacement is valid for all scattering angles only if

$$kr_c \ll 1, \tag{121.1}$$

where $k = n\omega/c$ is the wave number of the scattered light. In that case, we can again use
(120.2) for the total extinction coefficient, and it is sufficient to retain just the increasing
first term (only the density fluctuations increase, not the temperature fluctuations). As the
critical point is approached from any direction in the $\rho T$-plane except along the critical
isotherm $T = T_c$, the compressibility increases according to†

$$(\partial\rho/\partial P)_T \propto |T - T_c|^{-\gamma}, \qquad \gamma \approx 1.26.$$

Since there is no reason for $(\partial\varepsilon/\partial\rho)_T$ to become zero or infinite, the scattering intensity
behaves similarly:

$$h \propto |T - T_c|^{-\gamma}. \tag{121.2}$$

This increase involves an increased intensity of only the central component of the Rayleigh
line, not of all the fine-structure components. From (120.8), $h_{\mathrm{doublet}} = hc_v/c_p$. The factor $c_p$ in
the denominator compensates the factor $(\partial\rho/\partial P)_T$ in $h$, since they increase in the same
manner. The doublet intensity therefore increases only as $c_v$, i.e. much more slowly‡:

$$h_{\mathrm{doublet}} \propto |T - T_c|^{-\alpha}, \qquad \alpha \approx 0.1. \tag{121.3}$$

Sufficiently close to the critical point, the inequality (121.1) for a given $k$ is certainly
violated, and it is no longer permissible to replace the factor $e^{-i\mathbf{q}\cdot\mathbf{r}}$ by unity; this occurs
later for small scattering angles.§ Let $dh$ be the differential extinction coefficient for
scattering into a solid-angle element $do'$, i.e. for a given value of $\mathbf{q} = \mathbf{k}' - \mathbf{k}$. Considering,
for definiteness, the scattering of unpolarized light, and using the result that the angular
dependence related to the polarization state of the light in the case of scattering by scalar
fluctuations is given by the factor in (117.26), we have

$$dh = \frac{\omega^4}{32\pi^2c^4}(1 + \cos^2\vartheta)(\delta\varepsilon^2)_{\mathbf{q}}\,do', \tag{121.4}$$

with $(\delta\varepsilon^2)_{\mathbf{q}} = (\partial\varepsilon/\partial\rho)_T^2(\delta\rho^2)_{\mathbf{q}}$.

In the immediate neighbourhood of the critical point, where

$$kr_c \gg 1, \tag{121.5}$$

the inequality $qr_c \gg 1$ will also hold for not too small scattering angles. In this angle range, the important
distances in the integral $(\delta\rho^2)_{\mathbf{q}}$ are $r \sim 1/q \ll r_c$, where the density fluctuation correlation
function follows a power law‖:

$$\langle\delta\rho(\mathbf{r}_1)\,\delta\rho(\mathbf{r}_2)\rangle \propto r^{-(1+\zeta)}, \qquad \zeta \approx 0.04.$$


† All the results used in this section concerning the variation of thermodynamic quantities near the critical
point are given in SP 1, §153. The notation for the critical indices is the same as there.

‡ All these arguments presuppose, of course, that the inequality (121.1) is compatible with the assumption that
the substance is already in the “fluctuation region” of the neighbourhood of the critical point.

§ It is, however, meaningful to consider scattering itself only down to angles of the order of the diffraction
angle $\sim \lambda/L$, where $L$ is the linear size of the body.

‖ See SP 1, (148.7). Because of the formal equivalence between critical-point problems and problems of
second-order phase transitions (with a one-dimensional order parameter), this formula applies near a critical
point also.
The Fourier component of the correlation function is then

$$(\delta\rho^2)_{\mathbf{q}} \propto q^{-2+\zeta}, \tag{121.6}$$

with a coefficient that is independent of the temperature. Thus we arrive at the following
angle and frequency dependence of the extinction coefficient in the range under
consideration:

$$dh \propto \omega^{2+\zeta}\,\frac{1 + \cos^2\vartheta}{(1 - \cos\vartheta)^{1-\zeta/2}}\,do'. \tag{121.7}$$

For a fixed scattering angle in the range $qr_c \gg 1$, the intensity ceases to increase as the
critical point is approached. At the critical point itself, (121.7) is valid for all angles.
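
The angular part of (121.7) is easy to tabulate. The sketch below is only an illustration: the function name `critical_angular_factor` is hypothetical, the overall constant and the factor $\omega^{2+\zeta}$ are omitted, and the default $\zeta = 0.04$ is the value quoted above.

```python
import math


def critical_angular_factor(theta, zeta=0.04):
    """Angular part of (121.7): (1 + cos^2 theta) / (1 - cos theta)**(1 - zeta/2).

    The frequency factor omega**(2 + zeta) and the constant prefactor are omitted.
    """
    c = math.cos(theta)
    return (1.0 + c * c) / (1.0 - c) ** (1.0 - zeta / 2.0)


if __name__ == "__main__":
    for degrees in (5, 30, 90, 180):
        theta = math.radians(degrees)
        print(degrees, critical_angular_factor(theta))
```

The factor grows roughly as $\vartheta^{-2+\zeta}$ at small angles, which is the strong forward peak characteristic of critical opalescence.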


§122. Scattering in liquid crystals 

In liquid crystals there is strong scattering of light, similar in several respects to critical 
opalescence. Here we shall discuss the phenomenon only in nematic-type liquid crystals 
(P. G. de Gennes, 1968). 

Such crystals have already been mentioned at the end of §17, where an expression was
given for their permittivity tensor:

$$\varepsilon_{ik} = \varepsilon_0(\omega)\delta_{ik} + \varepsilon_a(\omega)d_id_k; \tag{122.1}$$

$\varepsilon_0$ and $\varepsilon_a$ are functions of the frequency.

The permittivity fluctuations are due firstly to fluctuations in the direction of the
director $\mathbf{d}$, and are therefore anisotropy fluctuations. A simultaneous equal rotation of this
direction at all points leaves the energy of the body unchanged; the long-wavelength
fluctuations thus involve only small energy losses, and are therefore large. Large
fluctuations of the permittivity lead in turn to strong scattering of light.

The fluctuating quantity $\mathbf{d}$ may be expressed as

$$\mathbf{d} = \mathbf{d}_0 + \boldsymbol{\nu}, \tag{122.2}$$

where $\mathbf{d}_0$ is its constant mean value and $\boldsymbol{\nu}$ a small change in the fluctuation; since
$\mathbf{d}^2 = \mathbf{d}_0^2 = 1$, and $\boldsymbol{\nu}$ is small, we have

$$\boldsymbol{\nu}\cdot\mathbf{d}_0 = 0. \tag{122.3}$$

The change in the permittivity in the fluctuation is given in terms of $\boldsymbol{\nu}$ by

$$\delta\varepsilon_{ik} = \varepsilon_a(d_{0i}\nu_k + d_{0k}\nu_i). \tag{122.4}$$

The influence of the density and temperature fluctuations on $\varepsilon_{ik}$ is negligible.

Using (122.4), we express the single-time correlation function of the permittivity
fluctuations in terms of that of the director fluctuations:

$$(\delta\varepsilon_{il}\,\delta\varepsilon_{km})_{\mathbf{q}} = \varepsilon_a^2\{d_{0i}d_{0k}(\nu_l\nu_m)_{\mathbf{q}} + d_{0i}d_{0m}(\nu_k\nu_l)_{\mathbf{q}} + d_{0k}d_{0l}(\nu_i\nu_m)_{\mathbf{q}} + d_{0l}d_{0m}(\nu_i\nu_k)_{\mathbf{q}}\}. \tag{122.5}$$

The function $(\nu_i\nu_k)_{\mathbf{q}}$ has been derived in SP 1, §141. From (122.3), this tensor has non-zero
components only in the plane perpendicular to $\mathbf{d}_0$. Denoting the tensor suffixes in this
plane by $\alpha$ and $\beta$, we have

$$(\nu_\alpha\nu_\beta)_{\mathbf{q}} = a(\delta_{\alpha\beta} - q_\alpha q_\beta/q_\perp^2) + bq_\alpha q_\beta/q_\perp^2, \tag{122.6}$$

$$a = T/(a_2q_\perp^2 + a_3q_\parallel^2), \qquad b = T/(a_1q_\perp^2 + a_3q_\parallel^2),$$

where $a_1$, $a_2$, $a_3$ are positive moduli giving the dependence of the free energy of a nematic
crystal on the derivatives of the director, and $q_\parallel$ and $q_\perp$ are the components of $\mathbf{q}$ along $\mathbf{d}_0$ and in the
plane perpendicular to it.

If we assume that the anisotropy of the permittivity is relatively small, i.e. $|\varepsilon_a| \ll \varepsilon_0$, we
can use for the differential extinction coefficient (with respect to directions) formula
(119.9), derived for isotropic media (but without putting $\mathbf{q} = 0$, of course). We shall not
give here the resulting lengthy expression for the angular distribution and the polarization
dependences.

A characteristic feature of this scattering is that its intensity increases at low scattering
angles according to

$$dh \propto do'/q^2. \tag{122.7}$$

This large increase, like that given by (121.7) at the critical point, is due to the slow (power-law)
decrease of the fluctuation correlation function with increasing distance. The angle
integral diverges logarithmically at the lower limit. It has to be cut off at values of $\vartheta \sim \lambda/L$,
corresponding to diffraction by the body as a whole (cf. the fourth footnote to §121), and
the total scattering intensity thus varies logarithmically with the size $L$ of the body.
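
The logarithmic dependence can be made explicit in a short worked estimate. This is a sketch which assumes that the angular and polarization prefactor multiplying $do'/q^2$ may be treated as constant at small angles, and which uses $q^2 = 2k^2(1 - \cos\vartheta)$ and $do' = 2\pi\sin\vartheta\,d\vartheta$; the cut-off angle $\vartheta_{\min}$ is the diffraction angle mentioned above.

```latex
\int \frac{do'}{q^2}
  = \frac{\pi}{k^2}\int_{\vartheta_{\min}}^{\pi}
      \frac{\sin\vartheta\,d\vartheta}{1-\cos\vartheta}
  = \frac{\pi}{k^2}\,\ln\frac{2}{1-\cos\vartheta_{\min}}
  \approx \frac{2\pi}{k^2}\,\ln\frac{2}{\vartheta_{\min}},
\qquad \vartheta_{\min}\sim\lambda/L .
```

With $\vartheta_{\min} \sim \lambda/L$ the integral grows as $\ln(L/\lambda)$, which is the logarithmic variation with the size of the body.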

It may be noted, finally, that an external magnetic field restricts the increase of the
correlation function $(\nu_i\nu_k)_{\mathbf{q}}$ for small $q$ (see SP 1, §141), and thus suppresses scattering,
making the liquid crystal transparent.

§123. Scattering in amorphous solids 

Rayleigh scattering in amorphous solids† differs considerably from that in fluids. In an
isotropic solid there are two velocities of propagation of sound, $u_l$ (longitudinal) and $u_t$
(transverse). The fine structure of the Rayleigh line therefore includes not one but two
Mandel’shtam-Brillouin doublets. They are due to scattering by transverse and longitudinal
“sound waves”, and their distances from the centre of the line are respectively
$\Omega_t = \pm u_tq$ and $\Omega_l = \pm u_lq$. Since $u_l > u_t$, it follows that $|\Omega_l| > |\Omega_t|$. The central component
of the line is again due to scattering by fluctuations which are not propagated relative to
the medium. In this case the main fluctuations of the latter type are those of structure. In an
amorphous body, where the atoms are not arranged in an ordered manner, these
fluctuations are comparatively large and vary only slowly with time (on account of the
extreme slowness of the diffusion processes in a solid). Scattering by these fluctuations
leads to a strong line whose width is almost zero. As regards polarization and angular
distribution, this line results from a superposition of scalar and symmetric scattering.

Next, let us consider the doublet components of the Rayleigh line in amorphous solids. 
In a solid, the influence of any deformation (in this case, fluctuations) extends to 
considerable distances. Hence even the single-time fluctuations at different points in the 
body are correlated over distances large compared with 1Z^. We thus have again a situation 


where, even in calculating the total intensity (and the polarization) of the scattered light,
we cannot put $\mathbf{q} = 0$ in the fluctuation correlation function.


† The considerably more complex theory of scattering in solid crystals will not be discussed in this book.

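The field in the scattered wave is

$$\mathbf{E}' = -\frac{\omega^2\exp(ikR_0)}{4\pi R_0c^2}\,E_0\,\mathbf{n}'\times(\mathbf{n}'\times\mathbf{G}), \tag{123.1}$$

where

$$G_i = \int\delta\varepsilon_{ik}\exp(-i\mathbf{q}\cdot\mathbf{r})\,dV\cdot e_k, \tag{123.2}$$

and $\mathbf{n}'$ is a unit vector in the direction of scattering. The change in the permittivity resulting
from the deformation of an isotropic body is

$$\delta\varepsilon_{ik} = a_1u_{ik} + a_2u_{ll}\delta_{ik}, \tag{123.3}$$

where $u_{ik}$ is the strain tensor (see (102.1)). Since the integral (123.2) isolates from $\delta\varepsilon_{ik}$ the
Fourier space component with wave vector $\mathbf{q}$, $u_{ik}$ in (123.3) must be taken as the
deformation in a sound wave with this wave vector. We therefore write the displacement
vector as

$$\mathbf{u} = \mathrm{re}\{\mathbf{u}_0\exp(i\mathbf{q}\cdot\mathbf{r})\} = \tfrac{1}{2}[\mathbf{u}_0\exp(i\mathbf{q}\cdot\mathbf{r}) + \mathbf{u}_0^*\exp(-i\mathbf{q}\cdot\mathbf{r})], \tag{123.4}$$

whence the strain tensor is

$$u_{ik} = \frac{1}{2}\left(\frac{\partial u_i}{\partial x_k} + \frac{\partial u_k}{\partial x_i}\right) = \mathrm{re}\{\tfrac{1}{2}i(u_{0i}q_k + u_{0k}q_i)\exp(i\mathbf{q}\cdot\mathbf{r})\},$$

and the volume integral is

$$\int u_{ik}\exp(-i\mathbf{q}\cdot\mathbf{r})\,dV = \tfrac{1}{4}iV(u_{0i}q_k + u_{0k}q_i). \tag{123.5}$$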

Let us first consider scattering by transverse sound waves. Since in a transverse wave $\mathbf{u}$ is
perpendicular to $\mathbf{q}$, $u_{ll} = 0$ and $\delta\varepsilon_{ik} = a_1u_{ik}$. Using (123.5), we therefore have

$$\mathbf{G} = \tfrac{1}{4}iVa_1\{\mathbf{u}_0(\mathbf{q}\cdot\mathbf{e}) + \mathbf{q}(\mathbf{u}_0\cdot\mathbf{e})\}. \tag{123.6}$$

A transverse sound wave can have two independent directions of polarization; the vector $\mathbf{u}$
may be in the plane of $\mathbf{k}$ and $\mathbf{k}'$, or perpendicular to that plane. Since $\mathbf{E}$ is perpendicular to
$\mathbf{k}$, it is easy to see that in the first case the component of $\mathbf{G}$ in the plane perpendicular to $\mathbf{k}'$ is
zero. Thus transverse sound waves polarized in the plane of $\mathbf{k}$ and $\mathbf{k}'$ do not scatter light.

If the vector $\mathbf{u}$ is perpendicular to the plane of $\mathbf{k}$ and $\mathbf{k}'$, a simple calculation, using
(123.1) and (123.6), gives for the field in the scattered wave

$$E'_\parallel = \frac{\omega^2\exp(ikR_0)}{4\pi R_0c^2}\cdot\tfrac{1}{4}iVa_1u_0q\cos\tfrac{1}{2}\vartheta\;E_0e_\perp, \qquad E'_\perp = \frac{\omega^2\exp(ikR_0)}{4\pi R_0c^2}\cdot\tfrac{1}{4}iVa_1u_0q\cos\tfrac{1}{2}\vartheta\;E_0e_\parallel. \tag{123.7}$$

Here $\vartheta$ is, as usual, the angle between $\mathbf{k}$ and $\mathbf{k}'$, and the suffixes $\parallel$ and $\perp$ denote components
in the plane of scattering and perpendicular to that plane. The coefficient of proportionality
in these two formulae involves the same fluctuation $u_0$. This means that no depolarization
occurs on scattering: linearly polarized light remains so (though it is polarized in a
different plane).
Since the coefficients in formulae (123.7) are exactly the same, the extinction coefficient
$dh$ does not depend on the state of polarization of the incident light, and is

$$dh = \left(\frac{a_1\omega^2q}{16\pi c^2}\right)^2V|u_0|^2\cos^2\tfrac{1}{2}\vartheta\,do'. \tag{123.8}$$

It remains to determine the mean square amplitude of the fluctuation $u_0$.

From the point of view of the general theory of thermodynamic fluctuations, the sound
wave (123.4) may be regarded as a combination of two classical oscillators (waves
propagated to the right and to the left), each having a mean kinetic energy $\tfrac{1}{2}T$. Since the
frequency of the oscillations is here $\Omega = qu_t$, the mean kinetic energy is
$\tfrac{1}{2}\rho V\overline{\dot{\mathbf{u}}^2} = \tfrac{1}{4}V\rho(u_tq)^2\langle|u_0|^2\rangle$. Equating this to $2\cdot\tfrac{1}{2}T$, we have

$$\langle|u_0|^2\rangle = 4T/V\rho u_t^2q^2. \tag{123.9}$$

Substituting this in (123.8), we find

$$dh = \left(\frac{a_1\omega^2}{8\pi c^2}\right)^2\frac{T}{\rho u_t^2}\cos^2\tfrac{1}{2}\vartheta\,do'.$$

The angular dependence of the scattering is totally different from that which occurs in
fluids.

Let us now consider scattering by longitudinal sound waves. In these waves $\mathbf{u}$ is parallel
to $\mathbf{q}$, and from (123.3) and (123.4) we find

$$\mathbf{G} = \tfrac{1}{2}iVu_0\{a_1\mathbf{q}(\mathbf{q}\cdot\mathbf{e})/q + a_2q\mathbf{e}\}.$$

A simple calculation gives for the field in the scattered wave

$$E'_\parallel = \frac{\omega^2\exp(ikR_0)}{4\pi R_0c^2}\cdot\tfrac{1}{2}iVu_0q[\tfrac{1}{2}a_1 + (\tfrac{1}{2}a_1 + a_2)\cos\vartheta]E_0e_\parallel, \qquad E'_\perp = \frac{\omega^2\exp(ikR_0)}{4\pi R_0c^2}\cdot\tfrac{1}{2}iVu_0qa_2E_0e_\perp.$$

In this case also there is no depolarization on scattering. The angular distribution and the
extinction coefficient, however, depend on the state and direction of the polarization of the
incident light. We shall not pause to write out the relevant formulae, which are somewhat
cumbersome. The calculations are wholly similar to those given above, and the expression
for $\langle|u_0|^2\rangle$ differs only in that $u_t$ is replaced by $u_l$ in (123.9).



CHAPTER XVI 


DIFFRACTION OF X-RAYS IN CRYSTALS 


§124. The general theory of X-ray diffraction 

The phenomenon of X-ray diffraction in crystals occupies a special place in the 
electrodynamics of matter, since the wavelengths concerned are comparable with the 
interatomic distances. For this reason the usual macroscopic approach to matter as a 
continuous medium is entirely invalid, and we must begin by considering scattering by 
individual charged particles, and essentially by electrons; the scattering by nuclei is 
unimportant, because of their greater mass. 

The frequencies of the motion of electrons in the atom are of order $\omega_0 \sim v/a$, where $v$ is
their velocity and $a$ the dimension of the atom. If $\lambda \sim a$, then, since $v \ll c$, these frequencies
are small compared with the X-ray frequency $\omega \sim c/\lambda$. This makes it possible to write the
equation of motion of an electron in the field of the electromagnetic wave as

$$m\dot{\mathbf{v}}' = e\mathbf{E}, \tag{124.1}$$

i.e. the electrons may be regarded as free (see §78).

From (124.1) we find the additional velocity acquired by the electron under the action of
the wave field: $\mathbf{v}' = ie\mathbf{E}/m\omega$.

Let $n(\mathbf{r})$ be the number density of electrons in a crystal, averaged over the quantum-mechanical
states of the electrons and over the statistical distribution of the thermal
motion of the nuclei in the lattice. It should be emphasized that the usual macroscopic
averaging over physically infinitesimal volume elements is not included, i.e. $n(\mathbf{r})$ is the
actual quantum density of the electrons in the crystal lattice. The corresponding current
density due to the wave field is

$$\mathbf{j}' = en\mathbf{v}' = ie^2n\mathbf{E}/m\omega. \tag{124.2}$$

We substitute this current in the microscopic Maxwell’s equations:

$$\mathrm{curl}\,\mathbf{E} = i\omega\mathbf{H}/c, \tag{124.3}$$

$$\mathrm{curl}\,\mathbf{H} = -i\omega\mathbf{E}/c + 4\pi\mathbf{j}'/c. \tag{124.4}$$

We thereby take account of its reciprocal effect on the field, i.e. scattering. It is, of course,
assumed that this effect is small, i.e. that the inequality

$$4\pi e^2n/m\omega^2 \ll 1 \tag{124.5}$$

holds. Putting $\mathbf{D} = \varepsilon\mathbf{E}$, where

$$\varepsilon = 1 - 4\pi e^2n/m\omega^2, \tag{124.6}$$

in accordance with the usual definition of the induction, we reduce equation (124.4) to the
usual form $\mathrm{curl}\,\mathbf{H} = -i\omega\mathbf{D}/c$. Thus, in this sense, the expression (124.6) for the
permittivity (cf. (78.1)) can be used even for wavelengths $\lambda \sim a$, though it must of course be
remembered that the symbols $\mathbf{E}$ and $\mathbf{D}$ no longer retain their previous meanings; they now
pertain to the field which has not been averaged over physically infinitesimal volumes, and
$\varepsilon$ is accordingly a function of the coordinates.

In the scattering of X-rays by heavy atoms it may happen that the condition $\omega \gg \omega_0$ is
fulfilled for the outer electron shells but not for the inner ones, where $\omega \lesssim \omega_0$ and so the
inequality $\lambda \gg a$ holds. In this case the permittivity can still be regarded as the coefficient of
proportionality between $\mathbf{D}$ and $\mathbf{E}$, but the formula corresponding to (124.6) gives only the
contribution of the outer electrons. That of the inner electrons must in principle be
calculated by averaging over the volume of their shells. Thus, if we put $\mathbf{D} = \varepsilon\mathbf{E}$ with $\varepsilon$ a
function of the coordinates, all possible cases are allowed for. In what follows we shall, for
definiteness, use the expression (124.6).

In effecting the averaging of the electron density in (124.2) to obtain $n(\mathbf{r})$ independent of
time, we exclude a possible change of frequency on scattering. That is, we consider only
strictly coherent scattering, with no change in frequency.

Eliminating $\mathbf{H}$ from the two equations (124.3) and (124.4), we obtain $\mathrm{curl}\,\mathrm{curl}\,\mathbf{E}
= \omega^2\mathbf{D}/c^2$. Here we substitute $\mathbf{E} = \mathbf{D} + 4\pi e^2n\mathbf{E}/m\omega^2$ and expand the expression
$\mathrm{curl}\,\mathrm{curl}\,\mathbf{E}$, using the fact that $\mathrm{div}\,\mathbf{D} = 0$, as follows from (124.4). Then

$$\Delta\mathbf{D} + \omega^2\mathbf{D}/c^2 = \mathrm{curl}\,\mathrm{curl}\,(4\pi e^2n\mathbf{E}/m\omega^2). \tag{124.7}$$

On the right-hand side of this equation, which already contains the small quantity
$4\pi e^2n/m\omega^2$, $\mathbf{E}$ must be taken as the given field of the incident wave. Let us find the
solution of equation (124.7) in the region outside the scattering crystal and at large
distances from it. Since this equation is of the same form as equation (117.3), the required
solution is obtained immediately by analogy with (117.4):†

$$\mathbf{E} = \frac{e^2}{m\omega^2}\,\frac{\exp(ikR_0)}{R_0}\,\mathbf{k}'\times(\mathbf{k}'\times\mathbf{E}_0)\int n\exp(-i\mathbf{q}\cdot\mathbf{r})\,dV. \tag{124.8}$$

Here $R_0$ is the distance from the origin, which is within the crystal, to the point considered;
$\mathbf{q} = \mathbf{k}' - \mathbf{k}$; $k = k' = \omega/c$; $\mathbf{E}_0$ is the amplitude of the incident wave. We put $\mathbf{E}$ instead of $\mathbf{D}$
on the left-hand side because the two are equal in the vacuum outside the crystal.

To characterize the intensity of X-ray diffraction we use an effective cross-section $\sigma$,
defined as the ratio of the intensity diffracted into a solid angle $do'$ to the energy flux
density in the incident wave. By (124.8) we have

$$d\sigma = \left(\frac{e^2}{mc^2}\right)^2\sin^2\theta\left|\int n\exp(-i\mathbf{q}\cdot\mathbf{r})\,dV\right|^2do', \tag{124.9}$$

where $\theta$ is the angle between $\mathbf{E}_0$ and $\mathbf{k}'$. If the incident radiation is “natural” (not polarized),
the factor $\sin^2\theta$ in this formula becomes $\tfrac{1}{2}(1 + \cos^2\vartheta)$, where $\vartheta$ is the angle between $\mathbf{k}$ and
$\mathbf{k}'$ (see the last footnote to §117):

$$d\sigma = \frac{1}{2}\left(\frac{e^2}{mc^2}\right)^2(1 + \cos^2\vartheta)\left|\int n\exp(-i\mathbf{q}\cdot\mathbf{r})\,dV\right|^2do'. \tag{124.10}$$

In what follows we shall, for definiteness, consider this particular case.


† In solving equation (117.3) it was not possible to consider the field outside the body, since the boundary
conditions on the surface would have had to be taken into account (the quantity $\varepsilon$ on the left-hand side being
different inside and outside the body). The left-hand side of equation (124.7), however, is the same in all space.

We see that the intensity of radiation diffracted in a given direction is essentially
proportional to the squared modulus of the integral

$$\int n\exp(-i\mathbf{q}\cdot\mathbf{r})\,dV, \tag{124.11}$$

i.e. the Fourier space component of the electron density. As $q \to 0$ this integral becomes
simply the electron density $\bar{n}$ averaged over a lattice cell. If $n$ is replaced by $\bar{n}$ in equations
(124.3) and (124.4), we obtain the usual macroscopic Maxwell’s equations, with
permittivity $\varepsilon(\omega) = 1 - 4\pi e^2\bar{n}/m\omega^2$. According to these equations, when X-rays pass
through a crystal they are refracted according to the ordinary laws of refraction, with
refractive index $\sqrt{\varepsilon}$. Thus diffraction through small angles amounts to ordinary
refraction, which is of no interest here. In what follows we shall always assume that $q$ is
appreciably different from zero.

The electron density, like any function of position in a crystal lattice, can be expanded as
a Fourier series:

$$n = \sum_{\mathbf{b}}n_{\mathbf{b}}\exp(i\mathbf{b}\cdot\mathbf{r}), \tag{124.12}$$

where the summation is taken over all periods $\mathbf{b}$ of the reciprocal lattice (see SP 1, §133).
When (124.12) is substituted in (124.11) and the result is integrated over the volume of the
crystal, we obtain practically zero except for values of $\mathbf{q}$ close to some $\mathbf{b}$. Between these
values the intensity is negligible. We can therefore consider each diffraction maximum
separately, putting

$$n = n_{\mathbf{b}}\exp(i\mathbf{b}\cdot\mathbf{r})$$

with the appropriate value of $\mathbf{b}$. Substitution in (124.10) gives

$$d\sigma = \frac{1}{2}\left(\frac{e^2}{mc^2}\right)^2(1 + \cos^2\vartheta)|n_{\mathbf{b}}|^2\left|\int\exp[-i(\mathbf{k}' - \mathbf{k} - \mathbf{b})\cdot\mathbf{r}]\,dV\right|^2do'. \tag{124.13}$$

The strongest maxima occur in directions for which the equation

$$\mathbf{k}' - \mathbf{k} = \mathbf{b} \tag{124.14}$$

(Laue's equation) is exactly satisfied, and are called principal maxima. For given $\mathbf{b}$, however,
a principal maximum does not occur for an arbitrary direction and frequency of the
incident radiation. If the equation (124.14) is written as $\mathbf{k}' = \mathbf{k} + \mathbf{b}$ and squared, and we use
the fact that $k^2 = k'^2$, we have

$$2\mathbf{b}\cdot\mathbf{k} = -b^2. \tag{124.15}$$

This equation determines the values of the wave vector $\mathbf{k}$ for which principal maxima occur
with the given value of $\mathbf{b}$. Geometrically, equation (124.15) represents a plane in
$\mathbf{k}$-space perpendicular to the vector $\mathbf{b}$ at a distance $\tfrac{1}{2}b$ from the origin. In particular, we see
that $k \geq \tfrac{1}{2}b$.

Since $|\mathbf{k}' - \mathbf{k}| = 2k\sin\tfrac{1}{2}\vartheta$, it follows from (124.14) that

$$2k\sin\tfrac{1}{2}\vartheta = b \tag{124.16}$$

(Bragg and Vulf's equation), which determines the angle of diffraction at the principal
maximum.
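
As a small illustration of (124.16) together with the condition $k \geq \tfrac12 b$, the sketch below returns the diffraction angle for given magnitudes $k$ and $b$. The function name `bragg_scattering_angle` and the convention of returning `None` when no principal maximum exists are assumptions made for this example.

```python
import math


def bragg_scattering_angle(k, b):
    """Angle theta between k and k' from 2*k*sin(theta/2) = b, eq. (124.16).

    Returns None when k < b/2, i.e. when no principal maximum is possible
    for this reciprocal lattice vector.
    """
    if k < b / 2.0:
        return None
    return 2.0 * math.asin(b / (2.0 * k))


if __name__ == "__main__":
    b = 2.0  # magnitude of the reciprocal lattice vector, arbitrary units
    for k in (0.8, 1.0, 2.0, 10.0):
        print(k, bragg_scattering_angle(k, b))
```

For $k = \tfrac12 b$ the wave is reflected straight back ($\vartheta = \pi$), and for $k \gg b$ the angle tends to $b/k$.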

Any vector b of the reciprocal lattice determines a family of crystal planes represented by 
the equations 

$$\mathbf{r}\cdot\mathbf{b} = 2\pi \times \text{an integer}.$$

These planes are perpendicular to b, and the vectors k and k' corresponding to the 
condition (124.14) make equal angles of incidence and reflection with the planes (Fig. 64). 
For this reason, diffraction at a principal maximum is sometimes spoken of as reflection 
from the corresponding crystal planes. 



Fig. 64 


The total intensity of the diffraction spot near a maximum is obtained by integrating
(124.13) over a solid angle about the direction of $\mathbf{k}'$. Let us determine the total intensity
near a principal maximum. We denote by $\mathbf{k}'_0$ the value of $\mathbf{k}'$ corresponding to Laue’s
equation for a given $\mathbf{k}$: $\mathbf{k}'_0 = \mathbf{k} + \mathbf{b}$, and put also $\boldsymbol{\kappa} = \mathbf{k}' - \mathbf{k}'_0$. Near the maximum, $\kappa$ is
small; since $\mathbf{k}'$ and $\mathbf{k}'_0$ differ only in direction, $\boldsymbol{\kappa}$ is perpendicular to $\mathbf{k}'_0$. The solid angle
element can therefore be written

$$do' = d\kappa_x\,d\kappa_y/k'^2 = d\kappa_x\,d\kappa_y/k^2, \tag{124.17}$$

where the $z$-axis is taken in the direction of $\mathbf{k}'_0$. Thus

$$\sigma = \frac{1}{2}\left(\frac{e^2}{mc^2k}\right)^2(1 + \cos^2\vartheta)|n_{\mathbf{b}}|^2\iint d\kappa_x\,d\kappa_y\left|\int\exp(-i\boldsymbol{\kappa}\cdot\mathbf{r})\,dV\right|^2.$$

In the volume integral we can effect the integration with respect to $z$, since $\exp(-i\boldsymbol{\kappa}\cdot\mathbf{r})$
is independent of $z$: $\int\exp(-i\boldsymbol{\kappa}\cdot\mathbf{r})\,dV = \int Z\exp(-i\boldsymbol{\kappa}\cdot\mathbf{r})\,df$, where $df = dx\,dy$ and
$Z = Z(x, y)$ is the length of the body in the direction of $\mathbf{k}'_0$. Finally, using a well-known
formula in the theory of Fourier integrals:

$$\iint|\phi_{\boldsymbol{\kappa}}|^2\,\frac{d\kappa_x\,d\kappa_y}{(2\pi)^2} = \iint|\phi|^2\,dx\,dy, \tag{124.18}$$

where

$$\phi_{\boldsymbol{\kappa}} = \iint\phi(x, y)\exp(-i\boldsymbol{\kappa}\cdot\mathbf{r})\,dx\,dy$$

are the two-dimensional Fourier components, we obtain

$$\sigma = 2\pi^2\left(\frac{e^2}{mc^2k}\right)^2(1 + \cos^2\vartheta)|n_{\mathbf{b}}|^2\int Z^2\,df = 8\pi^2\left(\frac{e^2}{mc^2b}\right)^2\sin^2\tfrac{1}{2}\vartheta\,(1 + \cos^2\vartheta)|n_{\mathbf{b}}|^2\int Z^2\,df. \tag{124.19}$$

The integral is of the order of $L^4$, where $L$ is the linear dimension of the body. Thus the
total diffraction cross-section, and therefore the total intensity of the spot, are proportional
to $V^{4/3}$, where $V$ is the volume of the body. The maximum intensity, however,
follows a different law. For $\mathbf{k}' - \mathbf{k} = \mathbf{b}$, the integral in (124.13) is just $V$, and so $d\sigma$ is
proportional to $V^2$:

$$\left(\frac{d\sigma}{do'}\right)_{\max} = \frac{1}{2}\left(\frac{e^2}{mc^2}\right)^2(1 + \cos^2\vartheta)|n_{\mathbf{b}}|^2V^2. \tag{124.20}$$

The sharpness of the maximum is shown by the fact that the maximum intensity is
proportional to a higher power of $V$ than the total intensity. The width of the peak is
evidently proportional to $V^{-2/3}$.

The theory given above is valid only if the diffraction effect is small. We now see that this
requirement imposes a certain condition on the size of the crystal: $\sigma$ must be small
compared with the geometrical cross-section of the body ($\sim L^2$), whence

$$\frac{e^2}{mc^2}\,\frac{|n_{\mathbf{b}}|L}{k} \ll 1. \tag{124.21}$$

If this condition is not satisfied, then perturbation theory as used in the derivation of
(124.8) is not valid.†


PROBLEMS 

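Problem 1. Determine the intensity distribution in the diffraction spot round a principal maximum in
diffraction by a crystal in the form of a cuboid with sides $L_x$, $L_y$, $L_z$.

Solution. As above, we use the vector $\boldsymbol{\kappa} = \mathbf{k}' - \mathbf{k}'_0$, and take the axes of coordinates parallel to the sides of
the cuboid, with the origin at its centre.

The integral $\int\exp(-i\boldsymbol{\kappa}\cdot\mathbf{r})\,dV$ becomes a product of three integrals of the form

$$\int_{-L/2}^{L/2}\exp(-i\kappa x)\,dx = \frac{2}{\kappa}\sin\tfrac{1}{2}\kappa L.$$

Thus

$$d\sigma = 32\left(\frac{e^2}{mc^2}\right)^2(1 + \cos^2\vartheta)|n_{\mathbf{b}}|^2\,\frac{\sin^2\tfrac{1}{2}\kappa_xL_x\,\sin^2\tfrac{1}{2}\kappa_yL_y\,\sin^2\tfrac{1}{2}\kappa_zL_z}{\kappa_x^2\kappa_y^2\kappa_z^2}\,do'.$$

The components of the vector $\boldsymbol{\kappa}$ are not independent, being related by the condition $\boldsymbol{\kappa}\cdot\mathbf{k}'_0 = 0$.


† The dynamical theory of scattering, which is not subject to the restriction (124.21), is given by Z. G. Pinsker,
Dynamical Scattering of X-Rays in Crystals, Springer, Berlin, 1978. The theory given above is called the kinematic theory.


Problem 2. The same as Problem 1, but for diffraction by a spherical crystal with radius $a$.

Solution. We again put $\boldsymbol{\kappa} = \mathbf{k}' - \mathbf{k}'_0$, and take the $z$-axis in the direction of $\boldsymbol{\kappa}$, with the origin at the centre
of the sphere. Then

$$\int\exp(-i\kappa z)\,dV = \int_{-a}^{a}\pi(a^2 - z^2)\exp(-i\kappa z)\,dz = \frac{4\pi}{\kappa^3}(\sin\kappa a - \kappa a\cos\kappa a).$$

Thus

$$d\sigma = 8\pi^2\left(\frac{e^2}{mc^2}\right)^2(1 + \cos^2\vartheta)|n_{\mathbf{b}}|^2\,\frac{(\sin\kappa a - \kappa a\cos\kappa a)^2}{\kappa^6}\,do'.$$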

Problem 3. Determine the total intensity of the diffraction spot round a subsidiary maximum.

Solution. In this case the wave vector $\mathbf{k}$ of the incident wave does not satisfy the condition (124.15). As
shown above, (124.15) is the equation of a plane perpendicular to the vector $\mathbf{b}$. Let the small displacement of the
end-point of the vector $\mathbf{k}$ from this plane be $\eta\mathbf{b}$, where $\eta \ll 1$. That is, we put $\mathbf{k} = \mathbf{k}_0 + \eta\mathbf{b}$, where $\mathbf{k}_0$ satisfies
equation (124.15) (Fig. 65).



The maximum intensity in the spot occurs for a direction of $\mathbf{k}'$ for which the difference $\mathbf{k}' - (\mathbf{k} + \mathbf{b})$ has its least
magnitude (so that the integral in (124.13) has its maximum value). The magnitude of the difference of two vectors,
one of which is arbitrary in direction, has its least value when they are parallel. Hence, since $k' = k$, we have

$$|\mathbf{k}' - \mathbf{k} - \mathbf{b}|_{\min} = k - |\mathbf{k} + \mathbf{b}| = \frac{k^2 - (\mathbf{k} + \mathbf{b})^2}{k + |\mathbf{k} + \mathbf{b}|}.$$

Since $\mathbf{k}$ is close to $\mathbf{k}_0$ and we are considering the region near the maximum, $k \approx |\mathbf{k} + \mathbf{b}|$ and the denominator can be
replaced by $2k$. In the numerator, we expand the squared parenthesis and obtain

$$-2\mathbf{k}\cdot\mathbf{b} - b^2 = [-2\mathbf{k}_0\cdot\mathbf{b} - b^2] - 2\eta b^2 = -2\eta b^2.$$

Thus $|\mathbf{k}' - \mathbf{k} - \mathbf{b}|_{\min} \approx -\eta b^2/k$.

Next, we put 


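and take the $z$-axis in the direction of $\mathbf{k} + \mathbf{b}$. This reduces the problem to the calculation of the integral (cf. the
derivation of formula (124.19))

$$\iint d\kappa_x\,d\kappa_y\left|\int\exp\{i\eta b^2z/k - i\boldsymbol{\kappa}\cdot\mathbf{r}\}\,dV\right|^2 = \iint d\kappa_x\,d\kappa_y\left|\int\exp(-i\boldsymbol{\kappa}\cdot\mathbf{r})\,\frac{2k}{\eta b^2}\sin\frac{\eta b^2Z}{2k}\,df\right|^2.$$

Finally, using formula (124.18), we obtain

$$\sigma = 8\pi^2\left(\frac{e^2}{mc^2\eta b^2}\right)^2(1 + \cos^2\vartheta)|n_{\mathbf{b}}|^2\int\sin^2(\tfrac{1}{2}\eta b^2Z/k)\,df.$$

As $\eta \to 0$ this formula becomes (124.19). If $\eta b^2L/k \gg 1$ (which is compatible with $\eta \ll 1$), the squared sine can be
replaced by its mean value $\tfrac{1}{2}$, and we have

$$\sigma = 4\pi^2\left(\frac{e^2}{mc^2\eta b^2}\right)^2(1 + \cos^2\vartheta)|n_{\mathbf{b}}|^2S,$$

where $S$ is the area of the “shadow”, i.e. the projection of the body on the $xy$-plane.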
§125. The integral intensity

The formulae derived in §124 give the diffracted intensity when a strictly plane
monochromatic wave is incident on a crystal. Let us now consider some cases where these
conditions are not fulfilled.

First, let the incident wave be plane but not monochromatic,† its spectral resolution
including waves with wave vectors $\mathbf{k}$ whose directions are the same but whose magnitudes
$k = \omega/c$ are not. Let $\rho(k)$ be the frequency distribution of the incident radiation intensity,
normalized by the condition $\int\rho(k)\,dk = 1$.

The total intensity of the diffraction spot is determined by the cross-section, which is
obtained by multiplying the expression (124.13) by $\rho(k)$ and integrating with respect to $o'$
and $k$:

$$\sigma = \frac{1}{2}\left(\frac{e^2}{mc^2}\right)^2|n_{\mathbf{b}}|^2\iint\left|\int\exp[-i(\mathbf{k}' - \mathbf{k} - \mathbf{b})\cdot\mathbf{r}]\,dV\right|^2(1 + \cos^2\vartheta)\rho(k)\,do'\,dk. \tag{125.1}$$

We put temporarily $\mathbf{K} = \mathbf{k}' - \mathbf{k} - \mathbf{b}$ and write the squared modulus as a double integral:

$$\left|\int\exp(-i\mathbf{K}\cdot\mathbf{r})\,dV\right|^2 = \iint\exp[i\mathbf{K}\cdot(\mathbf{r}_2 - \mathbf{r}_1)]\,dV_1\,dV_2.$$

Using instead of $\mathbf{r}_1$ and $\mathbf{r}_2$ the variables $\tfrac{1}{2}(\mathbf{r}_1 + \mathbf{r}_2)$ and $\mathbf{r} = \mathbf{r}_2 - \mathbf{r}_1$ and integrating with
respect to the first gives $|\int\exp(i\mathbf{K}\cdot\mathbf{r})\,dV|^2 = V\int\exp(i\mathbf{K}\cdot\mathbf{r})\,dV$. In the remaining integral
we can effect the integration over all space,‡ and the result is

$$\left|\int\exp(i\mathbf{K}\cdot\mathbf{r})\,dV\right|^2 = (2\pi)^3V\delta(\mathbf{K}). \tag{125.2}$$

Substituting this result in (125.1), we obtain

$$\sigma = 4\pi^3\left(\frac{e^2}{mc^2}\right)^2|n_{\mathbf{b}}|^2V(1 + \cos^2\vartheta_0)\iint\delta(\mathbf{k}' - \mathbf{k} - \mathbf{b})\rho(k)\,do'\,dk; \tag{125.3}$$

on account of the presence of the delta function, the factor $1 + \cos^2\vartheta$ in the integrand can
be replaced by its value at $\vartheta = \vartheta_0$, where $\vartheta_0$ is the angle between the $\mathbf{k}$ and $\mathbf{k}'$ which satisfy
Laue’s conditions (denoted by $\mathbf{k}_0$ and $\mathbf{k}'_0 = \mathbf{k}_0 + \mathbf{b}$).

The integration with respect to $o'$ can be carried out by noticing that it is equivalent to an
integration with respect to

$$d^3k' = k'^2\,dk'\,do' = \tfrac{1}{2}k'\,d(k'^2)\,do',$$

if an additional factor $(2/k)\delta(k'^2 - k^2)$ is included in the integrand. Thus the integral in
(125.3) becomes

$$\iint\frac{2}{k}\,\delta(\mathbf{k}' - \mathbf{k} - \mathbf{b})\,\delta(k'^2 - k^2)\rho(k)\,d^3k'\,dk.$$

Effecting the integration with respect to $d^3k'$ by means of the first delta function, we can
replace $k'^2$ by $(\mathbf{k} + \mathbf{b})^2$ in the second delta function, and the result is

$$\sigma = 8\pi^3\left(\frac{e^2}{mc^2}\right)^2|n_{\mathbf{b}}|^2V(1 + \cos^2\vartheta_0)\int\frac{1}{k}\,\delta(2\mathbf{b}\cdot\mathbf{k} + b^2)\rho(k)\,dk. \tag{125.4}$$

Finally, we have to carry out the integration over $k$ (the direction $\mathbf{n} = \mathbf{k}/k$ being given).
The argument of the delta function is zero for $k = k_0$, and the integral is $\tfrac{1}{2}\rho(k_0)/k_0|\mathbf{b}\cdot\mathbf{n}|
= \tfrac{1}{2}\rho(k_0)/|\mathbf{b}\cdot\mathbf{k}_0| = \rho(k_0)/b^2$. Thus

$$\sigma = 8\pi^3\left(\frac{e^2}{mc^2b}\right)^2|n_{\mathbf{b}}|^2V(1 + \cos^2\vartheta_0)\rho(k_0). \tag{125.5}$$


† Corresponding to Laue's method in the X-ray analysis of crystals.

‡ This is possible because we require only the total intensity of the diffraction spot, and not its width.


Let us now consider another case, where the incident wave is monochromatic but its 
components have varying directions of $\mathbf k$ which differ by rotation about some axis;† let $\mathbf l$ be 
a unit vector along that axis, and $\psi$ the angle of rotation about it. Let $\rho(\psi)$ be the angular 
distribution of the incident radiation intensity, normalized by the condition 

$$\int \rho(\psi)\,d\psi = 1.$$

The calculations leading to formula (125.4) are valid in this case also, except that the 
integration with $\rho(k)\,dk$ must be replaced by one with $\rho(\psi)\,d\psi$: 

$$\sigma = 4\pi^3\left(\frac{e^2}{mc^2}\right)^2 |n_{\mathbf b}|^2 V(1+\cos^2\vartheta_0)\int \frac{2}{k}\,\delta(2\mathbf b\cdot\mathbf k + b^2)\,\rho(\psi)\,d\psi. \tag{125.6}$$

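We again denote by $\mathbf k_0$ the value of $\mathbf k$ for which the argument of the delta function is zero, 
and measure $\psi$ from the plane of $\mathbf l$ and $\mathbf k_0$. For small $\psi$, $\mathbf k = \mathbf k_0 + (\mathbf l\times\mathbf k_0)\psi$. Then the 
integral in (125.6) becomes 

$$\int \frac{2}{k}\,\delta[2\mathbf b\cdot(\mathbf l\times\mathbf k_0)\psi]\,\rho(\psi)\,d\psi = \frac{\rho(0)}{k|\mathbf b\cdot(\mathbf l\times\mathbf k_0)|} = \frac{\rho(0)}{k^2|\mathbf b\cdot(\mathbf l\times\mathbf n_0)|} = \frac{4\rho(0)\sin^2\tfrac12\vartheta_0}{b^2|\mathbf b\cdot(\mathbf l\times\mathbf n_0)|},$$

where the last step uses $b = 2k\sin\tfrac12\vartheta_0$. Thus 

$$\sigma = 2(2\pi)^3\left(\frac{e^2}{mc^2}\right)^2 \frac{\sin^2\tfrac12\vartheta_0\,(1+\cos^2\vartheta_0)\,|n_{\mathbf b}|^2 V}{b^2}\,\frac{\rho(0)}{|\mathbf b\cdot(\mathbf l\times\mathbf n_0)|}. \tag{125.7}$$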


Finally, let us consider the diffraction of a plane monochromatic wave from a body 
consisting of crystallites arranged at random.‡ 


† Corresponding to Bragg's method (the rotation method) in X-ray analysis. The rotation referred to is that of 
the crystal about $\mathbf l$, not that of the direction of $\mathbf k$. 

‡ Corresponding to Debye and Scherrer's method (the powder method) in X-ray analysis. 

Let $\mathbf k'_0$ and $\mathbf b_0$ be values of $\mathbf k'$ and $\mathbf b$ such that Laue's condition $\mathbf k'_0 = \mathbf k + \mathbf b_0$ is satisfied. 
The directions of $\mathbf k'_0$ and $\mathbf b_0$ are not uniquely determined, since Laue's condition is, of 
course, still fulfilled when the triangle $\mathbf k$, $\mathbf b_0$, $\mathbf k'_0$ is rotated about the direction of $\mathbf k$. Thus the 
principal maximum corresponds to directions of $\mathbf k'$ occupying a conical surface with 
vertical angle $2\vartheta_0$. Instead of a diffraction spot we now have a ring. 

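The required total cross-section is determined by a formula which differs from (125.4) 
only in that the integration with $\rho(k)\,dk$ is replaced by an averaging over the directions of $\mathbf b$: 

$$\sigma = \tfrac12(2\pi)^3 V\left(\frac{e^2}{mc^2}\right)^2 |n_{\mathbf b}|^2 (1+\cos^2\vartheta_0)\int \frac{2}{k}\,\delta(2\mathbf b\cdot\mathbf k + b^2)\,\frac{do_{\mathbf b}}{4\pi}, \tag{125.8}$$

where $do_{\mathbf b}$ is an element of solid angle about the direction of $\mathbf b$. Denoting by $\alpha$ the angle 
between $\mathbf k$ and $\mathbf b$, we can write the integral in (125.8) as 

$$\int \frac{2}{k}\,\delta(2bk\cos\alpha + b^2)\,\frac{2\pi\,d\cos\alpha}{4\pi} = \frac{1}{2bk^2} = \frac{2}{b^3}\sin^2\tfrac12\vartheta_0.$$

Thus 

$$\sigma = (2\pi)^3\left(\frac{e^2}{mc^2}\right)^2 |n_{\mathbf b}|^2 V(1+\cos^2\vartheta_0)\,\frac{\sin^2\tfrac12\vartheta_0}{b^3}. \tag{125.9}$$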

Each of the three cases considered in this section corresponds to a particular method of 
averaging the diffraction pattern. The dependence of the total averaged diffraction 
intensity on the volume of the body reduces, as we should expect, to a simple 
proportionality. In the pattern which is not averaged, the intensity and its distribution over 
the spot depend more markedly on the volume. 
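
The angular average in (125.8) can be checked numerically. The following is a minimal sketch, not part of the original derivation: the delta function is replaced by a narrow normalized Gaussian (an assumption made purely for the numerics), and the function names and the `width` and `n` parameters are illustrative.

```python
import math

def laue_angle(k, b):
    """Scattering angle theta_0 fixed by Laue's condition, b = 2 k sin(theta_0 / 2)."""
    return 2.0 * math.asin(b / (2.0 * k))

def powder_integral(k, b, width=1e-2, n=20001):
    """Approximate the integral of (2/k) delta(2 b k cos(a) + b^2) d(cos a) / 2
    over cos(a) in [-1, 1], with the delta function modelled by a Gaussian
    of standard deviation width * b^2 (a numerical stand-in only)."""
    sigma = width * b * b
    h = 2.0 / (n - 1)
    total = 0.0
    for i in range(n):
        c = -1.0 + i * h
        x = 2.0 * b * k * c + b * b
        g = math.exp(-0.5 * (x / sigma) ** 2) / (sigma * math.sqrt(2.0 * math.pi))
        weight = 0.5 if i in (0, n - 1) else 1.0  # trapezoidal end points
        total += weight * g
    return (2.0 / k) * 0.5 * total * h

def powder_closed_form(k, b):
    """Closed form quoted in the text: (2 / b^3) sin^2(theta_0 / 2)."""
    return 2.0 * math.sin(0.5 * laue_angle(k, b)) ** 2 / b ** 3
```

For any $b < 2k$ the two functions should agree closely, and both equal $1/2bk^2$.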


§126. Diffuse thermal scattering of X-rays 

In §§124 and 125 we have taken $n(x, y, z)$ to be the time-averaged electron density in the 
crystal: various density oscillations were thereby excluded, and consequently so was the 
corresponding (non-coherent) scattering of X-rays. One cause of non-coherent scattering 
is the thermal fluctuations of density. This scattering is diffusely distributed in all 
directions, but it is characterized by a relatively high intensity near directions corresponding 
to the sharp lines of the structural scattering described in the preceding sections. Here 
we shall discuss these maxima of the thermal scattering (W. H. Zachariasen, 1940). 

The thermal oscillations of the crystal lattice can be represented as combinations of 
sound waves. As we shall see, the maxima of the thermal scattering arise from wavelengths 
large compared with the lattice constant. The change in the electron density due to such a 
wave can be regarded, at any point, as due to a simple displacement of the lattice by an 
amount equal to the local value of the displacement vector u in the wave. Thus the change 
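in the density (not averaged with respect to time) when a given sound wave passes can be 
expressed in terms of the mean density by 

$$\delta n = n(\mathbf r - \mathbf u) - n(\mathbf r) \approx -\mathbf u\cdot\partial n/\partial\mathbf r.$$

In considering diffuse scattering near a given line, we must replace $n$ by $n_{\mathbf b}\exp(i\mathbf b\cdot\mathbf r)$ with 
the appropriate $\mathbf b$, so that 

$$\delta n = -i\,\mathbf b\cdot\mathbf u\,n_{\mathbf b}\exp(i\mathbf b\cdot\mathbf r). \tag{126.1}$$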
The scattering by density fluctuations is, of course, not coherent with that by the mean 
density, and the two therefore do not interfere. Hence the cross-section for diffuse 
scattering can be obtained from (124.10), substituting $\delta n$ for $n$ and then carrying out the 
statistical averaging over fluctuations: 

$$d\sigma = \tfrac12\left(\frac{e^2}{mc^2}\right)^2 |n_{\mathbf b}|^2 (1+\cos^2\vartheta)\left\langle\left|\int \mathbf b\cdot\mathbf u\,\exp(-i\mathbf K\cdot\mathbf r)\,dV\right|^2\right\rangle do', \tag{126.2}$$

where $\mathbf K = \mathbf k' - \mathbf k - \mathbf b$. The scattered intensity is large for directions where $K \ll b$. 

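The integral $\int\mathbf u\exp(-i\mathbf K\cdot\mathbf r)\,dV$ gives the Fourier space component of $\mathbf u$ whose wave 
vector is $\mathbf K$, and we can therefore take $\mathbf u$ to be simply the displacement vector in a sound 
wave having this wave vector. The inequality $K \ll b$ thus implies that the wavelength 
of the scattering sound wave is large compared with the dimension of the crystal lattice cell. 

Thus we can put 

$$\mathbf u = \tfrac12[\mathbf u_0\exp(i\mathbf K\cdot\mathbf r) + \mathbf u_0^*\exp(-i\mathbf K\cdot\mathbf r)], \tag{126.3}$$

so that $\int(\mathbf b\cdot\mathbf u)\exp(-i\mathbf K\cdot\mathbf r)\,dV = \tfrac12 V\,\mathbf b\cdot\mathbf u_0$ and the cross-section is 

$$d\sigma = \tfrac18\left(\frac{e^2}{mc^2}\right)^2 |n_{\mathbf b}|^2 (1+\cos^2\vartheta)\,b_i b_k\langle u_{0i}u_{0k}^*\rangle V^2\,do'. \tag{126.4}$$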

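The products of the components of $\mathbf u_0$ are averaged as in §123 for a sound wave in an 
isotropic medium. The elastic energy per unit volume of a deformed crystal is $\tfrac12\lambda_{iklm}u_{ik}u_{lm}$, 
where $u_{ik}$ is the strain tensor and $\lambda_{iklm}$ the elastic modulus tensor (see TE, §10). Hence the 
mean elastic energy of the whole crystal is $\tfrac12 V\lambda_{iklm}\langle u_{ik}u_{lm}\rangle$. We substitute 

$$u_{ik} = \frac12\left(\frac{\partial u_i}{\partial x_k} + \frac{\partial u_k}{\partial x_i}\right) = \tfrac12\,\mathrm{re}\{(iK_k u_{0i} + iK_i u_{0k})\exp(i\mathbf K\cdot\mathbf r)\}.$$

The terms containing $\exp(\pm 2i\mathbf K\cdot\mathbf r)$ give zero on averaging. Using also the symmetry of the 
tensor $\lambda_{iklm}$ with respect to interchange of $i$, $k$, or $l$, $m$, or $i$, $k$ and $l$, $m$, we obtain 
$\tfrac14 V\lambda_{iklm}K_k K_m\langle u_{0i}u_{0l}^*\rangle$ or $\tfrac14 V g_{ik}\langle u_{0i}u_{0k}^*\rangle$, where 

$$g_{ik} = \lambda_{ilkm}K_l K_m. \tag{126.5}$$

According to the general theory of thermodynamic fluctuations, we can at once write 
down the required mean values:† 

$$\langle u_{0i}u_{0k}^*\rangle = (4T/V)\,g^{-1}{}_{ik}, \tag{126.6}$$

where $g^{-1}{}_{ik}$ is the tensor inverse to $g_{ik}$, and the scattering cross-section is 

$$d\sigma = \tfrac12\left(\frac{e^2}{mc^2}\right)^2 TV|n_{\mathbf b}|^2 (1+\cos^2\vartheta)\,b_i b_k\,g^{-1}{}_{ik}\,do'. \tag{126.7}$$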

Thus the diffusely scattered intensity is, as we should expect, proportional to the volume 
of the crystal. A characteristic feature of this scattering is the way in which its intensity is 
distributed over the area of the spot. Apart from the factor $1+\cos^2\vartheta$, which is almost 
constant for a given spot, the intensity is given by the expression $b_i b_k\,g^{-1}{}_{ik}$. This expression 
is the product of $1/K^2$ and a fairly involved function of the direction of the vector $\mathbf K$ with 
respect to the crystal axes. For scattering near a principal maximum the diffusely scattered 
intensity is itself a maximum where $K = 0$ (the expression (126.7) becomes infinite for 
$K = 0$ and is, of course, invalid). If the condition (124.15) $2\mathbf b\cdot\mathbf k = -b^2$ is not satisfied, 
however, $K$ cannot be zero, and the maximum of the diffusely scattered intensity lies at 
some $K$ different from zero, which in general does not coincide with the maximum of the 
structural scattering. In either case the diffuse scattering forms a background whose 
intensity falls off essentially as $1/K^2$, that is, considerably more slowly than the intensity in 
the sharp structural-scattering line superposed upon it. 

† See SP 1, §111. If the probability distribution for fluctuating quantities $x_1, x_2, \ldots$ is of the form 
$\exp(-\tfrac12\lambda_{ij}x_i x_j)$, then $\langle x_i x_j\rangle = \lambda^{-1}{}_{ij}$. A factor 2 in (126.6) appears because each of the complex $u_{0i}$ involves two 
independent quantities. 
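
The $1/K^2$ behaviour of $b_i b_k\,g^{-1}{}_{ik}$ can be illustrated with a small numerical sketch. It assumes, purely for illustration, an isotropic elastic modulus tensor $\lambda_{iklm} = \lambda\delta_{ik}\delta_{lm} + \mu(\delta_{il}\delta_{km} + \delta_{im}\delta_{kl})$, for which (126.5) gives $g_{ik} = \mu K^2\delta_{ik} + (\lambda+\mu)K_i K_k$; a real crystal has a more general tensor. The function names and the Lamé-constant arguments `lam` and `mu` are hypothetical.

```python
def g_tensor(K, lam, mu):
    """g_ik = lambda_ilkm K_l K_m for an isotropic medium (assumption of this sketch)."""
    K2 = sum(c * c for c in K)
    return [[mu * K2 * (1.0 if i == j else 0.0) + (lam + mu) * K[i] * K[j]
             for j in range(3)] for i in range(3)]

def det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def solve3(m, v):
    """Solve m x = v by Cramer's rule."""
    d = det3(m)
    x = []
    for col in range(3):
        replaced = [[v[r] if c == col else m[r][c] for c in range(3)] for r in range(3)]
        x.append(det3(replaced) / d)
    return x

def diffuse_factor(b, K, lam, mu):
    """b_i b_k (g^-1)_ik, the K-dependent factor in (126.7)."""
    x = solve3(g_tensor(K, lam, mu), b)
    return sum(b[i] * x[i] for i in range(3))
```

Doubling $\mathbf K$ at fixed direction should reduce `diffuse_factor` by a factor of four, while changing the direction of $\mathbf K$ relative to $\mathbf b$ changes the angular function multiplying $1/K^2$.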

§127. The temperature dependence of the diffraction cross-section 

Let us now calculate the temperature dependence of the cross-section for coherent 
scattering of X-rays. This amounts to determining the temperature dependence of the 
microscopic electron density in the crystal averaged with allowance for the thermal motion 
of the atoms. 

We shall assume the atoms to be so heavy that the majority of their electrons are 
localized in non-overlapping shells which are only slightly deformed by the lattice 
vibrations. We shall also assume that the lattice consists of atoms of only one kind, with 
one atom in each unit cell; it must be emphasized that this assumption has no fundamental 
significance, and is made only to simplify the writing of the formulae. 

The exact (not averaged) microscopic electron density may then be expressed as 

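$$n(\mathbf r) = \sum_{\mathbf n} F(\mathbf r - \mathbf r_{\mathbf n}) = \sum_{\mathbf n} F(\mathbf r - \mathbf r_{\mathbf n 0} - \mathbf u_{\mathbf n}), \tag{127.1}$$

where $F(\mathbf r)$ is the electron density in a single atom (the atomic form factor); the summation is 
taken over all the atoms in the lattice, and $\mathbf r_{\mathbf n}$ are the position vectors of the nuclei, labelled 
by the vector suffix $\mathbf n$ (with integral components). Denoting the equilibrium position 
vectors of the nuclei (at the lattice sites) by $\mathbf r_{\mathbf n 0}$ and the displacement vectors of the atoms 
from these positions by $\mathbf u_{\mathbf n}$, we have $\mathbf r_{\mathbf n} = \mathbf r_{\mathbf n 0} + \mathbf u_{\mathbf n}$, used in the second equation (127.1). 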

Expanding the density (127.1) as a Fourier series (124.12) in the volume $V$ of the lattice, 
we can write the expansion coefficients as 

$$n_{\mathbf b} = \frac{1}{V}\sum_{\mathbf n}\exp[-i(\mathbf r_{\mathbf n 0} + \mathbf u_{\mathbf n})\cdot\mathbf b]\,F_{\mathbf b},$$

where 

$$F_{\mathbf b} = \int F(\mathbf r)\exp(-i\mathbf b\cdot\mathbf r)\,dV \tag{127.2}$$

are the Fourier components of the atomic form factor. All the products $\mathbf r_{\mathbf n 0}\cdot\mathbf b$ are integral 
multiples of $2\pi$. Hence all the factors $\exp(-i\mathbf r_{\mathbf n 0}\cdot\mathbf b) = 1$, and so 

$$n_{\mathbf b} = (F_{\mathbf b}/V)\sum_{\mathbf n}\exp(-i\mathbf u_{\mathbf n}\cdot\mathbf b). \tag{127.3}$$

This expression can now be averaged over the motion of the atoms. It is evident that the 
mean values of the terms in the sum do not depend on $\mathbf n$. Hence 

$$\langle n_{\mathbf b}\rangle = (F_{\mathbf b}/v)\,\langle\exp(-i\mathbf b\cdot\mathbf u)\rangle, \tag{127.4}$$

where $\mathbf u$ is the displacement vector of any atom, and $v = V/N$ is the unit cell volume 
($N$ being the number of cells in the volume $V$). The averaging in (127.4) is to be taken as a 
complete statistical averaging, i.e. over the wave functions of the stationary states and then 
over the Gibbs distribution. 

To carry out this averaging, we have to regard $\mathbf u$ as a quantum-mechanical operator 

$$\hat{\mathbf u} = \sum_{\mathbf k\alpha}\left(\frac{\hbar}{2MN\omega_\alpha(\mathbf k)}\right)^{1/2}\left(\hat c_{\mathbf k\alpha}\,\mathbf e^{(\alpha)} + \hat c^{+}_{\mathbf k\alpha}\,\mathbf e^{(\alpha)*}\right); \tag{127.5}$$

see SP 1, §72. The summation is taken over all values of the phonon wave vector $\mathbf k$ (in the 
volume $V$) and over the independent phonon polarizations labelled by $\alpha = 1, 2, 3$; $\omega_\alpha(\mathbf k)$ 
are the phonon frequencies, $\mathbf e^{(\alpha)}$ the phonon polarization vectors, and $M$ the atomic mass; 
$\hat c_{\mathbf k\alpha}$ and $\hat c^{+}_{\mathbf k\alpha}$ are the annihilation and creation operators for phonons in the states $\mathbf k$, $\alpha$. 
For operators having the form (127.5), Wick's theorem holds: the average product of 
any even number of operators is equal to the sum of all possible products of pair means 
(the average product of an odd number of operators is zero).† An essential result here is 

$$\langle\exp\hat L\rangle = \exp(\tfrac12\langle\hat L^2\rangle), \tag{127.6}$$

which is valid for any operator $\hat L$ satisfying Wick's theorem; its correctness is easily seen by 
expanding $\exp\hat L$ in series and averaging each term in the expansion.‡ 
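
The series argument behind (127.6) can be verified numerically. The sketch below is illustrative only and its function names are not from the text: it sums $\langle L^{2n}\rangle/(2n)!$ using the pairing count $(2n-1)(2n-3)\cdots 1$ given in the footnote and compares the result with $\exp(\tfrac12\langle L^2\rangle)$.

```python
import math

def pairing_count(n):
    """(2n-1)(2n-3)...1: number of ways to split 2n factors into pairs."""
    result = 1
    for j in range(1, 2 * n, 2):
        result *= j
    return result

def exp_average_from_pairings(mean_square, terms=40):
    """Sum over n of <L^(2n)> / (2n)!, with <L^(2n)> = pairing_count(n) * <L^2>^n.
    Odd moments vanish by Wick's theorem, so only even powers contribute."""
    return sum(pairing_count(n) * mean_square ** n / math.factorial(2 * n)
               for n in range(terms))
```

For the Debye-Waller case $L = -i\mathbf b\cdot\mathbf u$, so `mean_square` is the negative number $-\langle(\mathbf b\cdot\mathbf u)^2\rangle$, and the sum reproduces the decaying exponential.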

Applying (127.6) to (127.4), we obtain 

$$\langle n_{\mathbf b}\rangle = (F_{\mathbf b}/v)\exp\{-\tfrac12\langle(\mathbf b\cdot\mathbf u)^2\rangle\}.$$

The diffraction cross-section is proportional to the square of this quantity, and its 
temperature dependence is therefore given by the separate factor 

$$D = \exp\{-\langle(\mathbf b\cdot\mathbf u)^2\rangle\}, \tag{127.7}$$

called the Debye-Waller factor (P. Debye, 1912; I. Waller, 1925). 

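It remains to calculate the mean square $\langle(\mathbf b\cdot\mathbf u)^2\rangle$. Of all the products of pairs of 
operators the only non-zero mean values are 

$$\langle\hat c_{\mathbf k\alpha}\hat c^{+}_{\mathbf k\alpha}\rangle = \langle\hat c^{+}_{\mathbf k\alpha}\hat c_{\mathbf k\alpha}\rangle + 1 = N_{\mathbf k\alpha} + 1,$$

where $N_{\mathbf k\alpha}$ are the mean phonon state occupation numbers in equilibrium. Hence 

$$\langle(\mathbf b\cdot\mathbf u)^2\rangle = \sum_{\mathbf k\alpha}\frac{\hbar\,|\mathbf b\cdot\mathbf e^{(\alpha)}|^2}{2MN\omega_\alpha(\mathbf k)}\,(2N_{\mathbf k\alpha} + 1).$$

The numbers $N_{\mathbf k\alpha}$ are given by the Bose distribution 

$$N_{\mathbf k\alpha} = [\exp\{\hbar\omega_\alpha(\mathbf k)/T\} - 1]^{-1}. \tag{127.8}$$

† The theorem is proved in SP 2, §13, for the "macroscopic limit" ($N \to \infty$), corresponding to the application 
in statistical physics. 

‡ The product of an even number $2n$ of factors $L$ can be resolved into pairs in $(2n-1)(2n-3)\cdots 1$ ways 
(having chosen one of the $2n$ factors, we can "pair" it with any of the other $2n-1$ factors; then choosing one of the 
remaining $2n-2$ operators, we can pair it in $2n-3$ ways and so on). Hence 

$$\langle L^{2n}\rangle = [(2n-1)(2n-3)\cdots 1]\,\langle L^2\rangle^n.$$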
On substitution in (127.7), the term involving the zero-point vibrations gives a factor 
independent of temperature, which may be omitted (or rather, included in the definition of 
$F_{\mathbf b}$). Finally, changing from summation over $\mathbf k$ to integration over $V\,d^3k/(2\pi)^3$, we have 

$$D = \exp\left\{-\frac{\hbar v}{M}\sum_\alpha\int\frac{|\mathbf b\cdot\mathbf e^{(\alpha)}|^2 N_{\mathbf k\alpha}}{\omega_\alpha(\mathbf k)}\,\frac{d^3k}{(2\pi)^3}\right\}. \tag{127.9}$$

As $T \to 0$, $N_{\mathbf k\alpha}$ tends to zero, and accordingly $D$ tends to unity; as the temperature 
increases, $D$ becomes smaller. The effect of the temperature amounts to a general lowering 
of the scattered line intensity, with no change in the shape or width of the line. 

The mean thermal displacement of the atoms from the lattice sites is usually small, even 
at high temperatures, in comparison with the lattice constant. The exponent in D, for 
scattering lines with only slight changes in b, is then much less than unity, and the 
temperature effect is a small correction. The decrease in intensity becomes considerable, 
however, for scattering lines which correspond to large b. 

For temperatures much higher than the Debye temperature, $N_{\mathbf k\alpha} \propto T$, and the exponent 
in $D$ is also proportional to $T$. At low temperatures, the thermal phonons belong mainly to 
the acoustic branches of the spectrum, for which $\omega \propto k$; in this case, the integration over $\omega$ 
in (127.9) can be extended to infinity, and the exponent is then proportional to $T^2$. 
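
The two limiting temperature laws can be seen in a minimal numerical sketch. It assumes a Debye-type model that is not spelled out in the text: a single isotropic acoustic branch $\omega = uk$ with a cutoff, so that the exponent in (127.9) is proportional to $\int_0^1 x\,dx/(e^{x/t}-1)$, where $x$ is the wave number in units of the cutoff and $t$ the temperature in units of the Debye temperature. The function name and parameters are illustrative.

```python
import math

def thermal_exponent(t, n=20000):
    """Dimensionless thermal part of the Debye-Waller exponent,
    integral from 0 to 1 of x / (exp(x / t) - 1) dx, by the midpoint rule.
    Assumes one acoustic branch with omega proportional to k (sketch only)."""
    h = 1.0 / n
    total = 0.0
    for i in range(n):
        x = (i + 0.5) * h
        y = x / t
        if y > 700.0:
            continue  # occupation number is negligibly small; avoids overflow
        total += x / math.expm1(y)
    return total * h

# Ratios when the temperature is doubled:
low_ratio = thermal_exponent(0.02) / thermal_exponent(0.01)    # close to 4, i.e. T^2
high_ratio = thermal_exponent(100.0) / thermal_exponent(50.0)  # close to 2, i.e. T
```

At low `t` the upper limit is effectively infinite and the integral scales as $t^2$; at high `t` the integrand tends to the constant $t$, which gives the linear law.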

At low temperatures, when only the long-wavelength phonons are important, the 
validity of formulae (127.7)-(127.9) does not depend on the possibility of expressing the 
density n(r) as the sum over atoms (127.1). In long-wavelength vibrations, large regions of 
the lattice move as a whole with their respective electron density values, and only on this 
does the derivation of the formulae in question depend. 
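CURVILINEAR COORDINATES 


We give below, for reference, certain formulae relating to vector operations in curvilinear 
coordinates, both general and particular. 

In an arbitrary system of orthogonal curvilinear coordinates $u_1$, $u_2$, $u_3$, the squared 
element of length is $dl^2 = h_1^2\,du_1^2 + h_2^2\,du_2^2 + h_3^2\,du_3^2$, where the $h_i$ are functions of the 
coordinates. The element of volume is 

$$dV = h_1 h_2 h_3\,du_1\,du_2\,du_3.$$

The various vector operations can be expressed in terms of the functions $h_i$ as follows. 
For vector operations on a scalar: 

$$(\mathrm{grad}\,f)_i = \frac{1}{h_i}\frac{\partial f}{\partial u_i}, \qquad \triangle f = \frac{1}{h_1 h_2 h_3}\sum\frac{\partial}{\partial u_1}\left(\frac{h_2 h_3}{h_1}\frac{\partial f}{\partial u_1}\right),$$

where the summation is over cyclic interchanges of the suffixes 1, 2, 3. For vector 
operations on a vector: 

$$\mathrm{div}\,\mathbf A = \frac{1}{h_1 h_2 h_3}\sum\frac{\partial}{\partial u_1}(h_2 h_3 A_1), \qquad (\mathrm{curl}\,\mathbf A)_1 = \frac{1}{h_2 h_3}\left[\frac{\partial}{\partial u_2}(h_3 A_3) - \frac{\partial}{\partial u_3}(h_2 A_2)\right].$$

The remaining components of curl A are obtained by cyclic interchanges of the suffixes. 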
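Cylindrical coordinates $r$, $\phi$, $z$. 

Element of length: $dl^2 = dr^2 + r^2\,d\phi^2 + dz^2$; 

$h_r = 1$, $h_\phi = r$, $h_z = 1$. 

Vector operations: 

$$(\mathrm{grad}\,f)_r = \frac{\partial f}{\partial r}, \quad (\mathrm{grad}\,f)_\phi = \frac{1}{r}\frac{\partial f}{\partial\phi}, \quad (\mathrm{grad}\,f)_z = \frac{\partial f}{\partial z},$$

$$\triangle f = \frac{1}{r}\frac{\partial}{\partial r}\left(r\frac{\partial f}{\partial r}\right) + \frac{1}{r^2}\frac{\partial^2 f}{\partial\phi^2} + \frac{\partial^2 f}{\partial z^2},$$

$$\mathrm{div}\,\mathbf A = \frac{1}{r}\frac{\partial}{\partial r}(rA_r) + \frac{1}{r}\frac{\partial A_\phi}{\partial\phi} + \frac{\partial A_z}{\partial z},$$

$$(\mathrm{curl}\,\mathbf A)_r = \frac{1}{r}\frac{\partial A_z}{\partial\phi} - \frac{\partial A_\phi}{\partial z},$$

$$(\mathrm{curl}\,\mathbf A)_\phi = \frac{\partial A_r}{\partial z} - \frac{\partial A_z}{\partial r},$$

$$(\mathrm{curl}\,\mathbf A)_z = \frac{1}{r}\frac{\partial}{\partial r}(rA_\phi) - \frac{1}{r}\frac{\partial A_r}{\partial\phi},$$

$$(\triangle\mathbf A)_r = \triangle A_r - \frac{A_r}{r^2} - \frac{2}{r^2}\frac{\partial A_\phi}{\partial\phi},$$

$$(\triangle\mathbf A)_\phi = \triangle A_\phi - \frac{A_\phi}{r^2} + \frac{2}{r^2}\frac{\partial A_r}{\partial\phi},$$

$$(\triangle\mathbf A)_z = \triangle A_z.$$

In the expressions for the components of $\triangle\mathbf A$, $\triangle A_i$ signifies the result of the operator $\triangle$ 
acting on $A_i$ regarded as a scalar. 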

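Spherical coordinates $r$, $\theta$, $\phi$. 

Element of length: $dl^2 = dr^2 + r^2\,d\theta^2 + r^2\sin^2\theta\,d\phi^2$; 

$h_r = 1$, $h_\theta = r$, $h_\phi = r\sin\theta$. 

Vector operations: 

$$(\mathrm{grad}\,f)_r = \frac{\partial f}{\partial r}, \quad (\mathrm{grad}\,f)_\theta = \frac{1}{r}\frac{\partial f}{\partial\theta}, \quad (\mathrm{grad}\,f)_\phi = \frac{1}{r\sin\theta}\frac{\partial f}{\partial\phi},$$

$$\triangle f = \frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\frac{\partial f}{\partial r}\right) + \frac{1}{r^2\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial f}{\partial\theta}\right) + \frac{1}{r^2\sin^2\theta}\frac{\partial^2 f}{\partial\phi^2},$$

$$\mathrm{div}\,\mathbf A = \frac{1}{r^2}\frac{\partial}{\partial r}(r^2 A_r) + \frac{1}{r\sin\theta}\frac{\partial}{\partial\theta}(\sin\theta\,A_\theta) + \frac{1}{r\sin\theta}\frac{\partial A_\phi}{\partial\phi},$$

$$(\mathrm{curl}\,\mathbf A)_r = \frac{1}{r\sin\theta}\left[\frac{\partial}{\partial\theta}(\sin\theta\,A_\phi) - \frac{\partial A_\theta}{\partial\phi}\right],$$

$$(\mathrm{curl}\,\mathbf A)_\theta = \frac{1}{r\sin\theta}\frac{\partial A_r}{\partial\phi} - \frac{1}{r}\frac{\partial}{\partial r}(rA_\phi),$$

$$(\mathrm{curl}\,\mathbf A)_\phi = \frac{1}{r}\left[\frac{\partial}{\partial r}(rA_\theta) - \frac{\partial A_r}{\partial\theta}\right],$$

$$(\triangle\mathbf A)_r = \triangle A_r - \frac{2}{r^2}\left[A_r + \frac{1}{\sin\theta}\frac{\partial}{\partial\theta}(\sin\theta\,A_\theta) + \frac{1}{\sin\theta}\frac{\partial A_\phi}{\partial\phi}\right],$$

$$(\triangle\mathbf A)_\theta = \triangle A_\theta + \frac{2}{r^2}\left[\frac{\partial A_r}{\partial\theta} - \frac{A_\theta}{2\sin^2\theta} - \frac{\cos\theta}{\sin^2\theta}\frac{\partial A_\phi}{\partial\phi}\right],$$

$$(\triangle\mathbf A)_\phi = \triangle A_\phi + \frac{2}{r^2\sin\theta}\left[\frac{\partial A_r}{\partial\phi} + \cot\theta\,\frac{\partial A_\theta}{\partial\phi} - \frac{A_\phi}{2\sin\theta}\right].$$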
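
The scalar Laplacian in spherical coordinates can be spot-checked against its Cartesian form. The following is a rough finite-difference sketch, not a general-purpose routine: the step size `h`, the function names, and the polynomial test field $f = x^2 y + z^3$ (for which $\triangle f = 2y + 6z$) are all choices made here for illustration.

```python
import math

def to_cartesian(r, theta, phi):
    """Spherical (r, theta, phi) to Cartesian (x, y, z)."""
    return (r * math.sin(theta) * math.cos(phi),
            r * math.sin(theta) * math.sin(phi),
            r * math.cos(theta))

def laplacian_spherical(f, r, theta, phi, h=1e-3):
    """Central-difference estimate of the three terms of the spherical Laplacian:
    (1/r^2) d/dr (r^2 df/dr)
    + (1/(r^2 sin(theta))) d/dtheta (sin(theta) df/dtheta)
    + (1/(r^2 sin^2(theta))) d^2 f/dphi^2."""
    f0 = f(r, theta, phi)
    radial = ((r + h / 2) ** 2 * (f(r + h, theta, phi) - f0)
              - (r - h / 2) ** 2 * (f0 - f(r - h, theta, phi))) / (h * h * r * r)
    polar = (math.sin(theta + h / 2) * (f(r, theta + h, phi) - f0)
             - math.sin(theta - h / 2) * (f0 - f(r, theta - h, phi))
             ) / (h * h * r * r * math.sin(theta))
    azimuthal = (f(r, theta, phi + h) - 2.0 * f0 + f(r, theta, phi - h)
                 ) / (h * h * (r * math.sin(theta)) ** 2)
    return radial + polar + azimuthal

def sample_field(r, theta, phi):
    """Test field f = x^2 y + z^3 written in spherical coordinates."""
    x, y, z = to_cartesian(r, theta, phi)
    return x * x * y + z ** 3
```

Evaluating `laplacian_spherical(sample_field, r, theta, phi)` away from the polar axis should agree with $2y + 6z$ at the same point to within the discretization error.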
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113, 126, 130, 135 
Magnetostriction 144-7 
linear 176 

Mandel’shtam-Brillouin doublet 430 
Manley-Rowe theorem 373 
Maxwell effect 356 
Maxwell stress tensor 29 
Maxwellian relaxation time 432 
Maxwell’s equations 2, 105, 333, 399 
Mechanical-optical effects 355-7 
Meridional distribution 234 
Micromagnetism 157n. 

Momentum density 260 
Mutual inductance 119, 210 


Magnetic anisotropy energy 136n., 138-40 
Magnetic axis 235 
Magnetic Bravais lattice 135 
Magnetic crystal classes 131, 132-4 
Magnetic field 106 
boundary conditions 107, 200-1 
conducting fluid moving in 225-56 
conductor moving in 217-22 
forces on matter in 126-9 
static (IV) 105ff. 
of steady current 107-13 
thermodynamics of 113-17 
see also Electromagnetic field 
Magnetic flux 120 
Magnetic induction 105 


Natural optical activity 362-6 
of crystals 364-5 
Navier-Stokes equation 225, 228 
Nematic crystal 70 
Nernst effect 101 
Non-linear optics (XIII) 372ff. 
Non-local relation 358 
Non-uniformity energy 147 
Notation xii-xiii 


Ohm’s law 86, 213 
Optical activity, natural 362-6 
of crystals 364-5 






Optical activity vector 348 
Optical axis 339, 343 
circular 346 
singular 346 
Optical frequencies 265 
Optical ray axis 343 
Ordinary waves 340 
Orientational transitions 159-62 
Oscillator strength 282 


Parametric amplification 388 
Peltier coefficient 99 
Peltier effect 99 
Penetration depth 201-10, 301 
in a superconductor 180, 301 
Permeability see Magnetic permeability 
Permittivity 

analytical properties of 279-83 
of crystals 331-3 
dispersion of 1(A-1 
electrostatic 36 
at high frequencies 267-8 
longitudinal 361 
at low frequencies 267 
of a mixture 42-4 
non-linear 374-8 
tensor 54, 64, 347, 422 
transverse 361 
Phase velocity 236, 291 
Piezoelectric tensor 68, 70-2 
Piezoelectricity 55n., 65n., 67-74 
Piezomagnetism 176, 177 
Pinch 233 

Polariton range 369n. 

Polarizability tensor 7, 204 
Polarization 34 
coefficient 36 

of a dielectric in a variable field 266-7 
of electromagnetic waves in an anisotropic 
medium 337-8 
in geometrical optics 297 
in a gyrotropic medium 353 
total 297 

in a uniaxial crystal 340 
Ponderomotive forces 59 
Potential 
contact 93 
electric 

complex 11 
scalar 2-3, 399 
vector 11, 108, 120, 399 
Powder method 446n. 

Poynting vector see Energy flux 
Principal dielectric axes 334 
Principal maxima 441 
Principal section 340 
Principal waves in waveguides 316 
damping of 317 
Pump wave 393 
Pyroelectricity 55-6 


Q factor of a resonator 311 
Quadrupole moment tensor 25 
of conducting ellipsoid 25-6 
Quality of a resonator 311 
Quasi-static fields (VII) 199ff. 


Raman-Landsberg-Mandel’shtam effect 428 
Raman scattering 428 
stimulated 391-3 
Ray surface 335, 339-40, 343-4 
Ray vector 335 
Rayleigh scattering 428 
Reactance 211n. 

Reciprocity theorem 309 
Reflection coefficient 295-300, 303-4 
Reflection point 305 
Refractive index 285, 290, 334n., 340 
Relativistic interactions 136, 178 
Resistance 211n. 

complex 210-14 
Resonators 310-13 
quality of 311 

Reynolds number, magnetic 229 
Rotation method 446n. 


Scale invariance 163, 172 
Scattering 413 

in amorphous solids 436-8 
anti-Stokes 413 
antisymmetric 417 
cross-section for 319 
of electromagnetic waves (XV) 413ff. 
by small particles 319-22 
spontaneous 421 
in liquid crystals 435-6 
plane 320 

principle of detailed balancing for 419-22 
Raman see Raman scattering 
Rayleigh, in gases and liquids 428-33 
scalar 417 

with small change in frequency 422-8 
spontaneous 421 
Stokes 413 
symmetric 417 

Second-harmonic generation 383-8 
Self-channelling 381 
Self-focusing 378-83 
Self-inductance 118 

of linear conductors 121-6 
of superconductors 183-5 
Shock waves in a magnetic fluid 245-53 
electromagnetic 389 
evolutionary 247 
fast 250 
parallel 251 
perpendicular 247 
slow 250 
switch-off 252 
switch-on 252 

Single-domain particles 157-9 


Skin effect 122n., 208-10 
Solenoid 122, 125-6 
Solids, forces in 64-7 
Spatial dispersion (XII) 358ff. 

Sphere 

conducting 9-10, 14, 32, 57, 88-9, 205-6, 207, 
221 

dielectric 39-40, 42, 56, 57, 67, 263-4 
Spheroidal coordinates 
oblate 20 
prolate 20 
Spin flop 169 
Stewart-Tolman effect 223 
Stimulated emission 413n. 

Stimulated Raman scattering 391-3, 421-2 
Stopping power 396 
Stress tensor 60, 64, 65, 276-8 
Strong electromagnetic waves 388-91 
Sum rule 282 

Superconductivity (VI) 180ff. 
current 182-5 
destruction of 185 
transition point 180 
Superconductors 180ff. 
critical field in 185-9 
currents in 182-5 
disc-shaped 185 
ellipsoidal 193 
impedance of 301-2 
intermediate state of 189-98 
magnetic properties of 180-2 
multiply connected 183-5 
rotating 224 
self-inductance of 183-5 
thermodynamics of 185-9 
types of 180n. 

Susceptibility see Dielectric susceptibility; Magnetic 
susceptibility 

Synchronism condition 384 


Telegraph equation 318 
Tensor ellipsoid 55 
Thermodynamic inequalities 74-7 
Thermoelectric phenomena 97 
Thermoelectromotive force 100 
Thermogalvanomagnetic phenomena 101 
Theta pinch 234 
Thomson coefficient 99 
Thomson effect 99 


Thomson’s formula 215 
Thomson’s relations 100 
Thomson’s theorem 6 
Total polarization, angle of 297 
Total reflection 298 
angle of 297 

Transition radiation 408-12 
Transparency ranges 274 
Transparent media 286-9 
Transverse waves 314n. 

Turbulence in conducting fluid 253 
Turbulent dynamo 253-6 
Two-dimensional field 11 
Two-photon absorption 393 


Uniaxial crystals 55, 339-41 
negative 339 
positive 339 
Unipolar induction 220 


Vector potential 11, 108, 120, 399 
Velocity of light in a moving medium 292-3 


Waveguides 313 
Waves 

on a charged liquid surface 33 
electric-type see E waves 
electromagnetic see Electromagnetic waves 
extraordinary 340 

hydromagnetic see Hydromagnetic waves 
magnetic-type see H waves 
ordinary 340 
principal 316 
scattered 413 
shock see Shock waves 
Wave-vector surface 335, 339, 341-2, 350 
Wedge, diffraction by 323-7 
Wedge problem 12-13 
Work function 92 
Work potential 92 


X-ray diffraction in crystals see under Diffraction 
X-rays, diffuse thermal scattering of 447-9 

Zemplen’s theorem 246 
z pinch 233 
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